Senior Cloud Data Engineer
Space Telescope Science Institute7 months ago
Baltimore, Maryland, United States
Hybrid
Full-time
Junior Level (1-3 years)
Job Description
Position Overview
Start your new year off with an amazing career at STScI! The Space Telescope Science Institute (STScI) is a multi-mission science and flight operations center for NASA’s flagship observatories on the Johns Hopkins University Homewood campus in Baltimore, Maryland. This role as a Senior Data Cloud Engineer in the Catalog Science Branch plays a key role in designing, building, and managing data infrastructure and processing for the state-of-the-art astronomical archive, MAST. This position supports hybrid work and requires candidates to reside in or be willing to relocate to our local market (MD, DE, VA, PA, DC & WV). US Citizenship or Permanent Residence is required to meet ITAR requirements. Compensation: Salary range - $130,000 - $150,000
Key Responsibilities
- Design, construct, install, test, and maintain robust, highly scalable, flexible, and cost-effective cloud-based data management systems including data store, backup, integration, governance, recovery, and retrieval.
- Develop and maintain scalable data ingestion and migration pipelines to support increasing mission data volume and complexity.
- Build and support cloud-based scientific catalog data exploration tools and services.
- Design, install, deploy, and support hybrid environments combining legacy and active mission astronomical data.
Required Qualifications
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related field with 8+ years of industry experience.
- Strong SQL programming skills with experience in cloud SQL databases (e.g., Amazon RDS) and extensive knowledge of ETL/ELT processes and cloud data Lakehouse solutions.
- Proficiency in utilizing big data tools (e.g., Spark), data modeling, data architecture, and data governance practices.
- Hands-on experience with cloud platforms, especially Amazon Web Services (AWS), including cloud data storage, access strategies, and programming in Python along with containerization tools like Docker and Kubernetes.
Preferred Qualifications
- AWS Certified Solutions Architect or equivalent certification.
- Experience with cloud data warehousing, lakehousing solutions (e.g., Trino, Iceberg) and distributed database platforms.
- Knowledge in machine learning and advanced analytics.
Benefits & Perks
- Employer retirement contribution – direct STScI contribution of 10% of your salary from your first day.
- 12 days sick leave, 24 days’ vacation, and 10 paid holidays.
- Comprehensive medical, dental, vision, and prescription plans, and more!
Required Skills
Orchestration (Kubernetes)
Distributed Database Systems
SQL Programming
Data Modeling & Governance
Data Lakehouse Solutions (Trino, Iceberg)
Containerization (Docker)
AWS Cloud
Machine Learning
Python Programming
ETL/ELT Processes
Big Data Tools (Spark)