Cloud Data Engineer

Space Telescope Science Institute6 months ago
Baltimore, Maryland, United States
Hybrid
Full-time
Junior Level (1-3 years)

Job Description

Position Overview

The Space Telescope Science Institute (STScI), a key NASA science and flight operations center, is hiring a Cloud Data Engineer to join our Catalog Science Branch. You will help manage large-scale data infrastructure and processing for the Mikulski Archive for Space Telescopes (MAST), which supports missions including HST, JWST, Roman, Kepler, and TESS. This role supports a hybrid work model (currently work from home) for candidates residing in or willing to relocate to MD, DE, VA, PA, DC, & WV. Compensation: For senior-level roles (8+ years of experience) the annual salary range is $130,000–$150,000 and for Cloud Data Engineer II roles (5+ years of experience) it is $115,000–$125,000. US Citizenship or Permanent Residence is required to meet ITAR requirements.

Key Responsibilities

  • Design, construct, install, test, and maintain highly scalable cloud-based data management systems
  • Develop and maintain scalable data pipelines and build out new API integrations to support increasing mission data volume and complexity
  • Ensure systems meet mission requirements and industry software development best practices
  • Demonstrate proficiency in designing, deploying, and supporting hybrid environments combining legacy and modern astronomical data
  • Collaborate with application engineers, archive scientists, data architects, and storage/networking teams on projects and architectural design
  • Ensure scientific data quality and consistency across various catalog sources

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field
  • Minimum of 8 years (senior level) or 5 years (Cloud Data Engineer II) of industrial experience in an AWS Cloud environment along with hands-on data engineering and data management expertise
  • Proficiency in SQL and experience with cloud-based SQL databases (e.g., Amazon PostgreSQL RDS)
  • Strong experience with ETL processes and data warehousing solutions
  • Familiarity with big data tools and frameworks
  • Knowledge of data modeling, data architecture, and governance
  • Understanding of data and database security, including performance tuning at the SQL/database level
  • Experience with cloud platforms (especially AWS) and using various cloud applications to automate data processes
  • Strong programming skills in SQL and Python
  • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
  • Excellent problem-solving abilities and strong communication and collaboration skills

Preferred Qualifications

  • Experience with Massive Parallel Processing (MPP) technology-based database clusters
  • Background in areas such as machine learning, advanced analytics, or database architecture
  • Experience with continuous integration and continuous deployment, and modern Infrastructure as Code (IaC) practices

Benefits & Perks

  • Employer retirement contribution – direct STScI contribution of 10% of your salary from your first day
  • 12 days sick leave, 24 days vacation, and 10 paid holidays
  • Comprehensive medical, dental, vision, and prescription plans, among other benefits

Required Skills

Data Modeling
Python
Docker
Data Warehousing
AWS Cloud
Cloud Data Management
Infrastructure as Code
Data Engineering
SQL
Kubernetes
ETL
Big Data Tools