Cloud Data Platform Engineer AWS
THE EVOLVERS GROUP26 days ago
Baltimore, MD, United States
Remote
Full-time
Junior Level (1-3 years)
Job Description
Job Title
Cloud Data Platform Administrator is the hands-on technical resource responsible for implementing, securing, and operating EDP. This role is accountable for platform operations, security, and governance configuration end-to-end—ensuring the environment is compliant, reliable, cost-controlled, and enables secure analytics and AI/ML workloads at scale.
Required Experience:
- Identity & Access Management proficiency: SSO concepts, SCIM provisioning, group based RBAC, service principals, least-privilege patterns.
- Security fundamentals: secrets management, secure connectivity, audit logging, access monitoring, and evidence-ready operations.
- Automation skills: IaC using Terraform, CLI, and REST APIs for repeatable configuration and environment promotion.
- 3 years' experience building AWS Infrastructure using Terraform.
- 3 years' experience building CI/CD pipelines, preferably using Azure DevOps or Gitlab.
- CI/CD practices for promotion across SDLC environments.
- Strong troubleshooting and problem-solving; communicate clearly during incidents and changes.
- Cloud platform expertise (AWS): IAM roles/policies, object storage security patterns, networking basics (VPC concepts), logging/monitoring integration.
- Hands-on experience with AWS security and networking services including PrivateLink, Secrets Manager/Systems Manager integration, CloudWatch/CloudTrail integration, S3 bucket policies, cross-account access patterns, and KMS encryption key management.
Education / Experience:
- Bachelor's degree: in a related field or equivalent practical experience.
- Highly valued (Desirable, but not required): knowledge, skills and experience SQL proficiency and data engineering fundamentals for troubleshooting query performance issues, understanding ETL/ELT workflow patterns, and debugging data pipeline failures; basic Python/Scala familiarity for notebook/code issue diagnosis.
- Experience: with compliance and regulatory frameworks (FedRAMP, HIPAA, SOC2, or similar) including implementation of data residency requirements, retention policies, and audit-ready evidence collection.
- SLA/SLO management: incident management, and stakeholder communication skills; ability to define platform service levels, produce operational reports, translate technical issues to business stakeholders, and manage vendor relationships (Databricks account teams).
Certifications:
- AWS Certified Solutions Architect Associate or Professional
The Contractor shall deliver, but not limited to, the following:
- Implement platform monitoring/alerting, operational dashboards, and health checks; maintain runbooks and operational procedures.
- Provision and administer AWS GovCloud infrastructure components supporting EDP environments (networking, compute, storage, IAM, logging/monitoring).
- Implement and maintain standardized secure-by-default configurations aligned to agency security requirements (baseline hardening, patching coordination, configuration management).
- Operate cloud services supporting data and analytics platforms (e.g., storage integrations, encryption/KMS patterns, secure service endpoints, VPC constructs).
- Establish and maintain operational monitoring/alerting, health checks, runbooks, and incident support in coordination with platform and security teams.
- Manage change control for upgrades, feature rollouts, configuration changes, and integration changes; document impacts and rollback plans.
- Enable and maintain audit logging and access/event visibility; support security reviews and evidence requests.
- Configure logging and auditability (e.g., CloudTrail/CloudWatch patterns) and support evidence collection for security/compliance activities.
- Coordinate secure networking patterns (private connectivity, egress controls, firewall/proxy constraints) with network and security stakeholders.
- Build and manage POC environments (isolated accounts/VPCs where applicable), ensuring repeatability, cost controls, and safe teardown.
- Coordinate secure connectivity and guardrails with cloud/network teams: private connectivity patterns, egress controls, firewall/proxy needs.
- Implement cost guardrails: cluster policies, auto-termination, scheduling, workload sizing standards, and capacity planning.
- Produce usage/cost insights and optimization recommendations; address waste drivers (idle compute, oversized clusters, inefficient jobs).
- Automate administration and configuration using APIs/CLI/IaC (e.g., Terraform) to reduce manual drift and improve repeatability.
- Maintain platform documentation: configuration baselines, security/governance standards, onboarding guides, and troubleshooting references.
- Manage third-party integrations and ecosystem connectivity, including BI tool integrations (e.g., Power BI), and external metadata catalog integrations.
- Conduct capacity planning and scalability analysis, including forecasting concurrent user/workload growth, platform scaling strategies, and proactive resource allocation during peak usage periods.
- Facilitate user onboarding and enablement, including new user/team onboarding procedures, training coordination, workspace access provisioning, and creation of self-service documentation/guides.
Required Skills
Incident Management
CloudTrail
Gitlab
Networking
Automation
Terraform
Data Engineering
AWS
Compliance Frameworks
SQL
Azure DevOps
Scala
Monitoring
Python
IAM
Operational Procedures
CloudWatch
Security Management
Cost Management
CI/CD