Cloud Operations Engineer - Department of Technology (1043)
City and County of San Francisco13 days ago
San Francisco, CA
Hybrid
Full-time
Junior Level (1-3 years)
Job Description
Position Overview
Join the Cloud Center of Excellence (CCoE) team at the City and County of San Francisco, where you will support mission‐critical infrastructure for essential public services. As a Cloud Operations Engineer with the Department of Technology (DT), you will leverage your technical expertise to design, develop, and maintain secure, scalable, and highly available commercial cloud infrastructure for over 50 city departments. This is aPermanent Exempt - Full Timeposition on a 36‑month project that supportsremote workand offers ahybridschedule. Work primarily at 1 S Van Ness Ave, San Francisco, CA 94103, with occasional travel and a requirement to relocate to California within 4 weeks if needed.
Key Responsibilities
- Contribute cloud expertise to develop optimized commercial cloud services for multiple city departments and applications.
- Consult with and advise business partners on efficient cloud technology usage.
- Architect, build, operate, deploy, and maintain secure, scalable, and highly available cloud infrastructure.
- Migrate business systems and data to commercial cloud environments for both production and disaster recovery.
- Develop and maintain software solutions and frameworks that automate cloud configuration and administration.
- Design and implement cloud-native solutions to support rapid enhancements and perform capacity planning.
- Collaborate with development, applications, operations, and security teams to ensure functional and non‐functional requirements are met.
- Gather and analyze metrics from operating systems and applications for performance tuning and fault analysis.
- Configure and deploy mechanisms for cloud cost management, budgeting, and scalable capabilities.
- Establish and enforce cloud infrastructure best practices and guidelines for audits and compliance.
- Develop automation tools for consistent, efficient, transparent, and secure management of cloud operations and finances.
- Enhance and maintain disaster recovery and business continuity plans, including backup and restore processes.
- Implement and maintain monitoring and alerting systems to proactively identify and resolve issues.
- Serve as the 24x7 lead for escalating and resolving problems with cloud providers.
- Create and manage Operations Scope of Procedures (SOPs), detailing standard procedures, lessons learned, diagnostic steps, and resolutions.
- Research and evaluate industry trends to continuously improve infrastructure solutions.
Benefits & Perks
- Remote Work:Enjoy a work environment that supports remote and hybrid schedules.
- Stability:Be part of a mission‐critical team serving over 50 city departments in aPermanent Exempt - Full Timerole.
- Dynamic Environment:Work on cutting‑edge cloud infrastructure and participate in innovative projects that modernize IT services.
Required Skills
Operational Monitoring
Cloud Architecture
Cloud Migration
Capacity Planning
Security and Compliance
Cloud Cost Management
Disaster Recovery
Performance Tuning
Automation
Infrastructure Design