Analytics Engineer, DataSF (1042)
City and County of San Francisco
3 months ago
San Francisco, CA, United States
Hybrid
Full-time
Junior Level (1-3 years)
Job Description
The 1042 Analytics Engineer
The 1042 Analytics Engineer is responsible for maintaining, developing, and coordinating engineering services that support City data sharing via the City’s data platform, and for assisting with our data science work. You will:
Improve the data services DataSF offers to departments
- Help implement new automation patterns that leverage cloud analytics platforms (including Snowflake and dbt) to publish new datasets to the open data portal more efficiently and reliably
- Update and improve documentation to support both our own internal operations and self-service data automation for other departments
- Continuously assess and help improve our suite of data automation services by evaluating new and emerging technologies, streamlining existing business processes, and identifying opportunities for automation and self-service tooling
- Support the building and deployment of new data services for departments
Build analytics pipelines to support data-driven work
- Work with the team to develop extract, transform, load (ETL) requirements for individual datasets and consult with departments on the best way to automate and publish datasets
- Apply an ethical lens to the appropriate use of data
- Create new analytics pipelines using ETL/ELT approaches according to standards and patterns you help develop and refine
- Implement analytics pipelines and/or data models to support data science and data analytics work as needed
Maintain existing data pipelines
- Monitor existing data automations developed on our legacy infrastructure (Safe Software's Feature Manipulation Engine (FME) Server), respond to incidents, and manage updates
- Migrate existing data automations to leverage cloud analytics tools (dbt, Snowflake, etc.)
- Analyze pipeline throughput, issues, and other metrics to inform improvements to the automation platform
Required Skills
Snowflake
Documentation
Cloud Analytics
Data Pipeline Management
ETL
Data Automation
ELT
Data Analysis
dbt