Senior DevOps / Cloud Site Reliability Engineer

UBS4 months ago
Raleigh, NC, United States
Hybrid
Full-time
Junior Level (1-3 years)

Job Description

Position Overview

Job Reference #314173BR – Full Time. UBS is looking for a DevOps/Cloud Site Reliability Engineer to work in a complex, global environment with a dedicated Site Reliability Engineering team. In this role you will deploy, monitor, and support applications in modern cloud-oriented environments, drive automation initiatives, and work cross-functionally to ensure system stability. You will also benefit from flexible working arrangements and an inclusive, diverse culture that values your ideas and efforts.

Key Responsibilities

  • Deploy, monitor, and support applications in modern cloud environments.
  • Promote overall system stability through monitoring, change control management, and implementing best practices in observability.
  • Support Gitlab Pipelines and Terraform to manage Infrastructure as Code across multiple Azure resources/services.
  • Research automation opportunities, troubleshoot issues, and implement migrations, upgrades, and patches.
  • Collaborate with business and development teams at all levels, and work with cross-functional teams (Database, UNIX, Cloud, etc.) to resolve complex issues.
  • Establish root causes of application errors and escalate serious concerns when necessary.
  • Provide occasional on-call or weekend support for critical activities during non-production hours.

Required Qualifications

  • Bachelor's degree with ideally 7+ years of work experience in an Information Technology field.
  • Deep understanding of Site Reliability Engineering concepts and strong knowledge of Azure Cloud.
  • Hands-on experience with Kubernetes and Docker containers.
  • Proficiency with DevOps tools such as Gitlab, Azure DevOps, and Nexus along all phases of modern DevOps pipelines.
  • Working knowledge of key Azure technologies including Virtual Machines, Key Vaults, Storage Accounts, Virtual Networks, and Hub Networks.
  • Demonstrable experience in delivering quality operational services in complex and secure environments.
  • Practical experience in developing automation routines to enhance production stability and service quality.
  • Experience with Azure repos, branching strategies, code reviews, and code analysis tools.
  • Excellent proficiency in programming languages such as Python, Bash, and PowerShell.
  • Strong understanding of database platforms like PostgreSQL, Oracle, or Azure SQL.
  • Solid grasp of ITIL lifecycle and processes.
  • Proven team player with strong organizational skills and the ability to prioritize in a complex environment.
  • Certification as a Kubernetes Administrator (CKA) or Kubernetes App Developer (CKAD).

Preferred Qualifications

  • Backstage Open Platform experience is a plus.

Benefits & Perks

  • Flexible working arrangements including part-time, job-sharing, and hybrid (office and home) options.
  • Opportunities for professional growth through diverse roles and continuous learning.
  • Inclusive and diverse work environment that values every individual’s contributions.
  • Global career exposure at the world’s largest wealth manager.

Required Skills

Python
GitLab
PowerShell
On-Call Support
Database Administration
Automation
Azure
Docker
Terraform
Cloud Computing
DevOps
Kubernetes
ITIL
Site Reliability Engineering
Bash