Senior System Engineer
On-Demand Group8 days ago
Minneapolis, MN, United States
Hybrid
Full-time
Junior Level (1-3 years)
Job Description
Job Location:
Occasional on-site - SW Metro, Minneapolis
Job Type:
Full-time
Role Overview
We are seeking a highly experienced Senior Systems Engineer to lead and own enterprise backup and recovery systems, Microsoft SCCM infrastructure, and storage platforms. This role is responsible not only for day‑to‑day administration, but also for architecture, reliability, continuous improvement, and mentorship of junior engineers. The ideal candidate brings deep technical expertise, strong operational discipline, and the ability to drive infrastructure strategy in a complex hybrid environment.
Core Responsibilities
Backup & Disaster Recovery (Primary Ownership)
- Serve as the technical owner for enterprise backup and recovery platforms.
- Architect, implement, and continuously optimize backup solutions across on‑prem, virtual, and cloud workloads.
- Lead and execute complex recovery efforts during incidents, outages, security events, and disaster recovery scenarios.
- Define, test, and maintain disaster recovery strategies, runbooks, and recovery time objectives (RTO/RPO).
- Perform regular restore testing and audit recovery readiness.
- Ensure backup policies align with business continuity, security, and compliance requirements.
- Evaluate new backup technologies and lead tool selection and migrations where appropriate.
- Architect, administer, and support Microsoft SCCM (Configuration Manager) environments at scale.
- Lead OS deployments, patching strategies, application packaging, and compliance configuration.
- Design and maintain servicing plans, maintenance windows, and update deployment strategies.
- Troubleshoot complex SCCM infrastructure, client, and content distribution issues.
- Improve automation, reporting, and reliability of endpoint management processes.
- Act as an escalation point for complex SCCM issues.
- Own and administer enterprise storage platforms (SAN/NAS), including performance optimization, capacity planning, and lifecycle management.
- Lead storage provisioning, migrations, upgrades, and refresh initiatives.
- Monitor and analyze storage performance, growth trends, and risk indicators.
- Troubleshoot complex storage‑related performance and availability issues.
- Partner with virtualization and application teams to ensure storage solutions meet workload requirements.
- Drive standardization and best practices across storage platforms.
Infrastructure Leadership & Engineering
- Act as a senior technical escalation point for infrastructure issues.
- Mentor and provide guidance to junior and mid‑level systems engineers.
- Lead infrastructure projects, upgrades, and strategic initiatives.
- Create and maintain high‑quality technical documentation, diagrams, and operational runbooks.
- Participate in architecture discussions and long‑term infrastructure planning.
- Collaborate with security, network, cloud, and application teams to deliver stable and secure platforms.
On-Call & Operational Expectations
- Participate in a 5?week on-call rotation providing off‑hours support for critical infrastructure systems.
- Serve as a senior escalation point during on‑call periods for backup failures, storage incidents, SCCM issues, and system outages.
- Lead or assist with incident response, root‑cause analysis, and post‑incident reviews.
- Ensure proper documentation and knowledge transfer to reduce repeat incidents and improve operational stability.
- Availability for scheduled maintenance, patching, and recovery testing outside of normal business hours as required.
Required Qualifications
- 8–10+ years of experience in enterprise systems engineering or infrastructure operations.
- Deep, hands‑on experience with enterprise backup and recovery platforms (e.g., Veeam, Commvault, Rubrik, Cohesity).
- Advanced experience designing and supporting Microsoft SCCM / MECM environments.
- Extensive experience administering enterprise storage systems (SAN/NAS).
- Strong expertise with Windows Server; solid working knowledge of Linux.
- Experience supporting VMware, Nutanix and/or Hyper‑V environments.
- Proven ability to lead complex troubleshooting and root‑cause analysis efforts.
- Strong documentation, communication, and stakeholder engagement skills.
Preferred Qualifications
- Experience with hybrid and cloud...
Required Skills
Microsoft SCCM (Configuration Manager)
Enterprise backup and recovery systems (Veeam, Commvault, Rubrik, Cohesity)
Hyper-V
Technical documentation and communication
SAN/NAS storage administration
Nutanix
Troubleshooting and root-cause analysis
Linux
Mentorship and leadership
Windows Server
VMware