Principal Software Engineer – Red Hat OpenShift AI
Red Hat, Inc.
3 months ago
Raleigh, NC, United States
Hybrid
Full-time
Senior Level (8+ years)
Job Description
Position Overview
Join the Red Hat OpenShift AI team to create open, hybrid, and intelligent solutions. You will work alongside AI engineers, data scientists, and MLOps professionals in a fast-paced, innovative environment that combines open source expertise with the latest advances in AI and hybrid cloud. At Red Hat, you’ll be part of a culture that embraces change, drives innovation, and empowers associates to focus on high-impact work.
Key Responsibilities
- Lead the design and development of new component solutions for Red Hat OpenShift AI (RHOAI).
- Provide technical guidance and innovation insights to engineering and product leadership.
- Contribute to the architecture and design of RHOAI, ensuring scalability, reliability, and an exceptional user experience.
- Define and implement policies that ensure RHOAI components are observable, secure, and scalable.
- Design cross-cutting plans to ensure consistent implementation across teams and components.
- Apply CI/CD best practices to develop and deliver high-quality software solutions.
- Collaborate with upstream communities such as Kubeflow, MLflow, and Kubernetes to ensure smooth integration on OpenShift.
- Represent Red Hat at conferences and in AI/ML communities, both internally and externally.
- Engage with open source communities to evaluate emerging AI, ML, and data technologies, creating integration examples with RHOAI.
- Mentor and guide engineers, fostering technical excellence and collaboration.
- Experiment with emerging AI technologies to enhance development workflows and tools.
Required Qualifications
- 8+ years of progressive experience in software engineering.
- Proven experience in a technical leadership role, guiding teams and providing technical direction.
- Demonstrated innovation skills and a passion for technology.
- Strong software architecture background, with a focus on consistency and technical excellence.
- Excellent communication skills, with fluency in English.
- Solid understanding of the Linux operating system.
- Strong experience with the Go programming language.
- Demonstrated expertise in Python.
- Experience building and deploying containerized applications, including operator-based install and upgrade processes.
- Deep understanding of open source software communities and collaboration models.
- Knowledge of Kubernetes architecture, operators, and controllers.
- Experience troubleshooting complex issues in distributed systems.
- Hands-on experience with public cloud services (AWS, GCP, Azure).
- Proficiency in using LLMs (e.g., Google Gemini) for brainstorming, research, documentation, and problem solving.
- Experience with AI-assisted development tools (e.g., GitHub Copilot, Cursor, Claude Code) for code generation and auto-completion.
Preferred Qualifications
- Security background or experience integrating security best practices.
- Knowledge of the Prometheus monitoring stack.
- Bachelor’s degree or higher in Computer Science or a related field.
Benefits & Perks
- Compensation: Salary range of $148,540.00 - $245,050.00. Actual offer will be based on qualifications, job location, experience, skills, market value, and internal pay equity.
- Comprehensive medical, dental, and vision coverage.
- Flexible Spending Account for healthcare and dependent care.
- Health Savings Account with a high-deductible medical plan.
- 401(k) retirement plan with employer match.
- Paid time off and holidays.
- Paid parental leave plans for all new parents.
- Leave benefits including disability, paid family medical leave, and paid military leave.
- Additional benefits such as an employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program.
Required Skills
- Technical Leadership
- Python
- Distributed Systems Debugging
- Go Programming
- Open Source Collaboration
- Security Best Practices
- Containerization
- Operator Development
- AI-assisted Development
- Prometheus Monitoring
- Software Architecture
- Linux
- Cloud Services (AWS, GCP, Azure)
- CI/CD
- Kubernetes