Staff+ Software Engineer - Cloud Availability Platform Engineering (CAPE)

Crusoe
4 days ago
San Francisco, CA
Hybrid
Full-time
Junior Level (1-3 years)

Job Description

Position Overview

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. In this role as a Staff/Senior Staff/Principal Software Engineer, you will architect, design, and develop cloud infrastructure management systems for our AI-first Crusoe Cloud, driving meaningful innovation, operational efficiency, and transformative growth.

Key Responsibilities

  • Collaborate extensively across teams to architect, design, and implement physical infrastructure management software systems, availability platforms, and frameworks that support our AI Infrastructure.
  • Champion the reliability, scalability, and security of our systems and platforms as the guardian of our cloud environment.
  • Develop workflows that drive efficiency while meeting key business objectives and revenue metrics.
  • Design and implement high-performing, highly available cloud architectures optimized for both performance and cost-effectiveness.
  • Streamline cloud deployment, configuration management, and operations by developing and maintaining effective platforms, interfaces, and automation tooling.
  • Actively contribute to the evolution of our platform by collaborating closely with cross-functional development teams.

Required Qualifications

  • Bachelor’s degree in Computer Science or Software Engineering with 10+ years of relevant experience.
  • 10+ years of experience building and operating distributed systems at scale.
  • Proven experience designing and running reliable, scalable, efficient, and secure cloud platforms in production environments.
  • Fluency in programming languages such as Go, Rust, Java, or C++.
  • A collaborative approach to working with development and operations teams, ensuring robust platform adoption.
  • Strong understanding of cloud security best practices and ability to implement secure configurations.
  • Excellent troubleshooting and problem-solving skills for complex infrastructure challenges.
  • Excellent communication skills.
  • Alignment with company values and a passion for innovation.

Preferred Qualifications

  • Hands-on experience deploying, managing, and troubleshooting Kubernetes clusters.
  • Experience in a fast-paced startup environment.
  • A passion for building an energy-first, scalable AI Infrastructure and a commitment to sustainability and innovation.

Benefits & Perks

  • Compensation: $215,000 - $290,000 with Restricted Stock Units included in all offers. Compensation is determined by education, experience, knowledge, skills, abilities, internal equity, and market alignment.
  • Join a pioneering team at the forefront of the AI revolution with a focus on sustainable technology.
  • Be a part of a diverse and inclusive work environment where innovation is celebrated. Crusoe is an Equal Opportunity Employer.

Required Skills

Security Best Practices
Automation
Scalability
C++
System Architecture
Distributed Systems
Mentoring
Go
Reliability
Java
Rust
Kubernetes
Cloud Infrastructure Management