<section style="margin-bottom: 20px;">
 <h2 style="color: #34495e; font-size: 1.2em; margin: 20px 0 10px 0;">Job Description</h2>
 <p style="line-height: 1.6; margin-bottom: 15px;">Genesis AI is seeking an experienced individual to develop low-latency inference pipelines for on-device deployment in robotics. The role involves designing and optimizing distributed systems on GPU clusters, implementing efficient low-level code such as CUDA and Triton, and managing workloads to ensure high throughput and low latency.</p>
 <p style="line-height: 1.6; margin-bottom: 15px;">Ideal candidates will have over 8 years of experience in distributed systems, a strong Python background, and mastery in kernel optimization. This position is essential for our cutting-edge work in machine learning infrastructure.</p>
</section>

Low-Latency Inference Systems Engineer - On-Device & GPU

Job Description

Job Description

Required Skills