<section>
  <h2 style="color: #34495e; font-size: 1.2em; margin: 20px 0 10px;">Position Overview</h2>
  <p style="font-family: system-ui, -apple-system, sans-serif; line-height: 1.6; color: #333;">
    We are looking for an ML Engineer with hands-on experience deploying large language models (LLMs) on GPU infrastructure. The role combines ML engineering with DevOps and focuses on scalable deployments, API integration, and LLM performance optimization.
  </p>
</section>

<section style="background: #f8f9fa; padding: 20px; border-radius: 8px; margin: 20px 0;">
  <h2 style="color: #34495e; font-size: 1.2em; margin: 0 0 10px;">Key Responsibilities</h2>
  <ul style="padding-left: 20px; margin: 10px 0;">
    <li style="margin: 8px 0;">Deploy and optimize LLMs on GPU-based infrastructure.</li>
    <li style="margin: 8px 0;">Build and manage APIs for model serving (Python-based).</li>
    <li style="margin: 8px 0;">Implement CI/CD, monitoring, and scaling for ML models.</li>
    <li style="margin: 8px 0;">Collaborate on prompt engineering and model optimization.</li>
    <li style="margin: 8px 0;">Manage containerized workloads (Docker/Kubernetes).</li>
  </ul>
</section>

<section style="background: #f8f9fa; padding: 20px; border-radius: 8px; margin: 20px 0;">
  <h2 style="color: #34495e; font-size: 1.2em; margin: 0 0 10px;">Required Qualifications</h2>
  <ul style="padding-left: 20px; margin: 10px 0;">
    <li style="margin: 8px 0;">4–5 years of ML/DevOps engineering experience.</li>
    <li style="margin: 8px 0;">Strong skills in Python, API development, and LLM architecture.</li>
    <li style="margin: 8px 0;">Experience with GPU deployments and cloud platforms (AWS/GCP/Azure).</li>
    <li style="margin: 8px 0;">Familiarity with prompt engineering and inference optimization.</li>
    <li style="margin: 8px 0;">Machine Learning: 5 years of experience (Required).</li>
  </ul>
</section>

<section style="background: #f8f9fa; padding: 20px; border-radius: 8px; margin: 20px 0;">
  <h2 style="color: #34495e; font-size: 1.2em; margin: 0 0 10px;">Benefits &amp; Additional Information</h2>
  <ul style="padding-left: 20px; margin: 10px 0;">
    <li style="margin: 8px 0;"><strong>Pay:</strong> $60.00–$75.00 per hour</li>
    <li style="margin: 8px 0;"><strong>Job Type:</strong> Full-time</li>
    <li style="margin: 8px 0;"><strong>Expected Hours:</strong> 40 per week</li>
    <li style="margin: 8px 0;"><strong>Location:</strong> In person in Phoenix, AZ 85003. Candidates must be able to relocate before starting work.</li>
  </ul>
</section>

ML Engineer (LLM Deployment & GPU)
