Lead AI Engineer (FM Hosting, LLM Inference)

Capital One3 months ago

San Francisco, CA, United States

Hybrid

Full-time

Junior Level (1-3 years)

Job Description

Position Overview

At Capital One, we are creating responsible and reliable AI systems that change banking for good. For years, Capital One has led the industry in using machine learning to deliver real‐time, personalized customer experiences by leveraging breakthrough AI and scalable, high-performance infrastructure.

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI to life. Working hand-in-hand with partners across the company, the team advances the state of the art in science and AI engineering by building and deploying proprietary solutions that deliver value to millions of customers.

Key Responsibilities

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large-scale production AI systems.
Contribute to the technical vision and long-term roadmap of foundational AI systems at Capital One.

Required Qualifications

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI/ML algorithms or technologies, OR a Master's degree in a related field with at least 2 years of relevant experience.
At least 4 years of experience programming with Python, Go, Scala, or Java.

Preferred Qualifications

6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud).
Experience designing, developing, delivering, and supporting AI services.
Experience developing AI/ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang.
Experience applying state-of-the-art techniques to optimize training and inference software for improved hardware utilization, latency, throughput, and cost.
A passion for staying current with the latest AI research and a proven ability to apply novel techniques in production environments.

Benefits & Perks

Compensation: Cambridge, MA: $193,400 - $220,700; McLean, VA: $193,400 - $220,700; New York, NY: $211,000 - $240,800; San Francisco, CA: $211,000 - $240,800; San Jose, CA: $211,000 - $240,800 for Lead AI Engineer.
Performance-based Incentives: Eligible to earn performance-based incentive compensation, including cash bonus(es) and/or long term incentives (LTI).
Comprehensive Benefits: Capital One offers a competitive and inclusive set of health, financial, and other benefits to support your total well-being. Learn more on the Capital One Careers website.

Required Skills

Machine Learning

Software Engineering

AI Systems Design

Optimization Techniques

Python

Large Language Model Inference

Cloud Services (AWS, GCP, Azure)