Generative AI Engineer

Regard4 months ago

San Francisco, CA, United States

Hybrid

Full-time

Junior Level (1-3 years)

Job Description

Position Overview

As a Generative AI Engineer at Regard, you’ll work across the full lifecycle of developing and deploying AI-driven features—from ideation and design to prototyping, implementation, evaluation, and iteration. You’ll collaborate closely with product and clinical teams to build systems that transform medical records into structured insights and clinician-ready documentation. Your work will center on applying modern LLMs to extract, summarize, normalize, and generate clinical information from diverse electronic health record (EHR) data sources. This includes developing robust pipelines, running model and prompt-engineering experiments, integrating models into production services, and ensuring outputs remain factual, safe, and clinically aligned. You’ll directly contribute to high-priority product initiatives, shape new AI capabilities, and advance our LLM platform, making a tangible impact on care quality.

About Regard:

Our mission is to bring world-class healthcare to everyone. Regard is an AI-powered Proactive Documentation platform that advances care delivery by reviewing all patient data in the EHR to recommend diagnoses and surface clinical evidence. We draft a note even before the physician sees the patient—enabling documentation right at the point of care. This approach improves quality of care, reduces physician burden, and enhances hospital finances. We value mission-oriented work, innovation, and strong relationships as we partner with leading health systems across the country.

Compensation: $170,000 to $240,000

Key Responsibilities

Build and refine LLM-powered systems to extract structured medical concepts, diagnoses, medications, labs, and timelines from unstructured records.
Develop generation pipelines that produce clinically accurate drafts of notes (H&P, progress notes, discharge summaries, etc.) from factual inputs.
Design, prototype, and evaluate prompts, agent workflows, and retrieval-augmented generation (RAG) components.
Benchmark LLM systems to evaluate new models and audit accuracy.
Optimize inference cost, latency, and throughput through batching, caching, and model-selection strategies.

Required Qualifications

BS in Computer Science or equivalent experience.
3+ years of professional experience with software development in one or more programming languages (Python preferred).
1+ years of professional experience building generative AI products, such as RAGs, agents, and chatbots.
Ability to participate in on-call operational support for assigned areas of responsibility.
Willingness to travel up to 4 weeks per year for company co-working sessions and/or retreats.
Strong verbal and written communication skills.

Preferred Qualifications

Familiarity with vector databases and embeddings generation.
Experience working on a mature enterprise SaaS technology product.
Exposure to startup and/or high-growth environments.

Benefits & Perks

Benefits: Eligible for equity; 99% employer paid health benefits (Medical, Dental, and Vision) + One Medical subscription; 18 PTO days/yr + 1 week holiday break; Monthly health & wellness budget; Company-sponsored team retreat and social events; A sabbatical program.
Location: Candidates must be authorized to work in the US without visa sponsorship and reside within the New York City metro area, San Francisco Bay Area, or Los Angeles (with relocation assistance provided for those outside the NYC metro area).
Schedule: Hybrid work environment with in-office presence on Tuesdays and Thursdays, and the flexibility to work remotely for up to 6 weeks per year.

Required Skills

Model Benchmarking

Clinical Documentation

Redis

Large Language Models (LLMs)

Generative AI

React

PostgreSQL

AWS

Python

Prompt Engineering

Pipeline Development

Retrieval-Augmented Generation (RAG)

TypeScript