AI/ML Prompt Scientist/Engineer

STAND 84 months ago

San Francisco, CA, United States

Remote

Contract

Junior Level (1-3 years)

Job Description

Position Overview

STAND 8 provides end-to-end IT solutions to enterprise partners across the United States and globally. With offices in Los Angeles, New York, New Jersey, Atlanta, Mexico, India, and more, we are at the forefront of technology innovation. We are looking for an AI/ML Prompt Scientist/Engineer to design, evaluate, and optimize prompt-based systems that drive real-world impact in cutting-edge AI applications. This role is ideal for a Data Scientist or Machine Learning Engineer experienced in prompt optimization, orchestration frameworks, retrieval pipelines, and evaluation tooling. You will collaborate with AI Engineers and Data Scientists to build scalable, reliable, and safe AI systems.

Key Responsibilities

Design, optimize, and evaluate prompts and orchestration pipelines to enhance LLM performance and scalability, supporting both data science and engineering workflows.
Implement programmatic prompting using frameworks such as LangChain, LlamaIndex, DSPy, Guidance, and LangGraph.
Define and manage evaluation frameworks, including A/B testing, human and automated scoring, and model quality assessments using tools like Ragas, UpTrain, DeepEval, TruLens, Promptfoo, and OpenAI Evals.
Engineer and maintain context management systems, including conversation memory, retrieval logic, and context compression strategies (chunking, summarization, prioritization).
Design and optimize retrieval and RAG pipelines by integrating vector databases (e.g., Weaviate) and rerankers (e.g., Cohere) to enhance relevance and accuracy.
Establish observability and traceability mechanisms (e.g., Langfuse) to monitor performance metrics, latency, and operational costs.
Implement guardrails, safety mechanisms, and bias mitigation policies using frameworks such as Guardrails AI, NeMo Guardrails, Outlines, and Rebuff.
Collaborate with cross-functional teams—including Data Scientists, ML experts, and AI Engineers—to ensure high-quality, compliant, and well-documented system delivery.

Required Qualifications

Proven experience in LLM and NLP model operations, including tokenization, context window management, and safety considerations.
Strong technical expertise in prompt engineering, orchestration frameworks, and context management.
Hands-on experience with retrieval-augmented generation, evaluation frameworks, and observability tooling.
Sound understanding of evaluation metrics such as faithfulness, groundedness, coherence, toxicity, latency, and cost.
Experience with compliance and ethical AI practices, including bias and hallucination mitigation.
Excellent analytical, documentation, and communication skills with a proven ability to collaborate effectively across teams.

Preferred Qualifications

Experience in model fine-tuning and alignment (e.g., LoRA, DPO/RLHF, preference tuning).
Background in developing AI agents and integrating tool-use capabilities.

Benefits & Perks

Compensation: Base range is $100,000 - $200,000 per year, depending on experience.
Medical coverage and Health Savings Account (HSA) through Anthem.
Dental, Vision, and various Ancillary coverages through Unum.
401(k) retirement savings plan.
Paid-time-off options.
Company-paid Employee Assistance Program (EAP) and discount programs through ADP WorkforceNow.

Required Skills

Retrieval-Augmented Generation

Machine Learning

Safety Mechanisms

Bias Mitigation

NLP

Python

Evaluation Frameworks

LLM Optimization

Context Management

Prompt Engineering