Technical Architect - AI / ML

WinWire4 months ago

San Jose, CA, United States

On-site

Full-time

Junior Level (1-3 years)

Job Description

Position Overview

Based in California and offered as an FTE/Contract role, this position calls for a Technical Architect specializing in LLMs and Agentic AI. You will own the architecture, strategy, and delivery of enterprise-grade AI solutions, working with cross-functional teams and customers to define the AI roadmap, design scalable solutions, and ensure the responsible deployment of Generative AI across the organization.

Key Responsibilities

Own the architecture, strategy, and delivery of enterprise-grade AI solutions ensuring scalable and secure implementations.
Architect scalable and secure AI/ML/LLM platform solutions including data, model, and inference pipelines; establish enterprise reference architectures, reusable components, best practices, and governance standards for AI adoption; integrate cloud-native, open-source, and enterprise tools.
Implement automated MLOps/LLMOps workflows covering deployment, monitoring, observability, compliance, and performance optimization; collaborate with cross-functional teams (engineering, data science, security, and product) to align platform capabilities with business goals and drive adoption.

Required Qualifications

10-16 years of experience in AI/ML-related roles with a strong focus on LLMs and Agentic AI technology.
6-10 years of experience in designing and implementing large-scale distributed systems, microservices, serverless, and event-driven architectures.
5-8 years of experience in cloud-native architecture (Azure/AWS/GCP) including networking, storage, compute scaling, GPU workloads, and managed AI services; plus experience with platform components, API design, integration patterns, and high-performance compute architecture.
4-7 years of experience building or integrating AI/ML platforms, pipelines, model lifecycle components, inference gateways, and/or enterprise GenAI frameworks.
3-6 years of experience using AI platform tools such as Databricks, Vertex AI, Azure AI Studio, AWS Bedrock, LangChain, Prompt Flow, Ray, Kubeflow, MLflow, Airflow, or Kafka.
2-5 years of experience designing and integrating vector database solutions such as Pinecone, Weaviate, FAISS, Milvus, Qdrant, Elastic, OpenSearch, or Cosmos DB Vector.
2-3 years of experience in LLM architectures, RAG pipelines & patterns, evaluation frameworks, embeddings, tokenization, prompt engineering, evaluation strategies, hallucination reduction, and Agentic AI frameworks; plus 1-2 years in Agentic AI frameworks like MCP or A2A.
2-3 years of experience building GenAI applications, agent workflows, or knowledge retrieval systems using frameworks like LangChain, LlamaIndex, Graph RAG, or custom implementations.
Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
Prior experience in Agile/Scrum projects with exposure to tools like Jira or Azure DevOps.
Strong interpersonal skills for building and maintaining productive relationships with team members and customer representatives.
Ability to provide and receive constructive feedback during code reviews.
An analytical mindset with the ability to translate ideas into tangible technology implementations and drive adoption.
Robust problem-solving and analytical thinking skills with the capability to troubleshoot and resolve issues efficiently.
Demonstrated ability to provide regular updates and communicate effectively with internal and customer stakeholders through verbal, email, and instant messaging channels.

Required Skills

Data, model, and inference pipelines

Agile/Scrum methodologies

Microservices and serverless architectures

Cross-functional team collaboration

MLOps and LLMOps workflows

Enterprise AI/ML architecture

API design and integration

Cloud-native solutions (Azure, AWS, GCP)

Vector database integration (Pinecone, FAISS, etc.)

Scalable distributed systems

LLM and Agentic AI