Data Scientist - AI

Internet Brands3 months ago
El Segundo, CA, United States
On-site
Full-time
Junior Level (1-3 years)

Job Description

Position Overview

Internet Brands and WebMD are seeking a Data Scientist to work on exciting personalization initiatives at our Los Angeles headquarters. This role involves applying machine learning and deep learning techniques—including natural language understanding, large language models (LLMs), RAG pipelines, and agentic AI frameworks—to enhance content recommendation, personalization, and business intelligence models. Compensation for this role is expected to range from $100-120k, depending on your skills, qualifications, and experience. Note that the position is on-site five days a week.

Key Responsibilities

  • Process, cleanse, and verify the integrity of data used for analysis.
  • Perform ad-hoc analysis and present clear, actionable results.
  • Apply feature synthesis and selection techniques to build robust models.
  • Utilize regression and classification algorithm frameworks effectively.
  • Apply machine learning techniques at scale on massive datasets.
  • Leverage large language models (LLMs) for content summarization, recommendations, and personalization, incorporating RAG pipelines for knowledge retrieval.
  • Experiment with agentic AI frameworks to enhance personalization and automation.
  • Implement model evaluation benchmarks and large-scale LLM testing frameworks.
  • Utilize prompt fine-tuning and LLM optimization techniques (pre-training, post-training, etc.).
  • Collaborate with cross-functional agile teams, including software engineers and domain experts, to build new product features.
  • Work closely with other data scientists to prioritize and promote machine learning initiatives.

Required Qualifications

  • 2-4 years of experience in data science with a focus on predictive analytics.
  • B.S., M.S., or PhD in Data Science, Computer Science, Software Engineering, Information Science, Mathematics, Statistics, Electrical Engineering, Physics, or a related field (or equivalent experience).
  • Strong understanding of transformer-based architectures, embeddings, and tokenization techniques.
  • Proficiency in LLM frameworks and APIs such as LangChain, Hugging Face Transformers, OpenAI, Google Vertex AI (Gemini), or Anthropic Claude, along with expertise in evaluation and performance benchmarking.
  • Demonstrated ability in prompt engineering, fine-tuning, and evaluation for LLMs.
  • Solid understanding of personalization and recommendation algorithms (e.g., collaborative filtering, content-based filtering, hybrid recommenders, sequence-based or session-based models, and graph-based models).
  • Proficiency in SQL and experience with common data science toolkits such as R, Scikit-learn, NumPy, and Tensorflow.
  • Ability to work on-site at the Los Angeles headquarters five days a week.

Benefits & Perks

  • Comprehensive benefits package including health insurance (medical, dental, and vision), flexible spending accounts (FSA) for medical and dependent care, short-term and long-term disability insurance, and life and AD&D insurance.
  • 401(k) retirement savings plan with a company match.
  • Paid time off (PTO), paid holidays, and commuter benefits.
  • Access to an Employee Assistance Program (EAP), well-being coaching services, and voluntary benefits such as home, auto, and pet insurance, as well as discounted legal and financial services.

Required Skills

Agentic AI systems
Deep Learning
Data Cleansing
LangChain
Feature Synthesis
SQL
Generative AI
Transformer Architectures
Natural Language Understanding
Regression and Classification
RAG pipelines
Hugging Face Transformers
Prompt Engineering
Large Language Models
Machine Learning