Data Science Summer Intern (Remote & Paid)

Experianabout 2 months ago
Baltimore, MD, United States
Remote
Internship
Junior Level (1-3 years)

Job Description

Position Overview

Experian is a global data and technology company that powers opportunities for people and businesses worldwide. As part of the Experian NA Innovation Lab – a dedicated R&D unit focused on enhancing client relationships and acquiring strategic datasets – you will join the North America R&D Data Lab. In this role, you will concentrate on research and development of novel analytical solutions, new product prototyping, and evaluating new data assets. A background in machine learning and data mining is essential, and previous experience analyzing large datasets and developing data-driven statistical models is highly desirable.

Key Responsibilities

  • Create advanced machine learning analytical solutions to extract insights from diverse structured and unstructured data sources
  • Unearth data value by selecting and applying the right machine learning, deep learning, and processing techniques
  • Refine data manipulation and retrieval through the design of efficient data structures and storage solutions
  • Innovate with tools designed for data processing and information retrieval
  • Dissect and document vast datasets, analyzing and processing them to highlight patterns and insights
  • Solve complex challenges by developing impactful algorithms
  • Ensure model excellence by validating performance scores and analyzing ROI and benefits
  • Articulate model processes and outcomes by documenting and presenting findings and performance metrics

Required Qualifications

  • Return to school in the Fall of 2026 to complete degree program
  • Currently enrolled in a PhD degree program in Machine Learning, Computer Science, Electrical Engineering, Physics, Statistics, Applied Math, or another quantitative field
  • Experience in analytics, data mining, and/or predictive modeling
  • Experience modifying and applying advanced algorithms to address practical problems
  • Experience with deep learning (e.g., CNN, RNN, LSTM, attention models), machine learning methodologies (e.g., SVM, GLM, boosting, random forest), graph models, and/or reinforcement learning
  • Familiarity with open-source tools for deep learning and machine learning such as PyTorch, Keras, TensorFlow, scikit-learn, and pandas
  • Experience with large data analysis using Spark (pySpark preferred)
  • Proficient in more than one programming language such as Python, R, Java, C++, or C

Benefits & Perks

  • Benefits: Fully remote
  • Benefits: Volunteer Time Off
  • Benefits: Great compensation
  • Benefits: Flexible work schedule
  • Benefits: Eligible for 401(k) participation in 90 days

Required Skills

scikit-learn
machine learning
data mining
statistical modeling
Python
tensorflow
Spark
pandas
deep learning