Principal Data Scientist - Generative AI, Machine Learning, Python, R - Remote
Molina Healthcare6 months ago
Tampa, FL, United States
Remote
Full-time
Junior Level (1-3 years)
Job Description
Position Overview
Responsible for overseeing data science projects, managing and mentoring a team, and aligning data initiatives with business goals. Lead the development and implementation of data models, collaborate with cross-functional teams, and stay updated on industry trends. Ensure ethical data use and communicate complex technical concepts to non-technical stakeholders. Lead initiatives on model governance and model ops to align with regulatory and security requirements. This role requires technical expertise, strategic thinking, and leadership to drive data-driven decision-making within the organization and be the pioneer on generative AI healthcare solutions, aimed at revolutionizing healthcare operations as well as enhancing member experience.
Key Responsibilities
- Research and Development: Stay current with the latest advancements in AI and machine learning to improve existing models and develop new methodologies.
- AI Model Deployment, Monitoring & Model Governance: Deploy AI models into production, monitor performance, and adjust to meet accuracy and regulatory requirements.
- Innovation Projects: Lead pilot projects to test and implement new AI technologies within the organization.
- Data Analysis and Interpretation: Extract insights from complex datasets, identify patterns, and inform strategic decision-making.
- Machine Learning Model Development: Design, develop, and train models using supervised, unsupervised, deep learning, and reinforcement learning techniques.
- Agentic Workflows Implementation: Develop workflows that utilize AI agents for autonomous task execution and enhanced operational efficiency.
- RAG Pattern Utilization: Employ retrieval-augmented generation patterns to enhance language model outputs with external knowledge.
- Model Fine-Tuning: Fine-tune pre-trained models to ensure optimal performance and relevance for specific tasks.
- Data Cleaning and Preprocessing: Prepare and clean data by handling missing values and removing outliers.
- Collaboration: Work closely with cross-functional teams including software engineers, product managers, and business analysts.
- Documentation and Reporting: Create comprehensive documentation of models, methodologies, and results for non-technical stakeholders.
- Mentorship: Mentor and coach newer data scientists.
- Business Partnership: Partner with business and technology teams to build ML models that improve star ratings, reduce care gaps, and meet business objectives.
- Presentation: Present complex analytical information clearly to various audiences.
- Project Management: Collaborate with analytics teams to assign and manage analytical project delivery.
- Other Duties: Adapt to changing business requirements by identifying innovative data and technology solutions.
- Industry Insight: Use a broad range of tools to extract insights from current industry or sector trends.
Required Qualifications
- Master's Degree in Computer Science, Data Science, Statistics, or a related field.
- 10+ years' work experience as a data scientist, preferably in a healthcare environment (candidates with relevant experience from other industries will be considered).
- Knowledge of big data technologies (e.g., Hadoop, Spark) and relational database concepts.
- Understanding of SDLC concepts and the ability to bring order to unstructured problems.
- Strong technical proficiency with programming languages such as Python and R, and experience with machine learning frameworks like TensorFlow, Keras, or PyTorch.
- Excellent understanding of statistical methods and machine learning algorithms (e.g., k-NN, Naive Bayes, SVM, and neural networks).
- Familiarity with designing and implementing agentic workflows for autonomous operations.
- Knowledge of retrieval-augmented generation techniques to enhance AI outputs.
- Proven experience in fine-tuning models to meet specific performance metrics.
- Proficiency in data visualization tools (e.g., Tableau, Power BI) to present complex insights.
- Experience with SQL and NoSQL databases, data warehousing, and ETL processes.
- Strong analytical and problem-solving skills.
Preferred Qualifications
- PhD or additional experience.
- Experience with cloud platforms (e.g., Databricks, Snowflake, Azure AI Studio) for AI workflows and model deployment.
- Familiarity with natural language processing (NLP) and computer vision techniques.
Benefits & Perks
- Compensation: Pay Range: $117,731 - $275,491 / ANNUAL (actual compensation may vary based on geographic location, work experience, education and/or skill level).
- Employment Type: Full Time.
- Benefits: Molina Healthcare offers a competitive benefits package.
Required Skills
Mentorship
NoSQL
Keras
R
Leadership
Machine Learning
PyTorch
Python
Big Data (Hadoop, Spark)
TensorFlow
Cloud Platforms
Model Fine-Tuning
Generative AI
RAG Techniques
Statistical Analysis
SQL
Agentic Workflows
Data Visualization
Deep Learning