Job details

Data Scientist (AI Quality & Evaluation)

About the Role

We're looking for a Data Scientist to own the quality, reliability, and trustworthiness of our clinical AI outputs. You'll build the systems that ensure our AI "knows what it doesn't know" — developing evaluation frameworks, calibrated confidence scoring, and automated quality assurance that physicians can actually trust.

What You'll Do

Design and implement automated evaluation pipelines that assess AI output quality, accuracy, and safety at scale
Develop uncertainty quantification systems where confidence scores meaningfully correlate with accuracy
Build comprehensive evaluation frameworks combining automated assessment with clinician-validated test cases
Implement feedback loops that continuously improve model outputs based on validation signals
Establish scalable quality gates that catch errors before they reach end users
Contribute to model alignment and fine-tuning efforts

Qualifications

Required

Strong foundation in deep learning frameworks (PyTorch) and LLM architectures
Experience with model evaluation, benchmarking, and quality metrics
Proficiency in Python and modern ML development tools
Strong statistical foundations
Ability to read, implement, and extend research papers
Excellent communication skills

Preferred

Master's degree in Computer Science, Machine Learning, Statistics, or a related quantitative field (PhD preferred)
Publications in top ML/AI venues (NeurIPS, ICML, ICLR, ACL)
Experience with RLHF, DPO, or preference optimization techniques
Background in healthcare AI or regulated industries
Experience building evaluation systems for production LLM applications

Average salary estimate

$170,000 per year (est.)

Range: $140,000 (min) to $200,000 (max)

Bioscope AI

FUNDING

Early

DEPARTMENTS

Data Science

SENIORITY LEVEL REQUIREMENT

Senior Level
