Learning Evaluation Jobs

Posted 10 days ago

Lead frontier LLM experimentation and productionize interpretable AI agent workflows as an early research scientist at DimRed.

Founding AI Research Scientist

Awesome Motive Hybrid San Francisco

Posted 10 days ago

Lead research-to-production work on LLM reasoning, agent decision-making, and interpretability at an early-stage startup building scalable AI automation.

Certified Home Access Consultant

MobilityWorks Regular Full-Time READING, Pennsylvania

Sponsored

Engineering Manager, R&D

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

U

Insurance Professional - Sales and Service

USAA Full-Time COLORADO SPRINGS, Colorado

Sponsored

Handshake AI Research Intern, Summer 2026

Handshake Hybrid San Francisco

Posted 11 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake AI is hiring a Summer 2026 PhD research intern to develop LLM post-training, evaluation, or data-efficiency methods that can be pushed into production and prepared for publication.

O

Occupational Therapist - PRN

OhioHealth Neuro Transitional Center Hybrid Dublin

Posted 11 days ago

Provide per-diem occupational therapy at OhioHealth's new Neuro Transitional Center in Dublin, delivering intensive, team-based rehab for patients with acquired brain or spinal cord injuries.

Machine Learning Manager - Content Enterprise

Netflix Hybrid USA - Remote

Posted 12 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Customer-Centric

Fast-Paced

Growth & Learning

Medical Insurance

Dental Insurance

401K Matching

Paid Time-Off

Maternity Leave

Paternity Leave

Mental Health Resources

Flex-Friendly

Lead a cross-disciplinary data science and ML team to deliver LLM-driven solutions, scalable pipelines, and enterprise analytics for Netflix's Content organization.

L

Member of Technical Staff - Applied ML Scientist

Liquid AI Hybrid No location specified

Posted 12 days ago

Lead high-impact, product-aligned experiments on foundation models using PyTorch and distributed training to improve real-world customer outcomes at Liquid AI.

New Grads 2026 - Data Engineer

WeRide.ai Hybrid San Jose, CA

Posted 12 days ago

Work on the Metrics Team to design evaluation frameworks and data pipelines that quantify and improve the safety and performance of WeRide's autonomous driving systems.

Machine Learning Engineer

Percepta Hybrid New York City

Posted 12 days ago

Percepta is seeking experienced Machine Learning Engineers to design, deploy, and optimize production LLM agents and ML pipelines while working directly with enterprise customers to deliver high-impact AI solutions.

Instructional Designer

LLNL Hybrid Livermore, CA, USA

Posted 12 days ago

Lawrence Livermore National Laboratory is hiring an Instructional Designer to modernize and manage ES&H training using e-learning, simulations, multimedia, and regulatory-aligned instructional strategies.

Product Development Manager

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

A

Maintenance Specialist

Advanced Technology Services Hybrid CANUTILLO, Texas

Sponsored

Product Development Engineer

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

C

Engineering Manager, Applied AI

Compa Hybrid No location specified

Posted 12 days ago

Lead and grow Compa’s inaugural Applied AI team, driving production ML systems and MLOps practices to power enterprise compensation intelligence.

P

Research Engineer - Midtraining

Periodic Labs Hybrid Menlo Park

Posted 12 days ago

Lead large-scale LLM training and synthetic data pipelines at Periodic Labs to build scientifically knowledgeable models and scale training across supercomputing infrastructure.

Product Manager, AI (Remote - US)

Jobgether Hybrid No location specified

Posted 14 days ago

Lead the AI product strategy for an enterprise cloud data protection platform, turning real-world customer needs into high-impact, AI-enabled product features and commercial launches.

Senior Learning Experience Designer

Aprio Hybrid No location specified

Posted 14 days ago

Aprio is looking for a Senior Learning Experience Designer to develop engaging, compliant learning programs and oversee end-to-end instructional design and delivery across virtual, blended, and in-person modalities.

Chief Improvement Officer

The Renaissance Network Hybrid Remote

Posted 18 days ago

Cognia is hiring an experienced Chief Improvement Officer to drive strategy, operational excellence, and global growth for its Evaluation & Improvement Services.

Senior Analyst- Market Intelligence & Insights

Adtalem Global Education Hybrid Columbia, MD, USA

Posted 20 days ago

Adtalem is seeking a Senior Analyst, Market Intelligence & Insights to lead always-on research and translate AI and edtech competitive intelligence into actionable insights and executive briefings for enterprise AI strategy.

Data Annotation Expert with Linguistic Experience - South Bay, CA (On Site)

Welocalize Hybrid No location specified

Posted 20 days ago

Welocalize is hiring an on-site Data Analyst in South Bay, CA to perform natural language data annotation, QA, and support ML model improvements for NLP products.

Leadership Program Coordinator - Sorensen Center

Brigham Young University (BYU) Hybrid Provo, UT

Posted 20 days ago

Serve as the Leadership Program Coordinator at BYU's Sorensen Center, coordinating leadership programs and events while mentoring student employees and strengthening campus engagement.

Machine Learning Engineer, Agentic AI – User Understanding

Zillow Hybrid Remote-USA

Posted 21 days ago

Inclusive & Diverse

Customer-Centric

Mission Driven

Fast-Paced

Growth & Learning

Transparent & Candid

Diversity of Opinions

Work/Life Harmony

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Learning & Development

Fitness Stipend

401K Matching

Equity

Life insurance

Disability Insurance

WFH Reimbursements

Flex-Friendly

Paid Time-Off

Maternity Leave

Paternity Leave

Paid Holidays

Paid Volunteer Time

Sabbatical

Zillow's Agentic AI team is hiring a Machine Learning Engineer to design, train, evaluate, and ship agentic LLM solutions that improve user understanding and decision-making across the home search experience.

Senior Machine Learning Engineer

Atlassian Hybrid San Francisco

Posted 21 days ago

Customer-Centric

Empathetic

Collaboration over Competition

Feedback Forward

Inclusive & Diverse

Mission Driven

Diversity of Opinions

Rise from Within

Medical Insurance

Paid Time-Off

Dental Insurance

Vision Insurance

Maternity Leave

Mental Health Resources

Equity

401K Matching

Employee Resource Groups

Performance Bonus

Education Stipend

Life insurance

Atlassian is hiring a Senior Machine Learning Engineer to design and ship LLM-driven features and scalable ML pipelines for its DevAI product suite.

Senior Electrical Engineer

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

F

Senior Research Scientist – Computational Wind Engineering

FM Hybrid NORWOOD, Massachusetts

Sponsored

A

Regional Field Service Maintenance Technician

Advanced Technology Services Hybrid GREENVILLE, South Carolina

Sponsored

Member of Technical Staff - Research

Awesome Motive Hybrid San Francisco

Posted 21 days ago

Work on the core intelligence at a seed-stage startup, designing experiments, optimizing inference, and building training and eval systems that turn messy UI and behavioral data into production-ready models.

A

Research Engineer (Data Science)

Ataraxis AI Hybrid New York

Posted 22 days ago

Join Ataraxis AI as a Research Engineer (Data Science) to advance AI-driven precision oncology through rigorous data pipelines, reproducible research, and publication-grade scientific contributions.

Machine Learning Engineer Intern, Agentic ML (Summer 2026)

Robinhood Hybrid Menlo Park, CA

Posted 22 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Dare to be Different

Reward & Recognition

Fast-Paced

Maternity Leave

Paternity Leave

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Paid Holidays

Paid Sick Days

Paid Time-Off

Learning & Development

Social Gatherings

A Machine Learning Engineer Intern to join Robinhood's Agentic ML team to prototype agent development tools, run scalable experiments, and support production evaluation and fine-tuning.

B

Data Analyst

Beyondsoft Consulting Hybrid United States (Remote)

Posted 27 days ago

Beyondsoft is hiring a Data Analyst to prepare training data, anonymize documents, and validate LLM/model outputs for AI projects in a remote US-based role.

W

4-H PROGRAM EDUCATOR

Wisconsin Hybrid Ashland, WI

Posted 27 days ago

Lead Ashland County’s 4-H program for UW–Madison Extension, directing program administration, volunteer management, and youth education to grow quality 4-H experiences.

Research Scientist

Lead Allies Inc Hybrid San Francisco

Posted 29 days ago

Work as a Machine Learning Research Scientist at The Client to design rigorous evaluation experiments and metrics that advance understanding of LLM behavior and human preference signals.

Senior Research Engineer, LLM Data

MLabs Hybrid No location specified

Posted 30 days ago

MLabs, a fast-growing research lab supporting foundation model teams, is hiring a Senior Research Engineer to develop scalable RL recipes, modular environments, and production-ready data pipelines for post-training.

AI Simulation engineer

WeRide.ai Hybrid San Jose, CA

Posted last month

WeRide seeks an AI Simulation Engineer to design AI-based simulation scenarios and agent behaviors that validate and accelerate autonomous vehicle algorithms.

Senior Research Engineer - Evaluations

Canva Hybrid San Francisco, CA

Posted last month

Inclusive & Diverse

Diversity of Opinions

Passion for Exploration

Dare to be Different

Empathetic

Growth & Learning

Paid Holidays

Medical Insurance

Equity

401K Matching

Learning & Development

Social Gatherings

Flex-Friendly

Maternity Leave

Paternity Leave

Sabbatical

Canva is hiring a Senior Research Engineer to engineer agentic, multimodal evaluation systems that automatically assess and improve the quality and human alignment of generative design models.

E

Founding AI Research Scientist

Eigenplane Hybrid San Francisco

Posted last month

Eigenplane is hiring a Founding AI Research Scientist to drive LLM and agent research into scalable, interpretable production systems at an early-stage AI startup.

F

Research Area Director - Physical Systems Reliability & Resilience

FM Hybrid NORWOOD, Massachusetts

Sponsored

U

Licensed P&C Insurance Professional - Sales and Service (Signing Bonus)

USAA Full-Time TAMPA, Florida

Sponsored

Certified Home Access Consultant

MobilityWorks Regular Full-Time PHILADELPHIA, Pennsylvania

Sponsored

O

Tribal Affairs Director (Business Operations Manager 1)

Oregon Hybrid Salem | DELC | Summer Street

Posted last month

Lead DELC’s Office of Tribal Affairs to advance Tribal sovereignty and shape statewide early learning policy, funding, and partnerships with Oregon’s nine federally recognized Tribes.

D

Research Engineer (multiple positions)

DepthFirst Hybrid No location specified

Posted last month

DepthFirst AI is hiring a Research Engineer to develop and evaluate AI agents and training pipelines that discover and exploit software vulnerabilities at scale.

T

Machine Learning Engineer Intern (Fall 2025, Hybrid in San Jose, CA)

Tessera Labs Hybrid San Jose

Posted last month

Tessera Labs seeks a Machine Learning Engineer Intern (Fall 2025, Hybrid in San Jose) to build and fine-tune LLM-driven multi-agent pipelines and enterprise tool integrations.

Research Coordinator, General Internal Medicine (per diem)

BMC Hybrid Boston

Posted last month

Support a pragmatic clinical trial investigating a mindfulness-based pain management program by coordinating participant interactions, data collection, device setup, and study operations on a per-diem, hybrid basis at BMC.

Manager Learning & Development

Abbott Hybrid United States - Illinois - Abbott Park

Posted last month

Abbott is hiring a Manager, Learning & Development to design and measure global key talent programs that build critical skills and align talent strategy with business needs at the Abbott Park HR center.

D

Senior Machine Learning Engineer II

Dandy Hybrid No location specified

Posted last month

Lead the design and deployment of cutting-edge 3D computer vision and generative ML models at Dandy to automate and improve dental manufacturing workflows.

G

AI Fellows (12-month LTE*)

Gates Foundation Hybrid Seattle, WA

Posted last month

A 12-month AI Fellowship at the Gates Foundation to design, prototype, and deploy responsible AI solutions for global health and development while building capacity across program teams.

Research Engineer / Scientist, Personality and Model Behavior

OpenAI Hybrid San Francisco

Posted last month

Inclusive & Diverse

Feedback Forward

Collaboration over Competition

Growth & Learning

OpenAI is hiring a Research Engineer/Scientist to advance personality and model-behavior research and integrate novel methods into products used by hundreds of millions of users.

Research Engineer / Research Scientist - ChatGPT Agent

OpenAI Hybrid San Francisco

Posted last month

Inclusive & Diverse

Feedback Forward

Collaboration over Competition

Growth & Learning

Join a research team building agentic capabilities for ChatGPT, contributing to research, large-scale training, evaluations, and production deployment in a hybrid San Francisco role.

Data Scientist

MLabs Hybrid No location specified

Posted last month

MLabs is hiring a Data Scientist to develop and productionize statistical and machine learning models that detect insurance fraud across workers' compensation and personal injury domains.

F

Field Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid WESTMINSTER, Maryland

Sponsored

Engineering Manager, R&D

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

Service Mechanic

MobilityWorks Regular Full-Time PLAIN CITY, Ohio

Sponsored

T

Founding Senior Research Engineer

The LLM Data Company Hybrid San Francisco

Posted last month

Lead research and engineering to build production-ready post-training RL recipes, environments, and evaluation pipelines for a fast-growing startup powering foundation model labs.

P

Data Scientist 3 (Maryland)

Prime Time Consulting Hybrid Annapolis Junction, Maryland