Browse 42 exciting Learning Evaluation jobs hiring now. Check out companies hiring, such as Adtalem Global Education, DimRed, and Awesome Motive, in Oceanside, Grand Rapids, and Newport News.
Adtalem is hiring a Market Intelligence & AI Insight Strategist to track AI and education trends, evaluate technologies and vendors, and deliver concise, executive-quality intelligence to guide enterprise AI initiatives.
Lead frontier LLM experimentation and productionize interpretable AI agent workflows as an early research scientist at DimRed.
Lead research-to-production work on LLM reasoning, agent decision-making, and interpretability at an early-stage startup building scalable AI automation.
Handshake AI is hiring a Summer 2026 PhD research intern to develop LLM post-training, evaluation, or data-efficiency methods that can be pushed into production and prepared for publication.
Provide per-diem occupational therapy at OhioHealth's new Neuro Transitional Center in Dublin, delivering intensive, team-based rehab for patients with acquired brain or spinal cord injuries.
Lead a cross-disciplinary data science and ML team to deliver LLM-driven solutions, scalable pipelines, and enterprise analytics for Netflix's Content organization.
Lead high-impact, product-aligned experiments on foundation models using PyTorch and distributed training to improve real-world customer outcomes at Liquid AI.
Work on the Metrics Team to design evaluation frameworks and data pipelines that quantify and improve the safety and performance of WeRide's autonomous driving systems.
Percepta is seeking experienced Machine Learning Engineers to design, deploy, and optimize production LLM agents and ML pipelines while working directly with enterprise customers to deliver high-impact AI solutions.
Lawrence Livermore National Laboratory is hiring an Instructional Designer to modernize and manage ES&H training using e-learning, simulations, multimedia, and regulatory-aligned instructional strategies.
Lead and grow Compa’s inaugural Applied AI team, driving production ML systems and MLOps practices to power enterprise compensation intelligence.
Lead large-scale LLM training and synthetic data pipelines at Periodic Labs to build scientifically knowledgeable models and scale training across supercomputing infrastructure.
Lead the AI product strategy for an enterprise cloud data protection platform, turning real-world customer needs into high-impact, AI-enabled product features and commercial launches.
Aprio is looking for a Senior Learning Experience Designer to develop engaging, compliant learning programs and oversee end-to-end instructional design and delivery across virtual, blended, and in-person modalities.
Cognia is hiring an experienced Chief Improvement Officer to drive strategy, operational excellence, and global growth for its Evaluation & Improvement Services.
Adtalem is seeking a Senior Analyst, Market Intelligence & Insights to lead always-on research and translate AI and edtech competitive intelligence into actionable insights and executive briefings for enterprise AI strategy.
Welocalize is hiring an on-site Data Analyst in South Bay, CA to perform natural language data annotation, QA, and support ML model improvements for NLP products.
Serve as the Leadership Program Coordinator at BYU's Sorensen Center, coordinating leadership programs and events while mentoring student employees and strengthening campus engagement.
Zillow's Agentic AI team is hiring a Machine Learning Engineer to design, train, evaluate, and ship agentic LLM solutions that improve user understanding and decision-making across the home search experience.
Atlassian is hiring a Senior Machine Learning Engineer to design and ship LLM-driven features and scalable ML pipelines for its DevAI product suite.
Work on the core intelligence at a seed-stage startup, designing experiments, optimizing inference, and building training and eval systems that turn messy UI and behavioral data into production-ready models.
Join Ataraxis AI as a Research Engineer (Data Science) to advance AI-driven precision oncology through rigorous data pipelines, reproducible research, and publication-grade scientific contributions.
Robinhood's Agentic ML team is hiring a Machine Learning Engineer Intern to prototype agent development tools, run scalable experiments, and support production evaluation and fine-tuning.
Beyondsoft is hiring a Data Analyst to prepare training data, anonymize documents, and validate LLM/model outputs for AI projects in a remote US-based role.
Lead Ashland County’s 4-H program for UW–Madison Extension, directing program administration, volunteer management, and youth education to grow quality 4-H experiences.
Work as a Machine Learning Research Scientist at The Client to design rigorous evaluation experiments and metrics that advance understanding of LLM behavior and human preference signals.
MLabs, a fast-growing research lab supporting foundation model teams, is hiring a Senior Research Engineer to develop scalable RL recipes, modular environments, and production-ready data pipelines for post-training.
WeRide seeks an AI Simulation Engineer to design AI-based simulation scenarios and agent behaviors that validate and accelerate autonomous vehicle algorithms.
Canva is hiring a Senior Research Engineer to build agentic, multimodal evaluation systems that automatically assess and improve the quality and human alignment of generative design models.
Eigenplane is hiring a Founding AI Research Scientist to drive LLM and agent research into scalable, interpretable production systems at an early-stage AI startup.
Lead DELC’s Office of Tribal Affairs to advance Tribal sovereignty and shape statewide early learning policy, funding, and partnerships with Oregon’s nine federally recognized Tribes.
DepthFirst AI is hiring a Research Engineer to develop and evaluate AI agents and training pipelines that discover and exploit software vulnerabilities at scale.
Tessera Labs seeks a Machine Learning Engineer Intern (Fall 2025, Hybrid in San Jose) to build and fine-tune LLM-driven multi-agent pipelines and enterprise tool integrations.
Support a pragmatic clinical trial investigating a mindfulness-based pain management program by coordinating participant interactions, data collection, device setup, and study operations on a per-diem, hybrid basis at BMC.
Abbott is hiring a Manager, Learning & Development to design and measure global key talent programs that build critical skills and align talent strategy with business needs at the Abbott Park HR center.
Lead the design and deployment of cutting-edge 3D computer vision and generative ML models at Dandy to automate and improve dental manufacturing workflows.
Join a 12-month AI Fellowship at the Gates Foundation to design, prototype, and deploy responsible AI solutions for global health and development while building capacity across program teams.
OpenAI is hiring a Research Engineer/Scientist to advance personality and model-behavior research and integrate novel methods into products used by hundreds of millions of users.
Join a research team building agentic capabilities for ChatGPT, contributing to research, large-scale training, evaluations, and production deployment in a hybrid San Francisco role.
MLabs is hiring a Data Scientist to develop and productionize statistical and machine learning models that detect insurance fraud across workers' compensation and personal injury domains.
Lead research and engineering to build production-ready post-training RL recipes, environments, and evaluation pipelines for a fast-growing startup powering foundation model labs.
Prime Time Consulting is hiring a Level 3 Data Scientist in Maryland to develop and evaluate NLP tokenization and POS annotation solutions for government-focused language datasets.