Browse 36 exciting jobs hiring in Research Evaluation now. Check out companies hiring such as Adtalem Global Education, Emory University, DimRed in Jackson, Irvine, Aurora.
Adtalem is hiring a Market Intelligence & AI Insight Strategist to track AI and education trends, evaluate technologies and vendors, and deliver concise, executive-quality intelligence to guide enterprise AI initiatives.
Experienced program leader needed to direct strategic program development, manage grants and budgets, and cultivate community and institutional partnerships for Emory's Department of Gynecology and Obstetrics.
Lead frontier LLM experimentation and productionize interpretable AI agent workflows as an early research scientist at DimRed.
Academic pediatric center seeks a board-certified/eligible Child Abuse Pediatrician to join a multidisciplinary CAP team delivering clinical care, education, and forensic consultation across inpatient, ED, and outpatient settings.
Lead research-to-production work on LLM reasoning, agent decision-making, and interpretability at an early-stage startup building scalable AI automation.
Handshake AI is hiring a Summer 2026 PhD research intern to develop LLM post-training, evaluation, or data-efficiency methods that can be pushed into production and prepared for publication.
Lead high-impact, product-aligned experiments on foundation models using PyTorch and distributed training to improve real-world customer outcomes at Liquid AI.
Sentient seeks research scientists/engineers to design and implement novel fine-tuning and agentic techniques that advance long-horizon reasoning and strategic multi-agent decision making for an open-source AGI platform.
Experienced wet-lab biology PhDs are needed to assess and annotate experimental failure modes and recommend mitigations for an AI research benchmark.
Lead large-scale LLM training and synthetic data pipelines at Periodic Labs to build scientifically knowledgeable models and scale training across supercomputing infrastructure.
Lead commercial growth for NIQ’s Brand and Advertising Research practice in North America by driving adoption of brand and communications tracking through strategic client engagement and high-impact proposals.
Lead a multidisciplinary team at Texas A&M AgriLife Research as Program Director to manage, grow, and deliver landscape-scale natural resource conservation programs supported by research, extension, and policy work.
Lead applied research to design, evaluate, and productionize agentic systems that replace workflows and enable fully autonomous businesses.
Lead mixed-methods implementation and impact research to improve K-12 curricula, inform product and professional learning decisions, and mentor junior researchers at Great Minds.
Adtalem is seeking a Senior Analyst, Market Intelligence & Insights to lead always-on research and translate AI and edtech competitive intelligence into actionable insights and executive briefings for enterprise AI strategy.
Work on the core intelligence at a seed-stage startup, designing experiments, optimizing inference, and building training and eval systems that turn messy UI and behavioral data into production-ready models.
Carnegie Mellon University is hiring an Institutional Research and Student Affairs Assessment Specialist to design and execute research and assessment projects, analyze institutional and student affairs data, and deliver actionable insights to university leadership and stakeholders.
Join Ataraxis AI as a Research Engineer (Data Science) to advance AI-driven precision oncology through rigorous data pipelines, reproducible research, and publication-grade scientific contributions.
Experienced Project Manager needed to oversee communications-intensive innovation and human-centered design projects for a mission-driven organization supporting Veterans.
Senior Program Analyst (Research/HCD) to lead user-centered research and program evaluation efforts that drive evidence-based improvements in healthcare programs.
Old Dominion University's Jeannine Smith Public Health program is seeking an Assistant Professor (Clinical) in health behavior/promotion to teach both in-person and online, advise students, and maintain an active research and service portfolio.
Work as a Machine Learning Research Scientist at The Client to design rigorous evaluation experiments and metrics that advance understanding of LLM behavior and human preference signals.
MLabs, a fast-growing research lab supporting foundation model teams, is hiring a Senior Research Engineer to develop scalable RL recipes, modular environments, and production-ready data pipelines for post-training.
Lead the creation and evaluation of a member- and provider-focused Medicaid FFS quality assurance system at the Oregon Health Authority, using quantitative and qualitative methods to advance health equity and improve statewide standards.
Canva is hiring a Senior Research Engineer to engineer agentic, multimodal evaluation systems that automatically assess and improve the quality and human alignment of generative design models.
Eigenplane is hiring a Founding AI Research Scientist to drive LLM and agent research into scalable, interpretable production systems at an early-stage AI startup.
DepthFirst AI is hiring a Research Engineer to develop and evaluate AI agents and training pipelines that discover and exploit software vulnerabilities at scale.
Lead the design and evaluation of long-term memory systems for LLMs at an early-stage AI startup focused on building self-improving agents.
Work with a top AI research lab to evaluate and improve LLM performance on advanced economics tasks by providing expert, written feedback.
Help shape next-generation AI by evaluating advanced physics solutions and guiding research teams to improve model performance as a contract Physics AI Trainer.
ICF is hiring a Child & Youth Research Analyst to perform qualitative and quantitative data collection, analysis, and reporting for child welfare and youth-focused research and evaluation projects.
Support a pragmatic clinical trial investigating a mindfulness-based pain management program by coordinating participant interactions, data collection, device setup, and study operations on a per-diem, hybrid basis at BMC.
Lead the design and deployment of cutting-edge 3D computer vision and generative ML models at Dandy to automate and improve dental manufacturing workflows.
OpenAI is hiring a Research Engineer/Scientist to advance personality and model-behavior research and integrate novel methods into products used by hundreds of millions of users.
Join a research team building agentic capabilities for ChatGPT, contributing to research, large-scale training, evaluations, and production deployment in a hybrid San Francisco role.
Lead research and engineering to build production-ready post-training RL recipes, environments, and evaluation pipelines for a fast-growing startup powering foundation model labs.
Below 50k*
0
|
50k-100k*
1
|
Over 100k*
0
|