85 Evaluation Jobs Hiring Now (December 2025)

Senior Machine Learning Engineer

Jobgether Hybrid US

Posted 7 hours ago

Senior Machine Learning Engineer needed to build and deploy scalable, production ML systems that improve healthcare outcomes and operational efficiency.

Executive Director

Acelero, Inc. Hybrid Remote

VIEW

Posted 11 hours ago

Lead and shape the Acelero Charitable Foundation as its Founding Executive Director, driving strategy, fundraising, grantmaking, and partnerships to expand high-quality early childhood opportunities across the U.S.

School Psychologist - Alabama

Parallel Hybrid Remote

VIEW

Posted 11 hours ago

Parallel is hiring a remote School Psychologist to perform psycho-educational evaluations and deliver therapeutic and consultative services to students nationwide while supporting IEP development and multidisciplinary care.

F

FM Approvals Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid FRISCO, Texas

VIEW

Sponsored

F

Senior Research Scientist - Material Flammability, Fire Dynamics and Lithium-ion Battery Safety

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

F

FM Research Cybersecurity Co-op - Summer/Fall 2026

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

School Psychologist - Indiana

Parallel Hybrid Remote

VIEW

Posted 11 hours ago

Parallel is hiring remote, licensed School Psychologists in Indiana to deliver psycho-educational evaluations, IEP development, and MTSS-aligned psychological services to support student success.

School Psychologist - Ohio

Parallel Hybrid Remote

VIEW

Posted 11 hours ago

Provide remote psycho-educational evaluations and school psychology services for students with IEPs as a licensed school psychologist in Ohio with Parallel's Provider Network.

Prompt Engineering Intern

Anduril Industries Hybrid Costa Mesa, California, United States

VIEW

Posted 18 hours ago

Anduril's Thunderforge team is hiring a Prompt Engineering Intern to develop prompts, agent graph architectures, and test/evaluation tooling for AI-enabled wargaming.

Data Scientist, AI Agent

Replit Hybrid Foster City

VIEW

Posted 2 days ago

Inclusive & Diverse

Mission Driven

Work/Life Harmony

Diversity of Opinions

Friends Outside of Work

Empathetic

Collaboration over Competition

Fast-Paced

Transparent & Candid

Medical Insurance

Dental Insurance

Vision Insurance

Disability Insurance

Learning & Development

401K Matching

Paid Time-Off

WFH Reimbursements

Paid Holidays

Equity

Flex-Friendly

Lead experimentation, trace analysis, and metric design to measure and improve Replit's AI agent, converting agent traces into product-changing insights for engineering and leadership.

Data Scientist/AI Engineer (Remote)

YouGov Hybrid New York, United States of America

VIEW

Posted 2 days ago

YouGov seeks a hands-on Data Scientist/AI Engineer to build and deploy LLM-based applications and advanced analytics for market research using survey, census, and behavioral datasets.

Staff Engineer, Applied AI

Zapier Hybrid San Francisco

VIEW

Posted 3 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Lead the design and delivery of Zapier’s unified AI platform as a Staff Applied AI Engineer, shaping runtime, orchestration, and evaluation systems that power the company’s AI products.

QA Engineer for Generative AI

Jump Hybrid Salt Lake City

VIEW

Posted 3 days ago

Jump seeks a US-based QA Engineer to own AI evaluation, labeling campaigns, and QA processes that improve generative AI outputs for our meeting assistant product.

F

Sr. Research Engineer - Electrical/Power Generation - Design, operation, maintenance of electrical/power generation equipment

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

F

FM IT/OT Infrastructure & Security Co‑op - Winter/Spring 2026

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

F

Senior Research Engineer – Mechanical - Rotating Machinery

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

Research & Evaluation Internship (Unpaid)

Passion for Life, Inc. Hybrid Passion for Life, Inc Atlanta, Atlanta, Georgia, United States

VIEW

Posted 3 days ago

Passion for Life, a nonprofit helping under-resourced youth build career pathways, seeks a part-time Research & Evaluation Intern to support program measurement, data collection, and impact reporting.

Contract Life Cycle Management & Technology Expert (Senior Director, Analyst – Fully Remote United States)

Gartner Hybrid Remote - United States

VIEW

Posted 3 days ago

Lead Gartner’s CLM research and advisory efforts by producing market-leading insights, advising General Counsel and legal operations leaders, and evaluating CLM technology and vendor strategies.

Business Analyst

AECOM Hybrid Sacramento, CA

VIEW

Posted 3 days ago

AECOM is hiring a Business Analyst in Sacramento to define business and technical requirements, design workflows, and lead validation for an enterprise document control system.

Support Specialist I (Youth Programming) Full Time Days

Northwestern Memorial Healthcare Hybrid 541 N. Fairbanks, Chicago, IL

VIEW

Posted 3 days ago

Northwestern Medicine is hiring a Support Specialist I to coordinate youth programming, community outreach, and partnership activities across its service area.

Senior Product Manager - AI Systems & Context

MagicSchool AI Hybrid Remote

VIEW

Posted 3 days ago

Lead product strategy and execution for context, memory, and retrieval systems that power MagicSchool’s AI agents to deliver reliable, educator-focused assistance at scale.

Prompt Engineer

Mursion, Inc Hybrid No location specified

VIEW

Posted 3 days ago

Mursion is hiring a Prompt Engineer to craft production-grade LLM prompts, manage RAG/JSON workflows, and translate learning objectives into reliable AI-driven simulation behavior.

San Diego Regional Program Manager

HealthCorps Hybrid No location specified

VIEW

Posted 4 days ago

HealthCorps seeks a motivated Regional Program Manager in San Diego to lead school-based wellness initiatives, supervise near-peer mentors, and grow community partnerships to improve teen health outcomes.

s

Engineering Intern (AI Ethics)

sonyglobal Hybrid Remote - California

VIEW

Posted 4 days ago

Sony AI's Research Ethics team is hiring a remote Engineering Intern (AI Ethics) to help build agentic AI infrastructure, run LLM evaluations, and develop tools for responsible AI in a research-driven environment.

S

School Psychologist - Contract job description

Sankofa Montessori Hybrid No location specified

VIEW

Posted 4 days ago

Sankofa Montessori seeks a Georgia-certified School Psychologist for an evaluation-only contract role conducting psychoeducational assessments, producing legally compliant reports, and advising teams on special education eligibility.

Applied AI Engineer

PermitFlow Hybrid New York City

VIEW

Posted 4 days ago

Build and productionize multi-step AI agents and the backend infrastructure that powers PermitFlow’s pre-construction platform in a fast-moving, hybrid NYC startup.

F

Senior Research Scientist – Computational Wind Engineering

FM Hybrid NORWOOD, Massachusetts

VIEW

Sponsored

F

FM Approvals Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid ALPHARETTA, Georgia

VIEW

Sponsored

F

FM Approvals Research Campus Engineering Technician - Materials

FM Hybrid WEST GLOCESTER, Rhode Island

VIEW

Sponsored

Product Manager, AI

Arcade Hybrid Presidio

VIEW

Posted 5 days ago

Lead development of Arcade’s conversational AI product creation agent as the company’s first dedicated Product Manager for AI, reporting directly to the CEO.

Forward Deployed Engineer - AI Solutions

Pear VC Hybrid New York City

VIEW

Posted 6 days ago

Atrix is seeking a New York–based Forward Deployed Engineer to embed with enterprise pharma customers and deliver accurate, trusted AI workflows that drive medical and commercial decision-making.

Vice President of Collective Action, K12 & Youth Development

America's Promise Alliance Hybrid Remote

VIEW

Posted 6 days ago

America's Promise Alliance seeks a seasoned nonprofit leader to direct collective action, member engagement, program design, and fundraising for its Aligning K12 Education and Youth Development issue area.

VISITING PSYCHIATRIC SERVICES (VPS) DATA ANALYST

City of New York Hybrid New York City, NY

VIEW

Posted 7 days ago

The Department of Social Services is hiring a City Research Scientist I (VPS Data Analyst) to manage VPS program data, produce analytic reports, and support evaluation and linkage-to-care efforts for people experiencing homelessness.

Staff Software Engineer, Agent Orchestration

Decagon Hybrid San Francisco

VIEW

Posted 9 days ago

Lead the architecture and long-term evolution of Decagon’s agent orchestration engine to enable reliable, high-performance AI agent behavior at scale.

K

Deputy Director - Nonprofit Youth Health

Keller Executive Search Hybrid No location specified

VIEW

Posted 10 days ago

Experienced nonprofit leader needed to oversee Arkansas operations, lead state policy and advocacy, and build cross-sector partnerships to advance youth health equity.

Senior Data Analyst, AI Evaluation

Elsevier Hybrid Remote

VIEW

Posted 10 days ago

Elsevier is hiring a Senior Data Analyst to lead analytics and evaluation frameworks for generative AI models used in healthcare, ensuring accuracy, safety, and clinical relevance.

Senior Project Manager

GLIDE® Hybrid San Francisco

VIEW

Posted 11 days ago

GLIDE seeks a Senior Project Manager to lead pilot programs and cross-functional projects that advance its mission to alleviate suffering and break cycles of poverty and marginalization.

BAML Engineer

Vetcove Hybrid Remote

VIEW

Posted 11 days ago

Mission Driven

Inclusive & Diverse

Growth & Learning

Transparent & Candid

Medical Insurance

Dental Insurance

Vision Insurance

401K Matching

Flex-Friendly

Equity

Vetcove seeks an AI-focused BAML Engineer to design, implement, and maintain BAML-driven LLM workflows and evaluation tooling for its veterinary software platform.

Exempt to Permanent - Senior Community Development Specialist I (9774) Citywide (E161826)

City and County of San Francisco Hybrid 1650 Mission St, San Francisco, CA 94103, USA

VIEW

Posted 11 days ago

The City and County of San Francisco seeks a Senior Community Development Specialist I to manage funding, monitor compliance, and evaluate community development projects across city departments.

F

Field Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid MALVERN, Pennsylvania

VIEW

Sponsored

J

Developer Relations Engineer

Judgment Labs Hybrid San Francisco

VIEW

Posted 11 days ago

Help developers adopt Judgment Labs' SDK and evaluation tools by building docs, demos, and sample agent setups as a Developer Relations Engineer in San Francisco.

J

Technical Writer

Judgment Labs Hybrid San Francisco

VIEW

Posted 11 days ago

Be part of a San Francisco-based venture-backed team as a Technical Writer crafting deep technical content on agent evaluation, monitoring, and reward modeling for a technical audience.

AI Evaluation Nursing Expert

Elsevier Hybrid Remote

VIEW

Posted 11 days ago

Elsevier seeks a Clinical AI Evaluation Specialist (RN, MSN) to lead evaluation cycles for generative AI in nursing education, ensuring data integrity and educational alignment to improve clinical outcomes.

Director of the Center for the Advancement of Art Education

The Art of Education University Hybrid Remote

VIEW

Posted 11 days ago

Lead AOEU's new Center for the Advancement of Art Education to drive research, partnerships, and practice that elevate arts education at a national scale.

C

Freelance Luxury Brand Evaluator - Costa Mesa, CA

CXG Hybrid No location specified

VIEW

Posted 12 days ago

Freelance evaluators assess luxury retail and online experiences for top brands, completing short missions and submitting feedback via CXG's mobile platform.

O

Research Scientist

Oumi Hybrid New York

VIEW

Posted 12 days ago

Oumi seeks a Research Scientist to advance open-source LLM and VLM research by developing models, datasets, benchmarks, and publishing results with the community.

Senior Project Engineer

Moog Inc. Hybrid Buffalo, NY

VIEW

Posted 13 days ago

Moog SDG seeks a Senior Project Engineer in Buffalo, NY to lead technical execution, cross-functional teams, and customer-facing aspects of development programs within the Mission Enabling Services Group.

Technical Product Manager, AI

webAI Hybrid Austin

VIEW

Posted 13 days ago

Lead the strategy and delivery of distributed inference, LLM integrations, and on-device ML features at webAI to enable privacy-first, enterprise-grade AI on the edge.

Staff machine learning engineer

Watershed Hybrid No location specified

VIEW

Posted 13 days ago

Experienced ML/AI engineer needed to lead development and productionization of LLM- and embedding-based features for Watershed's enterprise sustainability platform.

Transfer Evaluation Assistant

Western Governors University Hybrid Remote

VIEW

Posted 14 days ago

WGU seeks a meticulous Transfer Evaluation Assistant to evaluate transcripts, maintain student documentation, and ensure policy compliance in a remote role supporting prospective students.

A

Director of Applied Research & Institutional Impact

Achieving the Dream Hybrid Remote

VIEW

Posted 14 days ago

Achieving the Dream seeks a seasoned research leader to direct applied research, evaluation, and analytics initiatives that drive institutional change and improve student outcomes across its network.

Director, Partnerships

Kiddom Hybrid Remote

VIEW

Posted 14 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Paid Holidays

Lead Kiddom’s strategic alliance efforts to identify, evaluate, and operationalize high-impact partnerships that accelerate the company’s growth in K–12 education.

Senior Software Engineer, AI Platform

Vanta Hybrid No location specified

VIEW

Posted 15 days ago

Inclusive & Diverse

Growth & Learning

Customer-Centric

Collaboration over Competition

Medical Insurance

Maternity Leave

Flex-Friendly

401K Matching

Lead design and implementation of scalable AI infrastructure and developer tooling to accelerate Vanta’s AI-powered product initiatives.

Senior Software Engineer, AI Product

Vanta Hybrid No location specified

VIEW

Posted 15 days ago

Inclusive & Diverse

Growth & Learning

Customer-Centric

Collaboration over Competition

Medical Insurance

Maternity Leave

Flex-Friendly

401K Matching

Lead applied AI product work at Vanta by designing, shipping, and scaling LLM-powered features that accelerate customer compliance and trust.

Red Teaming Domain Expert - AI Training (Contract)

Handshake Hybrid Remote

VIEW

Posted 15 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake AI is hiring a contract Red Teaming Domain Expert to craft adversarial prompts and stress-test LLMs for safety and robustness across real-world edge cases.

Technical Program Manager, AI Platform

Figma Hybrid Remote

VIEW

Posted 16 days ago

Empathetic

Collaboration over Competition

Growth & Learning

Passion for Exploration

Fast-Paced

Startup Mindset

Diversity of Opinions

Rise from Within

Figma is hiring a seasoned Technical Program Manager to drive AI platform programs that scale annotation, evaluation, and model delivery across engineering, research, and product teams.

Member of Technical Staff (AI Engineering)

Awesome Motive Hybrid San Francisco

VIEW

Posted 16 days ago

An AI engineering role focused on building and improving voice-first and omnichannel credit-servicing agents using Python and integrated language models at an early-stage fintech startup.

Program Coordinator - Oxford College of Emory University

Jobgether Hybrid No location specified

VIEW

Posted 16 days ago

Oxford College at Emory University is hiring a Program Coordinator to plan and manage student engagement and leadership programs, including events, budgets, and cross-departmental collaboration.

Senior Compensation Analyst

Western Governors University Hybrid Salt Lake City

VIEW

Posted 16 days ago

WGU is hiring a Senior Compensation Analyst to design and manage global compensation programs and deliver strategic analysis that supports competitive pay and organizational goals.

Senior Backend Engineer, Evals and AI Infra

Commure Hybrid Mountain View

VIEW

Posted 17 days ago

Join Commure's Ambient Scribe team as a Senior Backend Engineer to build and scale eval and AI infrastructure that powers next-generation clinical AI products.

Below 50k* 0 0%
50k-100k* 5 24%
Over 100k* 16 76%

Evaluation Jobs

How much do evaluation jobs pay?

Top companies hiring for evaluation jobs

Best cities to find evaluation jobs

Sign up for our weekly newsletter of fresh jobs

Sign up for our weekly
newsletter of fresh jobs