77 Ai Evaluation Jobs Hiring Now (January 2026)

Senior C# Full-Stack Engineer — AI Data & Infrastructure

Alignerr Hybrid San Francisco

Posted 15 hours ago

Senior C# Full-Stack Engineer (contract, remote) to build high-performance C# systems and full-stack tooling for AI data, evaluation, and annotation pipelines.

Electronics Engineers, Except Computer - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 19 hours ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Contribute your electronics engineering expertise to train and evaluate AI models as a remote contract AI Trainer for Handshake at $120/hr.

Software Engineer, Applied AI

The Mirage Hybrid New York

VIEW

Posted yesterday

Join Mirage's NYC engineering team to build end-to-end applied AI systems that enable new creative experiences for short-form video at scale.

Health Data Scientist - AI & Clinical Data (ARPA-H)

Ripple Effect Hybrid No location specified

VIEW

Posted yesterday

Provide technical leadership to ARPA-H and external performers by designing and validating healthcare datasets and evaluation frameworks for AI-driven rare disease diagnostics.

Community Health Workers - AI Trainer (Contract)

Handshake Hybrid San Francisco

VIEW

Posted yesterday

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks Community Health Worker professionals to provide expert, asynchronous evaluations of AI outputs—no prior AI experience required.

Advertising and Promotions Managers - AI Trainer (Contract)

Handshake Hybrid San Francisco

VIEW

Posted 2 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks seasoned Advertising and Promotions Managers to perform flexible, remote contract work evaluating AI outputs and providing structured feedback to improve models.

Advertising Sales Agents - AI Trainer (Contract)

Handshake Hybrid San Francisco

VIEW

Posted 2 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks advertising sales professionals to work remotely as contract AI Trainers, reviewing model outputs and crafting prompts to improve AI understanding of advertising tasks.

Electrical Engineers - AI Trainer (Contract)

Handshake Hybrid San Francisco

VIEW

Posted 2 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks experienced electrical engineers to work remotely and asynchronously as contract AI trainers, evaluating model outputs and crafting domain-aligned prompts to improve AI understanding of electrical engineering tasks.

Business Teachers, Postsecondary - AI Trainer (Contract)

Handshake Hybrid San Francisco

VIEW

Posted 2 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Contribute your business teaching or professional experience to a remote, flexible AI training program that evaluates model outputs and improves workplace-relevant AI understanding.

Software Engineer - Artificial intelligence

Boeing Hybrid USA - Tukwila, WA

VIEW

Posted 2 days ago

Boeing is looking for Software Engineers specializing in LLMs to design, implement, and integrate AI/ML capabilities into aerospace and autonomy systems at its Tukwila, WA site.

AI Engineer (All Levels)

Fieldguide Hybrid San Francisco

VIEW

Posted 3 days ago

Join Fieldguide as an AI Engineer to design, build, and operate agentic systems and production-ready LLM-powered features for mission-critical audit workflows.

A

Python Insfrastructure Engineer - Model Evaluation

Alignerr Hybrid Los Angeles

VIEW

Posted 3 days ago

Work remotely as a contract Senior Python Full-Stack Engineer to build scalable evaluation and data infrastructure powering model training, benchmarking, and quality assurance at Alignerr.

Lead, Data Operations & Evaluation Engineering

Arcade Hybrid Presidio

VIEW

Posted 3 days ago

Arcade seeks an experienced technical lead to build and run DataOps, annotation, and evaluation systems that power generative AI-driven product design.

A

Data Infrastructure Developer (C++)

Alignerr Hybrid Seattle

VIEW

Posted 3 days ago

Alignerr is hiring a Senior C++ Full-Stack Engineer to design and optimize high-performance C++ systems and full-stack tooling for AI data annotation, validation, and evaluation pipelines.

Software Engineer, AI/ML – San Francisco

Flow Engineering Hybrid San Francisco

VIEW

Posted 3 days ago

Flow Engineering is hiring an AI/ML Software Engineer in San Francisco to build agentic, LLM-driven features that help engineers author, review, and validate complex system requirements.

Senior AI Engineer (Rapid Prototyping & Analytics )

Prompt Hybrid No location specified

VIEW

Posted 4 days ago

Experienced AI engineer needed to rapidly prototype and productionize LLM and ML-driven systems for healthcare-focused products and internal tools at a fast-growing startup.

A

C++ Backend Engineer - AI Data Platforms

Alignerr Hybrid Seattle

VIEW

Posted 5 days ago

Work remotely as a Senior C++ Full-Stack Engineer at Alignerr to build and optimize high-performance C++ services and tooling for AI data pipelines and evaluation workflows.

A

Lead Systems Engineer (Rust) - AI Platform

Alignerr Hybrid Denver

VIEW

Posted 6 days ago

Alignerr is looking for a Senior Rust Full-Stack Engineer to design and optimize high-performance Rust systems and tooling that power AI data pipelines and model evaluation workflows.

A

Systems Programmer - AI Data Pipelines

Alignerr Hybrid Seattle

VIEW

Posted 6 days ago

Senior Rust Full-Stack Engineer needed to build and optimize production-grade AI data pipelines and tooling for model training and evaluation at a remote-first AI infrastructure firm.

AI Engineer

Plasmidsaurus Hybrid South San Francisco

VIEW

Posted 8 days ago

Plasmidsaurus is hiring an AI Engineer to build production LLM-driven bioinformatics agents that turn rapid RNA-seq outputs into actionable biological insights for research teams.

A

Principal Systems Engineer (C++) - AI Infrastructure

Alignerr Hybrid New York City

VIEW

Posted 8 days ago

A senior C++ full-stack systems engineer is needed to build reliable, high-performance infrastructure and tooling for AI data pipelines and evaluation workflows at a remote-first AI-focused company.

A

C++ Engineer - High Performance Computing (HPC)

Alignerr Hybrid Denver

VIEW

Posted 8 days ago

Work remotely as a senior C++ engineer building and optimizing high-performance systems and full-stack tooling for AI data pipelines and evaluation workflows.

A

Principal Rust Engineer - ML Infrastructure

Alignerr Hybrid New York City

VIEW

Posted 8 days ago

A senior Rust engineer is needed to build and optimize high-performance ML data and evaluation infrastructure for Alignerr’s AI research and production workflows on a part‑time remote contract.

Strategic Projects Lead, Handshake AI

Handshake Hybrid San Francisco

VIEW

Posted 9 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Lead and execute high-impact AI data programs at Handshake, coordinating large distributed teams and partnering with frontier AI labs to drive revenue, quality, and scalable delivery.

Learning & Development Partner

Agiloft Hybrid United States

VIEW

Posted 10 days ago

Agiloft seeks a Learning & Development Partner to design and deliver scalable, data-driven learning programs—especially AI-focused enablement—to accelerate employee development and organizational capability.

A

Senior C# Full-Stack Engineer — AI Data & Infrastructure

Alignerr Hybrid Los Angeles

VIEW

Posted 12 days ago

Experienced full-stack engineer needed to build and optimize C# systems and tooling that power large-scale AI data and evaluation workflows at Alignerr.

Commercial Pilots - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 12 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake is hiring experienced commercial pilots to remotely evaluate and refine AI model outputs using real-world aviation expertise.

P

Founding Engineer (AI / ML)

Pax Historia Hybrid San Francisco

VIEW

Posted 14 days ago

Pax Historia seeks a founding ML systems engineer in San Francisco to build production-grade infrastructure, evaluations, and model tuning that make their AI-driven game both higher-quality and more affordable.

Prompt Engineer

Take2 Hybrid New York

VIEW

Posted 15 days ago

Take2 AI seeks a hands-on Prompt Engineer to design and scale AI Interviewers and evaluation systems that automate and improve high-volume candidate screening.

Founding AI Engineer

Everstar Hybrid New York City

VIEW

Posted 15 days ago

Lead production-grade LLM and AI agent development at Everstar to accelerate nuclear deployment through rigorous evals, fine-tuning, and synthetic data pipelines.

A

Senior C++ Full-Stack Engineer — AI Data & Infrastructure

Alignerr Hybrid San Francisco

VIEW

Posted 15 days ago

Work as a contract Senior C++ Full-Stack Engineer building high-performance C++ systems and full-stack tooling to support large-scale AI data, annotation, and evaluation workflows for leading labs.

A

Senior Backend Engineer (C#) - AI Data Platform

Alignerr Hybrid San Francisco

VIEW

Posted 15 days ago

Experienced C# backend engineer needed to build and optimize high-performance services and full-stack tooling for AI data pipelines and evaluation workflows at Alignerr.

A

Backend Developer - AI Data Services

Alignerr Hybrid Los Angeles

VIEW

Posted 15 days ago

Alignerr is hiring a Senior C# Full-Stack Engineer to build high-performance backend services and tooling for AI data pipelines and evaluation systems on a remote, contract basis.

A

Backend & Tooling Engineer (Rust)

Alignerr Hybrid Los Angeles

VIEW

Posted 16 days ago

Alignerr is hiring a Senior Rust engineer to build high-performance backend services and developer tooling for AI data pipelines and evaluation workflows on a part-time contract basis.

C

Software Engineering Intern (AI & Systems)

Candid Intelligence Hybrid San Francisco

VIEW

Posted 16 days ago

Work on production-grade data ingestion, UI components, evaluation systems, and agent tooling at a small engineering company focused on automating pre-construction engineering workflows.

Senior Software Engineer (AI Applications)

Learning A-Z Hybrid Remote

VIEW

Posted 16 days ago

Cambium Assessment is hiring a Senior Software Engineer to design and implement responsible generative AI agents and integrate advanced LLM-driven capabilities into mission-critical EdTech products.

A

Lead Backend Engineer - AI Tooling

Alignerr Hybrid San Francisco

VIEW

Posted 16 days ago

Experienced Python backend engineer needed to build and optimize scalable data and evaluation tooling for cutting-edge AI research workflows at Alignerr.

A

Lead Systems Engineer (Rust) - AI Platform

Alignerr Hybrid Seattle

VIEW

Posted 16 days ago

Senior Rust systems engineer needed to architect and optimize high-performance backend and tooling for AI data pipelines and evaluation workflows at Alignerr.

A

C++ Backend Engineer - AI Data Platforms

Alignerr Hybrid San Francisco

VIEW

Posted 17 days ago

Alignerr seeks a senior C++ full-stack engineer to build high-performance backend and tooling for AI data pipelines and model evaluation on a flexible remote contract.

A

Rust Software Engineer - Distributed Systems

Alignerr Hybrid Seattle

VIEW

Posted 19 days ago

Work remotely as a contract Senior Rust Full-Stack Engineer to build and optimize distributed systems powering AI data pipelines, annotation, and evaluation workflows for Alignerr.

Data Entry & Content Review Specialist (AI Model Evaluation)

CloudFactory Hybrid No location specified

VIEW

Posted 19 days ago

Health Savings Account (HSA)

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Performance Bonus

Paid Holidays

CloudFactory is seeking US-based, detail-focused Data Entry & Content Review Specialists for a full-time, fixed-term AI model evaluation project running Feb 1–Oct 1, 2026.

Senior Machine Learning Engineer - AI Eval & Safety

Red Hat Hybrid Boston

VIEW

Posted 20 days ago

Red Hat's OpenShift AI team is hiring a Senior ML Engineer to architect and lead large-scale evaluation and safety infrastructure for LLMs and agentic systems in open-source and hybrid-cloud environments.

First-Line Supervisors of Mechanics, Installers, and Repairers - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake is seeking experienced supervisors in mechanics, installation, or repair to work remotely as contract AI Trainers evaluating and improving model outputs using their hands-on expertise.

Health Education Specialists - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Contract Health Education Specialists will use their public health experience to develop prompts and evaluate AI-generated health education content in a flexible, remote, asynchronous role.

Instructional Coordinators - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks experienced instructional coordinators to assess AI outputs and provide structured, field-informed feedback on a flexible, remote contract basis.

Mathematicians - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake is contracting mathematicians to remotely evaluate AI-generated math content and provide expert feedback to improve model accuracy and domain understanding.

Meeting, Convention, and Event Planners - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Experienced event planners are sought for a remote, flexible contract to evaluate and train AI models using real-world event planning expertise.

First-Line Supervisors of Entertainment and Recreation Services - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Experienced entertainment and recreation supervisors are needed to evaluate AI responses, craft field-relevant prompts, and provide feedback in a remote, hourly contract role.

Aerospace Engineers - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks aerospace engineering professionals to evaluate AI outputs and craft domain-specific prompts on a flexible, remote contract basis at $150/hr.

Agents and Business Managers of Artists, Performers, and Athletes - AI Trainer (Contract)

Handshake Hybrid Remote

VIEW

Posted 20 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks seasoned talent agents and business managers to evaluate AI-generated content and craft prompts that reflect real entertainment and sports industry workflows in a flexible, contract role.

Below 50k* 7 41%
50k-100k* 1 6%
Over 100k* 9 53%

Ai Evaluation Jobs

How much do ai evaluation jobs pay?

Top companies hiring for ai evaluation jobs

Best cities to find ai evaluation jobs

Sign up for our weekly newsletter of fresh jobs

Sign up for our weekly
newsletter of fresh jobs