Model Evaluation Jobs

Browse 21 open jobs in Model Evaluation. Companies hiring include Cohere, Prime Time Consulting, and Shae Group, with roles in Charlotte, New York, and Fort Worth.

Member of Technical Staff, Pretraining evaluations

Cohere Hybrid No location specified

Posted 10 hours ago

Startup Mindset

Collaboration over Competition

Growth & Learning

Inclusive & Diverse

Cohere is hiring a Member of Technical Staff for Pretraining Evals to design, implement, and improve robust evaluation and statistical pipelines that measure base model capabilities across scales.

Data Scientist 3 (Colorado)

Prime Time Consulting Hybrid Aurora, Colorado

Posted 14 hours ago

Experienced Data Scientist needed to develop and evaluate automated tokenization and POS annotation solutions for speech and text in support of government-focused NLP projects.

Fractional CTO / AI Advisory Council Member - Remote (Contractor)

Shae Group Hybrid No location specified

Posted yesterday

Provide fractional CTO-level AI architecture and safety advisory support to Shae Group, guiding model reliability, vendor choices, and design decisions across high-impact AI products.

Embedded Software Engineering Opportunities at SharkNinja

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

Senior Research Scientist - Material Flammability, Fire Dynamics and Lithium-ion Battery Safety

FM Hybrid NORWOOD, Massachusetts

Sponsored

Embedded Systems Engineer

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

Sr. Staff ML Engineer, Risk Solutions

PayPal Hybrid San Jose, California, United States of America

Posted 3 days ago

Lead the design, evaluation, and productionization of machine learning models and signal measurement frameworks to power PayPal's risk solutions and signal marketplace.

Senior Machine Learning Engineer

Jobgether Hybrid US

Posted 9 days ago

Senior Machine Learning Engineer needed to build and deploy scalable, production ML systems that improve healthcare outcomes and operational efficiency.

Data Scientist/AI Engineer (Remote)

YouGov Hybrid New York, United States of America

Posted 11 days ago

YouGov seeks a hands-on Data Scientist/AI Engineer to build and deploy LLM-based applications and advanced analytics for market research using survey, census, and behavioral datasets.

Product Manager, AI

Arcade Hybrid Presidio

Posted 14 days ago

Lead development of Arcade’s conversational AI product creation agent as the company’s first dedicated Product Manager for AI, reporting directly to the CEO.

Senior Data Analyst, AI Evaluation

Elsevier Hybrid Remote

Posted 19 days ago

Elsevier is hiring a Senior Data Analyst to lead analytics and evaluation frameworks for generative AI models used in healthcare, ensuring accuracy, safety, and clinical relevance.

BAML Engineer

Vetcove Hybrid Remote

Posted 20 days ago

Mission Driven

Inclusive & Diverse

Growth & Learning

Transparent & Candid

Medical Insurance

Dental Insurance

Vision Insurance

401K Matching

Flex-Friendly

Equity

Vetcove seeks an AI-focused BAML Engineer to design, implement, and maintain BAML-driven LLM workflows and evaluation tooling for its veterinary software platform.

Research Scientist

Oumi Hybrid New York

Posted 21 days ago

Oumi seeks a Research Scientist to advance open-source LLM and VLM research by developing models, datasets, benchmarks, and publishing results with the community.

Embedded Software Engineer

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

Senior Research Scientist – Computational Wind Engineering

FM Hybrid NORWOOD, Massachusetts

Sponsored

Software Engineer

SharkNinja Hybrid NEEDHAM, Massachusetts

Sponsored

Technical Product Manager, AI

webAI Hybrid Austin

Posted 22 days ago

Lead the strategy and delivery of distributed inference, LLM integrations, and on-device ML features at webAI to enable privacy-first, enterprise-grade AI on the edge.

Staff machine learning engineer

Watershed Hybrid No location specified

Posted 22 days ago

Experienced ML/AI engineer needed to lead development and productionization of LLM- and embedding-based features for Watershed's enterprise sustainability platform.

Red Teaming Domain Expert - AI Training (Contract)

Handshake Hybrid Remote

Posted 24 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake AI is hiring a contract Red Teaming Domain Expert to craft adversarial prompts and stress-test LLMs for safety and robustness across real-world edge cases.

Technical Program Manager, AI Platform

Figma Hybrid Remote

Posted 24 days ago

Empathetic

Collaboration over Competition

Growth & Learning

Passion for Exploration

Fast-Paced

Startup Mindset

Diversity of Opinions

Rise from Within

Figma is hiring a seasoned Technical Program Manager to drive AI platform programs that scale annotation, evaluation, and model delivery across engineering, research, and product teams.

Senior Backend Engineer, Evals and AI Infra

Commure Hybrid Mountain View

Posted 26 days ago

Join Commure's Ambient Scribe team as a Senior Backend Engineer to build and scale eval and AI infrastructure that powers next-generation clinical AI products.

Principal AI/ML Engineer (Remote - US)

Jobgether Hybrid No location specified

Posted 26 days ago

Lead the architecture and delivery of large-scale, regulated AI systems—driving multi-agent, multi-modal solutions and engineering standards across cross-functional teams.

Principal Product Manager, Feed Relevance

LinkedIn Hybrid Mountain View, CA

Posted 26 days ago

Lead the next generation of AI-driven ranking and recommendation systems for LinkedIn's Feed to improve relevance, personalization, and member engagement at massive scale.

Engineering Manager, CoreAI (HALO)

LinkedIn Hybrid Mountain View, CA

Posted 26 days ago

Lead a small engineering team to build and scale LinkedIn’s HALO model and agent evaluation platform, combining hands-on technical delivery with people and cross-functional leadership.

Senior Staff AI Research Engineer

CampusESP Hybrid Remote

Posted 27 days ago

Lead the design, research, and deployment of novel AI systems at Campus to personalize and measurably improve the student learning experience.

Staff AI Engineer

Awesome Motive Hybrid United States

Posted 29 days ago

Bond Studio AI is hiring a Staff AI Engineer to design and implement production AI systems and multi-agent LLM architectures that power agentic 3D design experiences for real-world spaces.

Senior Research Engineer – Mechanical - Rotating Machinery

FM Hybrid NORWOOD, Massachusetts

Sponsored

FM Approvals Research Campus Engineering Technician - Materials

FM Hybrid WEST GLOCESTER, Rhode Island

Sponsored

Field Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid MALVERN, Pennsylvania

Sponsored

Solutions Engineer

Kilo Code Hybrid No location specified

Posted 30 days ago

Kilo Code seeks a hands-on Solutions Engineer to run high-leverage demos and POCs, bridge sales and engineering, and help shape the company’s pre- and post-sales technical motions.

How much do model evaluation jobs pay?

Salary range*   Jobs   Share
Below 50k       0      0%
50k-100k        0      0%
Over 100k       20     100%

*average yearly salary (USD)
