Rise Jobs & Careers icon Model Evaluation Jobs

Browse 59 exciting jobs hiring in Model Evaluation now. Check out companies hiring such as Oura, Gates Foundation, OpenAI in Fayetteville, Lincoln, San Diego.

Oura Hybrid No location specified
Posted 6 hours ago

Lead evaluation and custom model development for Oura’s AI Advisor, combining production ML engineering with research to deliver reliable, actionable AI-driven health insights.

Posted 13 hours ago

A 12-month AI Fellowship at the Gates Foundation to design, prototype, and deploy responsible AI solutions for global health and development while building capacity across program teams.

Photo of the Rise User
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

OpenAI is hiring a Research Engineer/Scientist to advance personality and model-behavior research and integrate novel methods into products used by hundreds of millions of users.

MobilityWorks Regular Full-Time PHILADELPHIA, Pennsylvania
Sponsored
Photo of the Rise User
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Join a research team building agentic capabilities for ChatGPT, contributing to research, large-scale training, evaluations, and production deployment in a hybrid San Francisco role.

MLabs Hybrid No location specified
Posted 2 days ago

MLabs is hiring a Data Scientist to develop and productionize statistical and machine learning models that detect insurance fraud across workers' compensation and personal injury domains.

Lead product and context engineering efforts to improve LLM-driven AI agent performance and user experience for advice-focused client intents within Vanguard's Discretionary Advice Platform.

Prime Time Consulting Hybrid Annapolis Junction, Maryland
Posted 3 days ago

Prime Time Consulting is hiring a Level 3 Data Scientist in Maryland to develop and evaluate NLP tokenization and POS annotation solutions for government-focused language datasets.

Photo of the Rise User
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays

Handshake AI seeks an experienced Electrical Engineering specialist (contract) to refine and annotate AI model outputs across circuits, signal processing, and embedded/embedded-systems domains.

Posted 4 days ago

Work at the intersection of research and deployment to turn Twelve Labs’ video understanding models into scalable, production solutions for customers.

Photo of the Rise User

Build and ship mission-critical conversational AI agents at Decagon, working directly with enterprise customers to create scalable, high-impact solutions.

Posted 5 days ago

Dandy is hiring a Senior Machine Learning Engineer to advance 3D computer vision and generative ML models that automate and scale dental appliance manufacturing.

Photo of the Rise User
Mercor Hybrid San Francisco
Posted 9 days ago

Mercor is hiring an Applied AI Engineer to convert real-world human datasets into production-ready signals, deploy and evaluate LLMs, and build integrations and tooling that improve customer outcomes.

Photo of the Rise User
Posted 10 days ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

OpenAI seeks a Research Engineer to design, build, and iterate frontier evaluations that quantify financial reasoning and related capabilities in large-scale AI models.

Photo of the Rise User
Posted 10 days ago

An entry-level AI engineering position at OCC focused on building data integrations, evaluating AI tools, and supporting responsible AI implementations across business and technology teams.

Photo of the Rise User
Posted 10 days ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Lead the development of large-scale, auditable evaluations for frontier AI models to measure capabilities and steer safety decisions at OpenAI.

Photo of the Rise User

BetterUp seeks a product-focused Staff Machine Learning Engineer to design and deliver cutting-edge Generative AI coaching experiences and help scale ML systems in production.

TrustLab is hiring a Senior AI Engineer to develop, tune, and deploy LLM-based content moderation systems that operate at enterprise scale.

Photo of the Rise User
Granted Consulting Hybrid No location specified
Posted 11 days ago

Lead development of LLM-driven systems at a mission-driven healthcare startup, focusing on prompt engineering, model optimization, and scalable AI product delivery.

Photo of the Rise User
Posted 13 days ago

PointClickCare seeks an experienced Principal AI Engineer to lead architecture and delivery of agentic AI systems that drive safe, scalable AI adoption across its healthcare platform.

Blue River Technology is hiring a CVML Engineer to drive data-centric workflows, dashboards, and model development for the See & Spray precision agriculture project.

Posted 13 days ago

MCI is hiring a Prompt Engineer to craft and refine prompts for generative AI models, improving output quality across product and customer-facing applications.

Photo of the Rise User
Posted 13 days ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays

Handshake is seeking a Strategic Projects Lead to manage large-scale AI data projects, scale an expert workforce, and optimize cross-functional operations to support growth and customer success.

College Board's GenAI Studio is hiring a Data Scientist to prototype and evaluate generative AI solutions that support students, educators, and internal products in a fully remote, mission-driven environment.

Posted 14 days ago

MCI seeks a detail-oriented Prompt Engineer to craft and optimize prompts for generative AI models and integrate them into practical BPO and product workflows.

Eve Hybrid San Mateo, California
Posted 15 days ago

Eve is hiring an AI Engineer to build, optimize, and ship LLM-powered systems that transform legal workflows and improve outcomes for plaintiff attorneys.

Posted 16 days ago

Seeking licensed litigators to evaluate and train advanced legal AI models by testing realistic legal scenarios and documenting model reasoning gaps.

Photo of the Rise User
Posted 17 days ago

Zoox is hiring a Senior ML Engineer to design AutoML and evaluation systems that align autonomous driving software with expert human driving behavior.

Photo of the Rise User
Posted 17 days ago

Harvey seeks an Applied Legal Researcher who combines corporate law expertise and hands-on AI evaluation skills to design and validate legal workflows used by leading law firms.

Photo of the Rise User
Posted 19 days ago

College Board's GenAI Studio is hiring a Data Scientist to prototype, evaluate, and operationalize generative AI solutions that support students, educators, and internal teams.

Photo of the Rise User

Lead and scale new customer acquisition channels at Reprise Financial as Head of New Customer Strategy & Analytics, driving profitable growth via direct mail, affiliate, and search with a data-first approach.

FM Hybrid WALNUT CREEK, California
Sponsored
Photo of the Rise User
Posted 19 days ago

Help Zoox design and productionize ML and AutoML systems that learn from expert human drivers to benchmark and tune autonomous vehicle behavior for safer, more natural driving.

Photo of the Rise User
Posted 20 days ago
Inclusive & Diverse
Mission Driven
Social Impact Driven
Passion for Exploration
Dare to be Different
Diversity of Opinions
Reward & Recognition
Empathetic
Feedback Forward
Work/Life Harmony
Collaboration over Competition
Growth & Learning
Transparent & Candid
Customer-Centric
Rise from Within
Friends Outside of Work
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Work Visa Sponsorship
Employee Resource Groups
401K Matching
Paid Time-Off
Maternity Leave
Social Gatherings
Company Retreats

Lead the development and scaling of large language model customization and adaptation as a Principal Machine Learning Engineer on Microsoft's CoreAI - PostTraining team.

Photo of the Rise User
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Lead cross-functional research programs to discover, evaluate, and mitigate adversarial behaviors in large language models at OpenAI's San Francisco office.

Photo of the Rise User
Crux Hybrid No location specified
Posted 20 days ago

Crux is hiring an AI Product Engineer to lead the build-out of production AI features and infrastructure that modernize financing for clean energy projects.

Photo of the Rise User
Anduril Industries Hybrid Costa Mesa, California, United States; Seattle, Washington, United States; Washington, District of Columbia, United States
Posted 20 days ago

Lead engineering work to productize and deploy frontier AI models into edge and air-gapped defense systems while building evaluation and deployment pipelines for simulation-driven workflows.

Photo of the Rise User
Posted 20 days ago

Help design, train, and deploy state-of-the-art speech recognition and pronunciation models at Speak to power personalized language learning experiences worldwide.

Photo of the Rise User
Posted 20 days ago

Help build and productionize novel LLM-driven lesson experiences and assessment systems at Speak, a fast-growing Series C AI language learning company based in San Francisco.

Photo of the Rise User
Middesk Hybrid No location specified
Posted 21 days ago

Middesk is hiring a founding Data Scientist to design scalable ML analytics and operationalize model-backed identity and fraud products on a hybrid SF/NY team.

Yupp AI Hybrid Mountain View
Posted 21 days ago

Yupp seeks an experienced Staff+ AI Engineer in Mountain View to architect and ship scalable LLM applications and lead ML lifecycle work across data, model development, evaluation, and production.

Daydream Hybrid New York City
Posted 21 days ago

Join Daydream as a Data Scientist to design and deploy LLM-driven stylist features and lead model lifecycle work that reimagines fashion shopping.

Penske Truck Leasing Hybrid WEST COLUMBIA, South Carolina
Sponsored
Aarons Corporate Retail Store SCHENECTADY, New York
Sponsored
Photo of the Rise User

Mercor seeks PhD-level STEM experts with scientific Python experience to evaluate and improve LLM-generated code and reasoning in an asynchronous, remote contractor role.

Photo of the Rise User
Posted 21 days ago

Mercor seeks PhD-level biological scientists to design and evaluate advanced biology problems for a top AI lab in a flexible, remote contractor role.

Photo of the Rise User
Posted 21 days ago

Mercor is recruiting PhD-level scientists and advanced STEM graduates to perform part-time, remote evaluations of LLM outputs in biology, physics, and chemistry for a high-impact AI research program.

Photo of the Rise User
Posted 21 days ago

Mercor seeks experienced MDs/DOs to work remotely, part-time, on a 5-week project evaluating AI systems for clinical tasks and medical workflow simulations.

Yupp Hybrid Mountain View
Posted 22 days ago

Senior-level AI engineer needed to design, build, and scale production LLM applications at a fast-growing Silicon Valley startup focused on trustworthy model evaluation and GenAI products.

Photo of the Rise User
Posted 22 days ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Paid Holidays

Kiddom seeks an AI Researcher to drive generative-AI research and build safe, practical AI features that personalize instruction and improve student outcomes.

Photo of the Rise User
Posted 24 days ago

Lead Kyivstar.Tech’s NLP efforts to design, train, and deploy Ukrainian-focused LLMs and NLP systems while mentoring a team and shaping the product roadmap.

Yupp Hybrid Mountain View
Posted 25 days ago

Lead product strategy and execution for Yupp’s consumer and AI-builder platforms, shaping features that impact millions and improve model evaluation and adoption.

Photo of the Rise User
Guidewire Hybrid United States - Remote
Posted 25 days ago

Guidewire seeks a Product Manager focused on GenAI/ML to lead development of an AI-native underwriting solution that delivers seamless data ingestion, model-driven risk assessment, and platform integrations for P&C insurers.

Photo of the Rise User
Posted 26 days ago

Yupp is hiring a Senior Product Manager to own consumer and AI-builder product strategy, model evaluation frameworks, and growth initiatives at scale.

Aarons Corporate Retail Store DEFIANCE, Ohio
Sponsored
Employment type
Remote/Onsite
Application Type
Date Posted
Department
Work Experience
Industries
Skills
Company size
Funding
Company Culture
Benefits & Perks
Company Rating
Salary (USD)
Keywords to Exclude

How much do model evaluation jobs pay?

Below 50k*
2
4%
50k-100k*
3
7%
Over 100k*
41
89%
*average yearly salary (USD)

Top companies hiring for model evaluation jobs

Best cities to find model evaluation jobs