Browse 6 exciting jobs in Evaluation Pipelines now. Check out companies that are hiring, such as Mistral AI, Zillow, and Awesome Motive, in Santa Ana, Winston-Salem, and Arlington.
Join Mistral AI as a Model Behavior Architect to shape LLM behavior through prompt design, evaluation pipelines, and policy work informed by humanities expertise.
Zillow's Agentic AI team is hiring a Machine Learning Engineer to design, train, evaluate, and ship agentic LLM solutions that improve user understanding and decision-making across the home search experience.
Work on the core intelligence at a seed-stage startup, designing experiments, optimizing inference, and building training and eval systems that turn messy UI and behavioral data into production-ready models.
Join Cartesia’s in-office SF research-engineering team to design and scale synthetic datasets and systems that power next-generation foundation models.
Work as a founding Backend Engineer to build scalable, secure backend infrastructure and data pipelines that power high-impact AI features at an early-stage startup in NYC.
Lead research and engineering to build production-ready post-training RL recipes, environments, and evaluation pipelines for a fast-growing startup powering foundation model labs.
Below 50k*: 0
50k-100k*: 0
Over 100k*: 1