Agent Evaluation Jobs

Browse 9 exciting jobs hiring in Agent Evaluation now. Check out companies hiring such as Replit, Arcade, Decagon in San Diego, New Orleans, Atlanta.

VIEW COMPANIES

Data Scientist, AI Agent

Replit Hybrid Foster City

VIEW

Posted 21 hours ago

Inclusive & Diverse

Mission Driven

Work/Life Harmony

Diversity of Opinions

Friends Outside of Work

Empathetic

Collaboration over Competition

Fast-Paced

Transparent & Candid

Medical Insurance

Dental Insurance

Vision Insurance

Disability Insurance

Learning & Development

401K Matching

Paid Time-Off

WFH Reimbursements

Paid Holidays

Equity

Flex-Friendly

Lead experimentation, trace analysis, and metric design to measure and improve Replit's AI agent, converting agent traces into product-changing insights for engineering and leadership.

Product Manager, AI

Arcade Hybrid Presidio

VIEW

Posted 3 days ago

Lead development of Arcade’s conversational AI product creation agent as the company’s first dedicated Product Manager for AI, reporting directly to the CEO.

Staff Software Engineer, Agent Orchestration

Decagon Hybrid San Francisco

VIEW

Posted 7 days ago

Lead the architecture and long-term evolution of Decagon’s agent orchestration engine to enable reliable, high-performance AI agent behavior at scale.

FM Research Cybersecurity Co-op - Summer/Fall 2026

FM Hybrid NORWOOD, Massachusetts

VIEW

Sr. Research Engineer - Electrical/Power Generation - Design, operation, maintenance of electrical/power generation equipment

FM Hybrid NORWOOD, Massachusetts

VIEW

FM Approvals Quality Assurance Compliance Auditor - Manufacturing

FM Hybrid FRISCO, Texas

VIEW

Senior Software Engineer, AI Product

Vanta Hybrid No location specified

VIEW

Posted 14 days ago

Inclusive & Diverse

Growth & Learning

Customer-Centric

Collaboration over Competition

Medical Insurance

Maternity Leave

Flex-Friendly

401K Matching

Lead applied AI product work at Vanta by designing, shipping, and scaling LLM-powered features that accelerate customer compliance and trust.

Principal AI/ML Engineer (Remote - US)

Jobgether Hybrid No location specified

VIEW

Posted 16 days ago

Lead the architecture and delivery of large-scale, regulated AI systems—driving multi-agent, multi-modal solutions and engineering standards across cross-functional teams.

Staff AI Engineer

Awesome Motive Hybrid United States

VIEW

Posted 19 days ago

Bond Studio AI is hiring a Staff AI Engineer to design and implement production AI systems and multi-agent LLM architectures that power agentic 3D design experiences for real-world spaces.

Senior Engineer, AI Evaluation & Reliability (Agentic AI)

Anomali Hybrid Redwood City, CA

VIEW

Posted 21 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Vision Insurance

Family Medical Leave

Paid Holidays

Lead the design and execution of evaluation, reliability, and production-scale testing for Anomali’s agentic AI features that automate SOC workflows and improve analyst productivity.

Intern, Agent Development (Winter 2026)

Sierra Hybrid San Francisco

VIEW

Posted 23 days ago

Help build and ship production AI agents at Sierra as a Software Engineer intern, contributing to the design, implementation, and real-world evaluation of agent features.

Tech Lead - Applied ML

Basis AI Hybrid New York

VIEW

Posted 26 days ago

Lead the architecture and hands-on implementation of a critical ML subsystem at Basis, shaping how our AI agents think, learn, and operate in production.

Employment type

Remote/Onsite

Application Type

Date Posted

Department

Work Experience

Industries

Skills

Company size

Funding

Company Culture

Benefits & Perks