Browse 24 exciting jobs hiring in RLHF now. Check out companies that are hiring, such as Aurora Innovation, Prime Intellect, and Cartesia, in Baton Rouge, Jacksonville, and New York.
Senior engineering role to lead dataset quality and model pipeline development using foundation models and RLHF to advance Aurora's self-driving stack.
Work at the intersection of RL, post-training evaluation, and production agent infrastructure to shape and deploy agentic AI systems used by real customers.
Cartesia is looking for a Post-Training Researcher to design and scale preference optimization, evaluation, and feedback-driven learning methods for multimodal foundation models.
Lead the design and delivery of adaptive, human-aware AI systems that improve trust and collaboration across text, speech, and interactive interfaces for a US-based organization.
Lead the development and scaling of LLM-driven product features as an Engineering Manager focused on ML strategy, team growth, and production-quality infrastructure in a remote-first, high-impact startup.
Impact the next generation of AI-augmented coaching by building, fine-tuning, and shipping LLM-driven features and production ML systems at BetterUp.
Lead post-training and RL efforts to enable frontier models to design, evaluate, and autonomously run scientific experiments in a real lab environment at a fast-growing AI + physical sciences lab.
Senior technical lead for designing and shipping agentic LLM systems that combine advanced context grounding, policy optimization, and low‑latency serving at scale for Gopuff’s Personal Superintelligence Lab.
Drive cutting-edge research and production implementation of agentic LLM systems at Gopuff, focusing on context grounding, alignment, and scalable low-latency inference for personalized shopping intelligence.
At Rox, an Applied AI Engineer will build and deploy agentic LLM-powered workflows in production to supercharge revenue teams and iterate rapidly with customers and product partners.
FloQast seeks a Staff Software Engineer to lead architecture and delivery of production Core AI systems that power its accounting automation platform.
Handshake AI seeks a Staff AI Research Scientist to lead and publish research on LLM evaluation, interpretability, and alignment while driving scalable benchmark systems.
Handshake AI is hiring a Staff AI Research Scientist to lead high-impact research and engineering on data quality, dataset auditing, and post-training alignment for large language models.
Handshake AI is seeking a Senior AI Research Engineer to architect and scale large post-training and evaluation systems for LLMs and lead engineering efforts that translate research into production-grade benchmarks and pipelines.
Welo Data is hiring a Prompt Engineer & Data Analyst to design and evaluate prompts, curate datasets, and perform rigorous model analysis to enhance LLM capabilities and safety.
Lead a cross-functional delivery squad at Welo Data to drive client-facing AI data programs, operational excellence, and OKR-driven results.
Epic Games is hiring a Senior Research Engineer to build and scale LLM tooling, training pipelines, and real-time AI features that enable immersive conversational experiences in games.
Help advance state-of-the-art AI safety by researching, implementing, and integrating alignment and evaluation techniques for large language models at a high-growth consumer AI company.
Handshake AI is hiring a Summer 2026 PhD Research Intern to lead a focused LLM research project remotely, with mentorship and the goal of producing an archive-ready manuscript or top-tier conference submission.
Sentient seeks research scientists/engineers to design and implement novel fine-tuning and agentic techniques that advance long-horizon reasoning and strategic multi-agent decision making for an open-source AGI platform.
Lead the design and production of multimodal, agentic AI systems at Zillow to deliver context-aware, autonomous experiences across vision, language, and voice for millions of customers.
Moonlake is looking for a hands-on Applied AI Research Engineer to develop and deploy agentic code-generation systems, from dataset curation and post-training to secure production execution and evaluation.
Periodic Labs seeks an experienced Distributed Training Engineer to optimize and operate frontier-scale LLM training systems that power AI-driven scientific research.
Latcher is hiring an AI Researcher focused on multi-modal post-training to develop state-of-the-art methods that make pre-trained models steerable, safe, and production-ready.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 24