Preference Model is building the next generation of training data to power the future of AI.
Today's models are powerful but fail to reach their potential across diverse use cases because so many of the tasks that we want to use these models for are outside of their training data distribution. Preference Model creates reinforcement learning environments that encapsulate real-world use cases, enabling AI systems to practice, adapt, and learn from feedback grounded in reality. We seek to bring the real world into distribution for the models.
We're seeking experienced ML engineers to build distributed training infrastructure for our RL training initiatives, including:
Design and implement scalable distributed training infrastructure using PyTorch and Ray
Create automation tools for monitoring, debugging, and recovering from infrastructure failures in distributed training environments
Ensure infrastructure reliability, security, and performance meet the demanding requirements of large-scale ML workloads
We're looking for candidates with the following qualifications and attributes:
Experience building and operating ML infrastructure at scale
Proficiency in PyTorch and distributed training paradigms
Hands-on experience with Ray
Experience with at least one modern RL training framework (verl, NeMo-RL, ART, Atropos, or similar)
Proficiency in Python and systems programming
Experience with container orchestration (Kubernetes), infrastructure as code (Terraform)
Strong systems thinking with the ability to design for scale
Excellent debugging skills across the entire stack
Collaborative mindset with strong communication skills to work effectively with researchers and engineers
Self-directed problem solver who takes ownership and drives solutions end-to-end
Passion for staying current with the rapidly evolving ML infrastructure landscape
Open-source ML infrastructure contributions
We value diverse perspectives and experiences. If you're excited about this role but don't check every box, we still encourage you to apply.
We are backed by a Tier 1 VC. We offer competitive base salary as well as generous equity (>90th percentile).
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
A 12-week onsite internship at Illumio's Kubernetes Engineering team to gain hands-on experience with container platforms, cloud-native tooling, and Zero Trust security techniques.
Help build and optimize low-latency crypto execution infrastructure at Blockhouse, focusing on exchange connectivity, real-time data pipelines, and production-ready trading systems.
Huron is hiring a Full Stack Engineer to design and deliver secure, production-ready AI applications and integrations leveraging React, Python, and cloud platforms.
Mirage is seeking a Member of Technical Staff to build scalable, high-performance training data pipelines and systems for video and multimodal model development at our Union Square HQ.
Experienced backend engineer needed to architect and operate scalable, low-latency services (Scala, Akka, gRPC, AWS) for FOX Entertainment’s high-performance platform.
Software Dev Engineer II to build a mobile shopping app on iOS using React Native, Java, JavaScript and WebSocket connectivity for the Delivery Excellence team in Bellevue.
Lead backend architecture and engineering at a rapidly scaling insurance SaaS company, owning core systems, reliability, and mentorship across the platform.
RIVO Holdings seeks a Senior Software Engineer to own and modernize its mission-critical Loan Management System using .NET and Azure while building and mentoring the engineering team that will support it.
Lead a small engineering team at Notion to improve search ranking, query understanding, and AI-powered search features that power the product's core experience and memory.
Lead the design and implementation of scalable, reliable backend systems that ensure accurate, fast, and transparent advisor payouts for a high-growth travel-tech company.
FSSI is hiring a Junior Software Engineer in Santa Ana to support data mapping, template composition, and production support while gaining hands-on experience with C#, template tooling, and secure development practices.
Develop and scale next-generation simulation training tools at Shield AI to enable realistic operator training for autonomous aircraft in classroom and field environments.
Lead the Enterprise Ecosystem engineering team at OpenAI to design and ship secure, scalable integrations and developer primitives that enable enterprises to build AI-first apps on top of ChatGPT.