Bespoke Labs is an applied AI research lab pioneering data curation and RL environment curation for the modern agentic world. We curated Open Thoughts, one of the best open reasoning datasets used by multiple frontier labs, trained SOTA specialized models such as Bespoke-MiniChart-7B and Bespoke-MiniCheck, and taught agents to do multi-turn tool-calling with reinforcement learning.
Bespoke is uniquely positioned to capture a large market share of data and RL environment curation.
As a member of our technical staff, you will work on designing RL environment curation strategies. This involves coming up with recipes and strategies of creating RL environments. Ideal candidates are problem solvers who can understand the problem in a scientific way and can solve the problem practically.
Build our curation platforms for building/collecting/curating RL environments and data curation.
Do research on cutting edge curation strategies, especially for RL environments.
Come up with data and environment recipes, and work with contractors to create RL environments.
Verify whether environments are high quality, by checking for reward hacking, and training small scale agents.
Do data analysis to uncover insights about the environments.
PhD/MS in ML, and/or industry experience
Proficiency in languages like Python and experience with cloud platforms (GCP, AWS, etc.).
Ability to design systems that scale to handle large volumes of data and complex workflows.
Have extreme patience reading transcripts of rollouts.
A self-starter who is excited about working on hard technical problems in AI and data-centric platforms.
Have experience designing robust CI/CD pipelines, automated testing, observability, and monitoring.
Passionate about data curation, AI, RL environments, and post-training.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Experienced security researcher needed to lead threat and malware analysis, build detection systems, and drive end-to-end machine learning projects at Palo Alto Networks' Santa Clara engineering team.
Experienced materials scientist needed for hands-on TEM lamella preparation and Dual Beam FIB/SEM analysis on an evening shift at EAG Laboratories in Milpitas.
Work on a fast-paced multidisciplinary team to build, instrument, and test next-generation humanoid robotics hardware at Apptronik's Austin lab.