Member of Technical Staff: RL Environments

About Bespoke Labs

Bespoke Labs is an applied AI research lab pioneering data curation and RL environment curation for the modern agentic world. We curated Open Thoughts, one of the best open reasoning datasets used by multiple frontier labs, trained SOTA specialized models such as Bespoke-MiniChart-7B and Bespoke-MiniCheck, and taught agents to do multi-turn tool-calling with reinforcement learning.

Bespoke is uniquely positioned to capture a large market share of data and RL environment curation.

About the Role

As a member of our technical staff, you will design RL environment curation strategies, which involves developing recipes and strategies for creating RL environments. Ideal candidates are problem solvers who can analyze a problem scientifically and solve it practically.

What you will do

  1. Build our platforms for building, collecting, and curating RL environments and data.

  2. Do research on cutting-edge curation strategies, especially for RL environments.

  3. Come up with data and environment recipes, and work with contractors to create RL environments.

  4. Verify that environments are high quality by checking for reward hacking and training small-scale agents.

  5. Do data analysis to uncover insights about the environments.

Who you are

  1. PhD/MS in ML, and/or industry experience

  2. Proficiency in languages like Python and experience with cloud platforms (GCP, AWS, etc.).

  3. Ability to design systems that scale to handle large volumes of data and complex workflows.

  4. Extreme patience for reading rollout transcripts.

  5. A self-starter who is excited about working on hard technical problems in AI and data-centric platforms.

  6. Experience designing robust CI/CD pipelines, automated testing, observability, and monitoring.

  7. Passionate about data curation, AI, RL environments, and post-training.

Average salary estimate

$195,000 / year (est.)
Min: $150,000
Max: $240,000


Employment type: Full-time, hybrid
Date posted: August 25, 2025