RL Environments Engineer (Contractor, Remote)

RL Environments Engineer (Remote, Contractor) - Preference Model

About the company

Preference Model is building the next generation of training data to power the future of AI. Today's models are powerful but fail to reach their potential across diverse use cases because so many of the tasks we want to use them for are out of distribution. Preference Model creates RL environments where models encounter research and engineering problems, iterate, and learn from realistic feedback loops.

Our founding team previously worked on Anthropic’s data team, building the data infrastructure, tokenizers, and datasets behind Claude. We are partnering with leading AI labs to push AI closer to achieving its transformative potential. We are backed by Tier 1 Silicon Valley venture capital.

Brief Description of the Role

We’re hiring RL Environments Engineers to design and build MLE environments. The goal is to teach LLMs better reasoning and advanced concepts from modern ML.
This is a remote contractor role. At least 4 hours of overlap with PST and advanced English (C1/C2) are required.

Minimum Qualifications:

  • Strong Python (engineering-quality, not notebook-only)

  • Docker + production mindset (debugging, reliability, iteration speed)

  • Clear understanding of LLMs and their current limitations

  • Ability to meet throughput expectations and respond quickly to feedback

You may be a good fit if one of the following applies:

  • Strong expertise in CUDA or Pallas kernel development, optimizing non-trivial neural modules for specific hardware

  • Expert knowledge in an active DL/ML research area, with publications or public code to show for it. We're especially interested in areas that are math-heavy and don't require massive compute. Examples include but aren't limited to:

      • Architectures: SSMs, KANs, tensor networks, hypernetworks, etc.

      • Generative modeling: diffusion, flow matching, probabilistic programming

      • Geometry and topology: geometric DL, topological DL, optimal transport

      • Reasoning: neuro-symbolic methods, algorithmic reasoning

      • Mechanistic interpretability: circuit analysis, causal discovery, grokking

      • Foundations: learning theory, control and constraint optimization

      • ML for science: physics-informed neural nets, computational neuroscience, quantum chemistry, structural bioinformatics, chemoinformatics, genomics

      • Numerical and simulation methods: stochastic time series, fluid dynamics, numerical relativity, Bayesian inference, Monte Carlo methods

  • You have strong fundamentals and broad research interests: you read many papers, understand them deeply, and have the creativity to translate them into RLVR problems

  • You have built complex interactive RL environments and have strong insights into open-ended RL-based learning systems

Average salary estimate

$225,000 / year (est.)
Min: $150,000
Max: $300,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

EMPLOYMENT TYPE: Contract, remote
DATE POSTED: January 11, 2026