About The LLM Data Company
The LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6M from Tier 1 VCs and are growing 200%+ month-over-month.
Responsibilities
Design and implement scalable RL recipes for post-training task-specific models
Develop modular environments, reward functions, and evaluator scaffolds for internal and customer-facing tasks
Drive research at the intersection of scalable infra and modern RL frameworks to enable RL-as-a-service
Drive foundational research to publish open source environments and training data
Build data generation and curation pipelines to support frontier post-training
Collaborate with product teams to deliver a user-friendly interface that lets non-technical users generate data
Qualifications
Master's or PhD in Computer Science or a related field
Comfort with core tooling (verl, PyTorch, etc.)
Familiarity with modern post-training techniques (GRPO, etc.)
Experience with evaluations and reward engineering
Publications at top venues (ICLR, NeurIPS, ICML, etc.)
Why you should join
Cutting-edge research: Work on unpublished, novel training environments
Direct lab exposure: Projects that labs actually use and validate in production
High autonomy: Wide design space to propose and run experiments with minimal oversight
Early team member: Join as one of the first 10 people with significant equity upside