About The LLM Data Company
The LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6M from Tier 1 VCs and are growing 200%+ month-over-month.
Responsibilities
Design and implement scalable RL recipes for post-training task-specific models
Develop modular environments, reward functions, and evaluator scaffolds for internal and customer-facing tasks
Drive research at the intersection of scalable infra and modern RL frameworks to enable RL-as-a-service
Drive foundational research to publish open source environments and training data
Build data generation and curation pipelines to support frontier post-training
Collaborate with product teams to deliver a user-friendly interface that lets non-technical users generate data
Qualifications
Master's or PhD in Computer Science or a related field
Comfort with core tooling (verl, PyTorch, etc.)
Familiarity with modern post-training techniques (GRPO, etc.)
Experience with evaluations and reward engineering
Publications at top venues (ICLR, NeurIPS, ICML, etc.)
Why you should join
Cutting-edge research: Work on unpublished, novel training environments
Direct lab exposure: Projects that labs actually use and validate in production
High autonomy: Wide design space to propose and run experiments with minimal oversight
Early team member: Join as one of the first 10 people with significant equity upside