Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device.
We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design-minded product engineering team to build and ship cutting edge models and experiences.
We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world's foremost experts in AI.
The Role
The future of AI training will be built on a foundation of high-quality synthetic data. We are looking for a creative and resourceful Synthetic Data Specialist to design and build the systems that generate training data at an unprecedented scale. This is a unique, high-impact role, where you will solve critical data bottlenecks and directly accelerate our research progress.
What you’ll do
Evaluate fidelity, diversity, and usefulness of synthetic data across LLMs, audio generation, and audio understanding.
Implement techniques for steering data generation to improve model intelligence through data and mitigate bias.
Build automated quality control systems to validate and filter generated data
Design synthetic datasets at large scale to develop model capabilities.
Stay on the cutting edge of research in synthetic data generation, data augmentation, and generative models.
What we’re looking for
Experience with generative models (speech, text, or multimodal).
Strong applied ML background with a focus on data-centric approaches.
Understanding of evaluation methods for synthetic data quality.
Excitement for building scalable systems that bridge research and production.
Familiarity with building large-scale distributed systems for synthetic data generation
Our culture
🏢 We’re an in-person team based out of San Francisco. We love being in the office, hanging out together and learning from each other everyday.
🚢 We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality and design along the way.
🤝 We support each other. We have an open and inclusive culture that’s focused on giving everyone the resources they need to succeed.
Our perks
🍽 Lunch, dinner and snacks at the office.
🏥 Fully covered medical, dental, and vision insurance for employees.
🏦 401(k).
✈️ Relocation and immigration support.
🦖 Your own personal Yoshi.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Peraton is hiring a Lead Associate Data Scientist to develop ML, generative AI, and large-scale analytics solutions in support of USSTRATCOM at Offutt AFB.
An on-site Junior Data Scientist role in Riverview focused on building pricing algorithms, supporting LLM-powered agents, and turning disparate data into actionable insights.
Contribute as an AI/ML new graduate at AVEVA to design, prototype, and deploy machine learning models and cloud-native AI services that solve real industrial problems.
Visa is hiring a Senior Machine Learning Engineer to develop and productionize LLM-based generative AI solutions (RAG, fine-tuning, inference) for global payments products.
Experienced data scientist with supply chain optimization and strong Python engineering skills needed to develop and productionize replenishment and inventory allocation models for a leading AI consultancy in the CPG/retail space.
Contribute to production ML systems as a Machine Learning Engineering Intern, helping build, evaluate, and deploy models while working closely with engineers and researchers.
Protegrity is hiring an MS-level Machine Learning Engineer to develop and fine-tune GenAI architectures and agentic tools that secure AI workflows in a hybrid Menlo Park environment.
Lead the design, development, and deployment of enterprise-scale generative AI and ML systems at Ascend Learning as a hands-on Principal AI Engineer.
Significance is hiring a cleared Data Scientist to build and validate AI/ML pipelines and integrate models with ERP systems to support federal finance and audit automation within Advana Mercury.
Beyondsoft is hiring a remote Software Development Engineer skilled in Power BI, data processing, and statistics to deliver actionable reports now and scalable engineering solutions over time.
Lead development and deployment of statistical and machine learning solutions at Truist to generate actionable insights, improve decisions, and mitigate risk.
Work as a Machine Learning Engineer at Protegrity to build and fine-tune GenAI solutions that advance data protection and privacy.
Protegrity is hiring a PhD-level Machine Learning Engineer to secure and scale GenAI systems by developing agentic AI solutions, fine-tuning LLMs, and applying ML to enterprise data protection.
Founded in 1992, Cartesia, Inc. is a group of talented professionals providing custom solutions in the areas of engineering design automation, Web-based applications development, and Microsoft Windows-based software construction and integration. ...
4 jobs