Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in rust to search over it. We also own a $5m H200 GPU cluster and routinely run batchjobs with 10s of thousands of machines. This isn't your average startup :)
On the ML team, we train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, put the web into an extremely powerful database.
We're looking for an ML research engineer to train embedding models for perfect search over the web. The role involves dreaming up novel transformer-based search architectures, creating datasets, creating evals, beating our internal SOTA, and repeat.
Desired Experience
You have graduate-level ML experience (or are an exceptionally strong undergrad)
You can code up a transformer from scratch in pytorch
You like creating large-scale datasets and diving deeply into the data
You care about the problem of finding high quality knowledge and recognize how important this is for the world
Example Projects
Pre-training -- train a hundred billion parameter model to
Finetuning -- Build an RLAIF pipeline for search
Dream up a novel architecture for search in the shower, then code it up and beat our best model's top score
Build an eval system that answers how do we know we're advancing our search quality? (this is an incredibly difficult question to answer)
This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3).
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Work on Exa's core backend systems to build extremely high-performance crawling, indexing, and vector search infrastructure for AI applications in San Francisco.
Lead development and analysis of quantum information protocols and high-performance algorithm implementations for mission-driven research requiring U.S. citizenship and security-clearance eligibility.
Lead and manage medical device clinical trials for PROCEPT BioRobotics, overseeing site selection, monitoring, regulatory compliance, and data quality with significant domestic travel.
Support quantitative research at UT Austin's IC² Institute by developing statistical and computational models, managing large datasets, and contributing to AI/ML and agent-based modeling projects.
Provide strategic leadership for the Oregon Longitudinal Data Collaborative, advancing the statewide longitudinal data system to inform education and workforce policy.
Albany Medical Center seeks a detail-oriented Research Technician to support molecular and cellular physiology research through routine experiments, sample preparation, and laboratory maintenance.
Spry is hiring an entry-level Intelligence Analyst in Huntsville to conduct source and open-source research, support intelligence collection and interagency coordination, and participate in an intensive training program (Top Secret clearance required).
Experienced clinical research professional wanted to manage TI oncology trials at OHSU's Knight Cancer Institute, supporting regulatory, patient-facing, and data operations for transplant and cell therapy studies.
Experienced counterintelligence professional needed to deliver expert threat assessments, analytic products, and liaison-driven CI support for the DOE‑IN program in the Washington, D.C. area.
Beth Israel Deaconess Medical Center is recruiting a Director-level Clinical Cytogeneticist to co-lead and expand a comprehensive cytogenetics laboratory while advancing clinical services, education, and translational research.
The Energy & Environment Lab at the University of Chicago is seeking a Research Director to lead rigorous applied research in energy and environmental policy, manage research teams, and partner with faculty and policymakers to produce and disseminate evidence-driven solutions.
Serve as the primary administrator for foundational research awards at the American Heart Association, managing award compliance, budgets, and stakeholder communications for assigned programs and regions.
Shriners Children’s Lexington is seeking an organized Clinical Research Coordinator 1 to manage regulatory documentation, recruit and consent pediatric participants, and ensure accurate data collection and compliance for clinical studies.