Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5M H200 GPU cluster and routinely run batch jobs on tens of thousands of machines. This isn't your average startup :)
On the ML team, we train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, we want to put the web into an extremely powerful database.
We're looking for an ML evals engineer to design and build our eval stack at Exa. The role involves investigating how to evaluate search engines in an LLM world and then building the most comprehensive, creative, and effective eval suite. You will be deciding the future of search through the evals we choose to optimize for.
Desired Experience
You have some ML experience
You have strong engineering experience
You like creating evaluation datasets and diving deeply into the data
You care deeply about the problem of search and want to create an eval suite that helps us get as perfect a search engine as we can
Example Projects
Write a manifesto of what perfect search means
Identify the biggest problems in our search and make an eval for those problems
Think of creative ways to gather evaluation data
This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3).