At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 700+ customers in 58+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.
Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes. We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.
At Harvey, the future of professional services is being written today — and we’re just getting started.
At Harvey, we’re building the AI platform for the world’s top legal and professional services teams. As we scale, our data team sits at the heart of this mission—turning raw data and research into robust, intelligent systems that power reasoning at scale. Our Data Team powers Harvey’s ability to understand and leverage both public and private data at scale — building the infrastructure that ingests, transforms, and retrieves millions of documents to make our AI systems smarter every day.
We’re looking for a Director of Engineering, Data to lead this function into its next chapter. You’ll shape the strategy, architecture, and team behind the systems that make advanced reasoning possible. The Data team owns end-to-end retrieval-augmented generation (RAG) stacks across complex domains — including Case Laws, Legislation, and Tax codes across 50+ international jurisdictions. As generation and reasoning improve, retrieval quality has become the new frontier. Solving it at scale is one of Harvey’s top priorities.
If you’re excited by large-scale data engineering, complex information retrieval, and building the backbone of cutting-edge AI systems, we’d love to talk.
Lead and scale the Data organization from a single high-performing team into multiple specialized teams.
Partner closely with leadership to define the strategic roadmap for Harvey’s data ecosystem and ensure it scales with our global growth
Own and evolve Harvey’s end-to-end data architecture — from ingestion to transformation, storage, retrieval, and delivery — ensuring performance, reliability, and scalability to power LLMs and downstream applications.
Design and oversee large-scale data ingestion pipelines that aggregate, normalize, and maintain data from thousands of heterogeneous, publicly available legal and regulatory sources across global jurisdictions.
Integrate private and partner data sources, ensuring robust access controls, lineage tracking, and compliance with security and privacy requirements.
Evaluate and implement data infrastructure technologies to support large-scale document processing, embedding generation, vector storage, and retrieval optimization.
Collaborate closely with the Applied AI team to drive experimentation and model improvements that directly enhance AI quality and differentiation across Harvey’s products.
Drive the development of end-to-end research experiences that weave together our retrieval, reasoning, and UX layers — transforming AI insights into intuitive, lawyer-friendly workflows that redefine how professionals engage with complex information.
Partner cross-functionally with Product Engineering, Applied AI, Research, and Platform teams to deliver high-quality, production-ready systems.
You have 10+ years of experience in data engineering, data architecture, or platform engineering, with 5+ years of leading high-performance teams.
You’ve led data or ML infrastructure teams through scale — from startup to multi-team org.
You have a proven track record of building and scaling distributed data systems handling large, complex, and heterogeneous datasets.
You bring depth in backend, data infrastructure, or information retrieval, with a strong appreciation for applied AI.
You value clarity, craftsmanship, and high trust as the foundations of great engineering.
$320,000 - $360,000 USD
#LI-KV1
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing [email protected]
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead and grow fullstack product engineering teams at Harvey to deliver user-facing, AI-enabled features that transform how legal professionals work.
Lead the design and delivery of robust, scalable ML and data platform infrastructure at Quizlet to accelerate ML development and drive data reliability for millions of learners.
Paragon Cyber Solutions is hiring an experienced Database Engineer (active TS/SCI required) to manage and modernize enterprise databases supporting the DefenseReady application at HQ USSOCOM in Tampa, FL.
Smalls is hiring a Senior Analytics Engineer to architect and operate our data platform, turning raw data into trusted, actionable analytics for cross-functional teams.
College Board seeks an experienced AI/ML Data Engineer to build production data and ML plumbing—ETL, feature/embedding stores, RAG foundations, and observability—for personalized higher-ed experiences.
Join IntegriChain as a Data Pipeline / ETL Engineer to build and optimize scalable data ingestion and transformation pipelines powering healthcare analytics.
Harvey is a trusted generative AI company headquartered in San Francisco, California. We provide a suite of AI products tailored to lawyers and law firms across practice areas and workflows.
23 jobs