Job details

Data Engineer

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5M H200 GPU cluster and routinely run batch jobs on tens of thousands of machines. This isn't your average startup :)


As a Data Engineer, you'll architect and build the data infrastructure that powers everything we do—from crawling billions of pages to training our embedding models to serving real-time search. You'll have enormous autonomy in designing systems that scale to hundreds of petabytes. If you've ever wanted to build data pipelines at a scale that most companies only dream about, this is your chance.


Desired Experience

  • Deep understanding of lakehouse architectures (Delta Lake, Iceberg, Hudi) and when to use them

  • Experience building and operating large-scale distributed data processing pipelines

  • Hands-on experience with streaming data systems (Kafka, Flink, or similar)

  • Familiarity with Ray, Spark, or ClickHouse at production scale

  • An obsessive focus on reliability and building systems that don't page you at 3am


Bonus Points

  • Experience with Lance or other vector-native storage formats

  • Background in GPU-accelerated data processing (RAPIDS, cuDF)


Example Projects

  • Design a lakehouse architecture that handles 100+ PB of web crawl data

  • Build streaming pipelines that process billions of documents per day for real-time indexing

  • Architect the data layer for our embedding training infrastructure on Ray

  • Scale our ClickHouse deployment to handle analytical queries across petabytes of search logs


This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H-1B, O-1, E-3).

Average salary estimate

$210,000 per year (est.)
Min: $160,000
Max: $260,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Full-time, onsite
Date posted: December 18, 2025