Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Senior Data Platform Engineer image - Rise Careers
Job details

Senior Data Platform Engineer

Welcome to Crosby, the next-generation law company!

We're a team of technologists and legal experts collaborating closely to reimagine corporate legal services from the ground up. We build proprietary technology and human-in-the-loop workflows to radically enhance the lawyer-machine relationship. Our mission is to review complex documents faster than ever and with perfect quality.

Crosby was founded by Ryan (Penn, Stanford Law, ex-Cooley, former GC) and John (Penn M&T, ex-Ramp, ex-Google).

We believe:

  • A great legal system is the watermark of a great society.

  • Legal work is art and science. We are discovering the frontier between the two— codifying what can be systematized, and amplifying human expertise where judgment matters most.

  • The right way to deploy AI in law isn’t by selling software. It’s by selling outcomes — higher-quality legal work, delivered faster and more efficiently

  • In an in-person culture at our NYC office.

The Role

As a Founding Data Platform Engineer, you will build and own the technical backbone that powers Crosby's entire AI-driven platform. Your mission is to design, build, and scale the data infrastructure responsible for ingesting, processing, and storing vast amounts of complex legal documents. You will create the reliable, high-performance systems that our machine learning models and application engineers depend on to transform the legal industry. This is a foundational role with a massive impact on our ability to scale.

What You’ll Do

  • Build the core pipeline: Design and build scalable data ingestion and processing pipelines for unstructured documents (PDFs, DOCX) using tools like Python and Prefect.

  • Own the storage layer: Manage our core data storage layer, including PostgreSQL with pgvector for hybrid search and Redis for high-speed caching.

  • Power AI Workloads: Develop and maintain our serverless compute and ML workloads using infrastructure like Modal and AWS.

  • Codify Infrastructure: Implement and manage robust, version-controlled infrastructure using Infrastructure as Code (Pulumi).

  • Fuel Product Velocity: Partner closely with ML and application engineers to build the APIs and data models they need to launch new, AI-powered features.

  • Raise the bar: Define best practices for data quality, observability, and reliability across the entire platform.

What We’re Looking For

  • Experience: 3+ years of experience in data engineering, platform engineering, or a related backend role.

  • Strong Programming Skills: Deep proficiency in Python and a strong command of SQL.

  • Cloud Infrastructure: Hands-on experience building and managing infrastructure on AWS (e.g., RDS, ECS, S3).

  • Builder Mindset: You are excited about the opportunity to build data systems from the ground up and make foundational architectural decisions.

  • Data Modeling: A strong understanding of how to model data for both analytical and operational use cases.

  • Ownership: You are driven to take full responsibility for the reliability and performance of the systems you build.

Ideal Qualifications

  • Direct experience with workflow orchestration tools like Prefect or Airflow.

  • Experience with Infrastructure as Code tools like Pulumi or Terraform.

  • Familiarity with vector databases and search technologies (pgvector, Pinecone, etc.).

  • Experience in a fast-paced, early-stage startup environment.

  • Knowledge of serverless compute platforms like Modal or AWS Lambda.

Ideal Qualifications

  • Direct experience with workflow orchestration tools like Prefect or Airflow.

  • Experience with Infrastructure as Code tools like Pulumi or Terraform.

  • Familiarity with vector databases and search technologies (pgvector, Pinecone, etc.).

  • Experience in a fast-paced, early-stage startup environment.

  • Knowledge of serverless compute platforms like Modal or AWS Lambda.

Why Work at Crosby

  • Foundational Impact: As a founding engineer, you will shape the data platform, culture, and technical direction of the entire company.

  • Competitive salary and equity compensation.

  • Comprehensive health, dental, and vision insurance.

  • Unlimited PTO.

  • In-person team in NYC with a collaborative, high-energy environment.

Apply now to join Crosby and be part of transforming the legal landscape.

Average salary estimate

$185000 / YEARLY (est.)
min
max
$150000K
$220000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Posted 16 hours ago

Sigma Squared is hiring a Cambridge-based Data Engineer to design and maintain scalable ETL pipelines and data infrastructure powering analytics and ML for enterprise customers.

Moneta Hybrid 1 N Brentwood Blvd, Clayton, MO 63105, USA
Posted 20 hours ago

Moneta seeks a hands-on Data Engineer to build and maintain scalable Microsoft-centric data pipelines and support analytics across Snowflake, SQL Server, and Microsoft Fabric.

Photo of the Rise User
Posted 24 hours ago

Dechert LLP is hiring a Business Intelligence Developer to design and maintain ETL processes, optimize SQL Server data workflows, and support reporting using SSIS, SSRS, SSAS, Python and Tableau.

Photo of the Rise User
GitLab Hybrid Remote, North America
Posted 15 hours ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Customer-Centric
Social Impact Driven
Dare to be Different
Maternity Leave
Paternity Leave
401K Matching
Paid Holidays
Paid Time-Off
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)

Lead and scale a revenue analytics function at GitLab to deliver forecasting, churn/expansion insights, and services analytics that inform executive decisions and accelerate growth.

Haus Analytics Hybrid No location specified
Posted 5 hours ago

Lead Haus’s analytics strategy and team to deliver GTM and product insights, build robust data pipelines and metric layers, and embed data-driven decision-making across the company.

Edwards Hybrid USA - California - Irvine - Hybrid
Posted 18 hours ago

Edwards Lifesciences is seeking a Clinical Data Management Analyst to specify EDC requirements, maintain data integrity, and support clinical studies for the TMTT portfolio in a hybrid Irvine-based role.

Photo of the Rise User
Posted 5 hours ago

Senior Collibra Data Quality SME needed to design, implement, and administer enterprise Collibra DQ solutions with strong Java/API skills for a hybrid onsite/remote contract in New York.

Photo of the Rise User

Visa is seeking a Senior Business Intelligence Engineer to lead data engineering, modeling, and AI-enabled analytics for the Visa Marketing 360 platform.

Photo of the Rise User
Posted 12 hours ago

A regulated-industry partner is hiring a Cloud Data Engineer to architect, automate, and maintain AWS, Azure, and private cloud solutions that support finance, clinical, and operational business needs.

Silgan Containers Hybrid Chesterfield, Missouri
Posted 14 hours ago

Silgan is hiring a Senior Data Engineer to build scalable ETL pipelines, optimize complex SQL across multiple platforms, and deliver actionable reporting using Power BI and Tableau.

Photo of the Rise User
Jobgether Hybrid No location specified
Posted 15 hours ago

Experienced Data Engineer needed to build and operate secure, production-grade ETL/ELT pipelines and migrate legacy data systems to modern, scalable architectures in a remote US role.

Photo of the Rise User
Posted 42 minutes ago

Conagra Brands is hiring a Senior Analytics Engineer to modernize finance analytics and deliver scalable data pipelines, models, and BI solutions that empower finance decision-making.

Sia Hybrid 48 Wall St, New York, NY 10005, USA
Posted 14 hours ago

Lead analytics initiatives for Healthcare and Life Sciences clients at Sia, driving ETL, analyses, dashboards and predictive modeling to turn data into actionable business insights.

MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
October 23, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!