We're a team of technologists and legal experts collaborating closely to reimagine corporate legal services from the ground up. We build proprietary technology and human-in-the-loop workflows to radically enhance the lawyer-machine relationship. Our mission is to review complex documents faster than ever and with perfect quality.
Crosby was founded by Ryan (Penn, Stanford Law, ex-Cooley, former GC) and John (Penn M&T, ex-Ramp, ex-Google).
We believe:
A great legal system is the watermark of a great society.
Legal work is art and science. We are discovering the frontier between the two— codifying what can be systematized, and amplifying human expertise where judgment matters most.
The right way to deploy AI in law isn’t by selling software. It’s by selling outcomes — higher-quality legal work, delivered faster and more efficiently
In an in-person culture at our NYC office.
As a Founding Data Platform Engineer, you will build and own the technical backbone that powers Crosby's entire AI-driven platform. Your mission is to design, build, and scale the data infrastructure responsible for ingesting, processing, and storing vast amounts of complex legal documents. You will create the reliable, high-performance systems that our machine learning models and application engineers depend on to transform the legal industry. This is a foundational role with a massive impact on our ability to scale.
Build the core pipeline: Design and build scalable data ingestion and processing pipelines for unstructured documents (PDFs, DOCX) using tools like Python and Prefect.
Own the storage layer: Manage our core data storage layer, including PostgreSQL with pgvector for hybrid search and Redis for high-speed caching.
Power AI Workloads: Develop and maintain our serverless compute and ML workloads using infrastructure like Modal and AWS.
Codify Infrastructure: Implement and manage robust, version-controlled infrastructure using Infrastructure as Code (Pulumi).
Fuel Product Velocity: Partner closely with ML and application engineers to build the APIs and data models they need to launch new, AI-powered features.
Raise the bar: Define best practices for data quality, observability, and reliability across the entire platform.
Experience: 3+ years of experience in data engineering, platform engineering, or a related backend role.
Strong Programming Skills: Deep proficiency in Python and a strong command of SQL.
Cloud Infrastructure: Hands-on experience building and managing infrastructure on AWS (e.g., RDS, ECS, S3).
Builder Mindset: You are excited about the opportunity to build data systems from the ground up and make foundational architectural decisions.
Data Modeling: A strong understanding of how to model data for both analytical and operational use cases.
Ownership: You are driven to take full responsibility for the reliability and performance of the systems you build.
Direct experience with workflow orchestration tools like Prefect or Airflow.
Experience with Infrastructure as Code tools like Pulumi or Terraform.
Familiarity with vector databases and search technologies (pgvector, Pinecone, etc.).
Experience in a fast-paced, early-stage startup environment.
Knowledge of serverless compute platforms like Modal or AWS Lambda.
Direct experience with workflow orchestration tools like Prefect or Airflow.
Experience with Infrastructure as Code tools like Pulumi or Terraform.
Familiarity with vector databases and search technologies (pgvector, Pinecone, etc.).
Experience in a fast-paced, early-stage startup environment.
Knowledge of serverless compute platforms like Modal or AWS Lambda.
Foundational Impact: As a founding engineer, you will shape the data platform, culture, and technical direction of the entire company.
Competitive salary and equity compensation.
Comprehensive health, dental, and vision insurance.
Unlimited PTO.
In-person team in NYC with a collaborative, high-energy environment.
Apply now to join Crosby and be part of transforming the legal landscape.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Sigma Squared is hiring a Cambridge-based Data Engineer to design and maintain scalable ETL pipelines and data infrastructure powering analytics and ML for enterprise customers.
Moneta seeks a hands-on Data Engineer to build and maintain scalable Microsoft-centric data pipelines and support analytics across Snowflake, SQL Server, and Microsoft Fabric.
Dechert LLP is hiring a Business Intelligence Developer to design and maintain ETL processes, optimize SQL Server data workflows, and support reporting using SSIS, SSRS, SSAS, Python and Tableau.
Lead and scale a revenue analytics function at GitLab to deliver forecasting, churn/expansion insights, and services analytics that inform executive decisions and accelerate growth.
Lead Haus’s analytics strategy and team to deliver GTM and product insights, build robust data pipelines and metric layers, and embed data-driven decision-making across the company.
Edwards Lifesciences is seeking a Clinical Data Management Analyst to specify EDC requirements, maintain data integrity, and support clinical studies for the TMTT portfolio in a hybrid Irvine-based role.
Senior Collibra Data Quality SME needed to design, implement, and administer enterprise Collibra DQ solutions with strong Java/API skills for a hybrid onsite/remote contract in New York.
Visa is seeking a Senior Business Intelligence Engineer to lead data engineering, modeling, and AI-enabled analytics for the Visa Marketing 360 platform.
A regulated-industry partner is hiring a Cloud Data Engineer to architect, automate, and maintain AWS, Azure, and private cloud solutions that support finance, clinical, and operational business needs.
Silgan is hiring a Senior Data Engineer to build scalable ETL pipelines, optimize complex SQL across multiple platforms, and deliver actionable reporting using Power BI and Tableau.
Experienced Data Engineer needed to build and operate secure, production-grade ETL/ELT pipelines and migrate legacy data systems to modern, scalable architectures in a remote US role.
Conagra Brands is hiring a Senior Analytics Engineer to modernize finance analytics and deliver scalable data pipelines, models, and BI solutions that empower finance decision-making.
Lead analytics initiatives for Healthcare and Life Sciences clients at Sia, driving ETL, analyses, dashboards and predictive modeling to turn data into actionable business insights.