Data Engineer

Role Overview

We are looking for a high-caliber Data Engineer who can architect and scale the data systems that power our AI workflows. You’ll be responsible for building reliable data pipelines, integrating external APIs, maintaining clean and structured data models, and enabling the product and ML teams to iterate quickly.

You should thrive in ambiguous environments, enjoy wearing multiple hats, and be comfortable designing end-to-end data solutions with minimal direction.

What You’ll Own

Design, build, and maintain scalable data pipelines that process and transform large volumes of structured and unstructured data.
Manage ingestion from third-party APIs, internal systems, and customer datasets.
Develop and maintain data models, schemas, and storage systems optimized for ML workloads and product performance.
Collaborate with ML engineers to prepare model-ready datasets, embeddings, feature stores, and evaluation data.
Implement data quality monitoring, validation, and observability.
Work closely with product engineers to support new features that rely on complex data flows.
Optimize systems for performance, cost, and reliability.
Contribute to early architecture decisions, infrastructure design, and best practices for data governance.
Build tooling that enables the entire team to access clean, well-structured data.

Who You Are

Builder Mentality

You’re a hands-on engineer who thrives in a fast-paced environment, enjoys autonomy, and takes ownership of problems from start to finish.

Strong Communication

You translate technical complexity into clarity. You work well with ML, product, and GTM partners.

Practical, Not Academic

You can design elegant systems but default to shipping solutions that work and can be iterated on.

Detail-Oriented & Reliable

You care about clean pipelines, reproducibility, and data correctness.

What You Bring

3+ years of experience as a Data Engineer, ML Engineer, Backend Engineer, or similar.
Proficiency in Python, SQL, and modern data tooling (dbt, Airflow, Dagster, or similar).
Experience designing and operating ETL/ELT pipelines in production.
Experience with cloud platforms (AWS, GCP, or Azure).
Familiarity with data lakes, warehouses, and vector databases.
Experience integrating APIs and working with semi-structured data (JSON, logs, event streams).
Strong understanding of data modeling and optimization.
Bonus: experience supporting LLMs, embeddings, or ML training pipelines.
Bonus: startup experience or comfort working in fast-paced, ambiguous environments.

What Success Looks Like

Stable, documented, testable pipelines powering ML and product features.
High-quality data consistently available for analytics, modeling, and core product workflows.
Faster iteration cycles for the Engineering and ML teams due to improved tooling.
Clear visibility into data quality and reliability.
Strong cross-functional collaboration and communication.

Why Artisan

Build core systems at the heart of a fast-growing AI company.
High autonomy, high impact, zero bureaucracy.
Work with a talented, ambitious team solving meaningful problems.
Shape the data platform from the ground up.

Average salary estimate

$155,000 per year (est.)

Min: $130,000

Max: $180,000

Artisan

Artisan Components, Inc. is a leading provider of physical intellectual property (IP) components for the design and manufacture of complex system-on-a-chip integrated circuits. Artisan's products include embedded memory, standard cell, input/outpu...

FUNDING

Series A

DEPARTMENTS

Data

SENIORITY LEVEL REQUIREMENT

Mid-Level

INDUSTRY

Materials

TEAM SIZE