We are looking for a high-caliber Data Engineer who can architect and scale the data systems that power our AI workflows. You’ll be responsible for building reliable data pipelines, integrating external APIs, maintaining clean and structured data models, and enabling the product and ML teams to iterate quickly.
You should thrive in ambiguous environments, enjoy wearing multiple hats, and be comfortable designing end-to-end data solutions with minimal direction.
Design, build, and maintain scalable data pipelines that process and transform large volumes of structured and unstructured data.
Manage ingestion from third-party APIs, internal systems, and customer datasets.
Develop and maintain data models, data schemas, and storage systems optimized for ML and product performance.
Collaborate with ML engineers to prepare model-ready datasets, embeddings, feature stores, and evaluation data.
Implement data quality monitoring, validation, and observability.
Work closely with product engineers to support new features that rely on complex data flows.
Optimize systems for performance, cost, and reliability.
Contribute to early architecture decisions, infrastructure design, and best practices for data governance.
Build tooling that enables the entire team to access clean, well-structured data.
You’re a hands-on engineer who thrives in a fast-paced environment, enjoys autonomy, and takes ownership of problems from start to finish.
You translate technical complexity into clarity. You work well with ML, product, and GTM partners.
You can design elegant systems but default to shipping solutions that work and can be iterated on.
You care about clean pipelines, reproducibility, and data correctness.
3+ years of experience as a Data Engineer, ML Engineer, Backend Engineer, or similar.
Proficiency in Python, SQL, and modern data tooling (dbt, Airflow, Dagster, or similar).
Experience designing and operating ETL/ELT pipelines in production.
Experience with cloud platforms (AWS, GCP, or Azure).
Familiarity with data lakes, warehouses, and vector databases.
Experience integrating APIs and working with semi-structured data (JSON, logs, event streams).
Strong understanding of data modeling and optimization.
Bonus: experience supporting LLMs, embeddings, or ML training pipelines.
Bonus: startup experience or comfort working in fast, ambiguous environments.
Stable, documented, testable pipelines powering ML and product features.
High-quality data consistently available for analytics, modeling, and core product workflows.
Faster iteration cycles for the Engineering and ML teams due to improved tooling.
Clear visibility into data quality and reliability.
Strong cross-functional collaboration and communication.
Build core systems at the heart of a fast-growing AI company.
High autonomy, high impact, zero bureaucracy.
Work with a talented, ambitious team solving meaningful problems.
Shape the data platform from the ground up.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Senior Data Engineer needed to lead real-time data ingestion, transformation, and visualization across cloud and streaming platforms for Highmark/enGen (U.S. citizens only).
NVIDIA seeks a Principal Data Platform Architect to lead architecture and delivery of large-scale observability and data platforms for AI and HPC clusters.
Promise is hiring a Data Engineer to build and scale reliable data pipelines, warehouses, and data products that enable data-driven decisions across product, operations, and finance.
Lead the Commercial Real Estate BI function to design scalable analytics, reporting, and visualizations that drive strategic portfolio and executive decisions at Flagstar Bank.
Amigo is hiring a Senior Software Engineer (Data) to design and operate Databricks streaming and batch pipelines that enable analytics, research, and continuous improvement of clinical AI agents.
UC Irvine Libraries is hiring a Digital Initiatives Librarian to manage and grow digital collections and infrastructure while supporting digital scholarship, rights review, and campus outreach.
Senior Associate Data Engineer to develop and maintain ETL/Databricks pipelines and integrations that power Rx Savings Solutions' analytics and operational reporting.
Lead enterprise risk analytics strategy and architecture at Lilly to turn cross-functional risk data into executive insights that drive strategic decisions and measurable business outcomes.
F5 is hiring a seasoned Director of Enterprise Data Management to build enterprise-scale data governance, security, and AI oversight that enables responsible, high-impact analytics.
Artsy is hiring an experienced data leader to define and operationalize a company-wide data strategy, translating analytics into product and revenue growth for its marketplace.
Senior Data Engineer needed to design and maintain complex ETL and backend systems (Python, NodeJS, Java) for Elsevier's clinical AI and healthcare data products.
Join Flyway Health as a founding Data/AI engineer to build and scale AI-driven data pipelines and agent systems that power enterprise life-sciences insights.
Design, build, and operate scalable GCP data pipelines and analytics schemas at a fast-growing diagnostics startup powering customer products, analytics, and AI models.
Artisan Components, Inc. is a leading provider of physical intellectual property (IP) components for the design and manufacture of complex system-on-a-chip integrated circuits. Artisan's products include embedded memory, standard cell, input/outpu...
1 jobs