Modal is building the serverless compute platform to support the next generation of AI companies. In order to deliver the developer experience we wanted, we went deep and built our own infrastructure—including our own custom file system , container runtime, scheduler, container image builder, and much more.
We're a small team based out of New York, Stockholm and San Francisco. In just one year, we've reached 8-figure revenue, tripled our headcount, scaled to support thousands of GPUs, and raised over $32M in funding.
Working at Modal means joining one of the fastest-growing AI infrastructure organizations at an early stage, with many opportunities to grow within the company. Our team includes creators of popular open-source projects (e.g. Seaborn, Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.
At Modal, we dynamically scale workloads across many cloud providers (see Linear Programming for Fun and Profit). We are looking for engineers with backgrounds spanning cloud infra, data engineering, and mathematical optimization to push this system to the next level.
This role is for people who are deep systems thinkers and love optimizing things. You will be responsible for the system end-to-end, including things like:
Negotiating with cloud vendors and influencing our cloud strategy.
Modeling GPU and resource costs.
Coming up with, backtesting, and rolling out new optimizations.
Designing the next iteration of our workload scheduling system.
Pricing new product offerings and GPU types.
5+ years of experience writing high-quality production code.
Strong cloud skills, and deep familiarity with at least one of AWS, GCP, Azure, Oracle Cloud.
Experience with data science, visualization, and statistics.
A love for optimizing stuff, especially the bottom line.
Familiarity with linear programming solvers and related optimization techniques is a plus.
Ability to work in-person in our NYC or Stockholm office.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
An early-career AI Engineer role supporting integration of machine learning into product design, testing, and manufacturing processes for an industrial equipment engineering team.
PermitFlow is hiring an Engineering Manager in San Francisco to lead and scale a high-performing engineering team building AI-enabled permitting software on a hybrid schedule.
Join Voltai as a Product Engineer to build scalable, high-quality web experiences and generative UI systems that advance semiconductor and electronics design workflows.
Experienced Full Stack Engineer (Golang + Angular) needed to develop secure, high-performance healthcare software in a remote-first environment.
Safran Passenger Innovations seeks a Software Engineer, Test Automation to develop and maintain automated test frameworks and validate IFEC software to ensure high reliability and product quality.
Build and scale production-quality React/TypeScript UIs at August to power agentic AI workflows that augment legal work for mid-market law firms.
WHOOP is hiring a Software Engineer II on the Business Systems team to build integrations, APIs, and internal tools that power critical business operations from the Boston office.
Parafin is hiring an early-career Applied AI Engineer to build production GenAI systems that drive underwriting, risk, and merchant integration improvements for small-business finance products.
Experienced BI-focused Software Engineer needed to develop Power BI dashboards, optimize data pipelines, and apply predictive analytics to support USACE reporting and decision-making.
Become a core engineer at Dialogue AI to build full-stack systems and AI integrations that accelerate customer research and define the product from day one.
Lead the design and production deployment of AI-first tooling to empower engineering teams and speed electric aviation development at BETA Technologies.
Experienced systems engineer needed to design and implement advanced auto-scaling and performance optimizations for a cloud-native distributed database platform supporting global enterprise customers.
Work alongside experienced engineers at a mission-driven startup to build and test real-time telemetry features as a Software Engineering Intern in El Segundo.
At Modal, we build the future of auto commerce for the world’s largest auto brands and retailers. We take the moving parts of an auto purchase transaction and assemble them into a simple, digital transaction flow that seamlessly fits any webpage a...
4 jobs