Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver optimized small language models designed for retrieval, application, and core code generation functions.
Our technology supports some of the world’s fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we’re growing quickly.
Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.
We’re looking for a Machine Learning Engineer who loves getting close to the metal. This is a hands-on engineering role focused on making models faster, more efficient, and more reliable through low-level optimizations and smart systems design.
The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance out of complex training and inference workloads. They should be just as comfortable optimizing compute and networking paths as they are working alongside research teams to productionize new architectures.
This is a role for someone who enjoys deep performance tuning, understands the realities of running large-scale ML systems, and thrives in fast-moving, high-leverage environments.
Strong background in systems-level ML engineering.
Experience with CUDA, GPU kernel optimization, and performance tuning.
Fluency in Python and at least one systems language (C++ or Rust preferred).
Familiarity with distributed training frameworks (e.g., PyTorch, JAX, DeepSpeed, or similar).
Experience working with large-scale training or inference infrastructure.
Understanding of memory management, parallelization, and hardware-aware model optimization.
2+ years of experience working in ML infrastructure or performance-critical environments.
Willingness to work in-person from our SF office in FiDi.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Work as a hands-on engineering intern building GPU-optimized AI infrastructure and inference systems with a San Francisco-based team.
Join Relace as a Machine Learning Scientist to push the limits of small, high-performance language models used for retrieval and code generation at scale.
Palo Alto Networks seeks a Principal Engineer to lead design and implementation of dataplane and L2-L4 network security features for its next-generation firewalls.
Lead development of low-latency, mission-critical software for infrared imaging and ladar systems at Anduril, using C++ and Python to deliver production-ready embedded solutions.
Experienced .NET/React engineer needed to develop and maintain enterprise applications at Milwaukee Tool, contributing to architecture, CI/CD, and cloud-based solutions.
Experienced Go backend engineer needed to build and own the backend for an AI-driven observability platform focused on high-throughput customer support in travel and transport.
Oumi seeks a Platform Engineer to design and implement scalable backend systems and infrastructure for an open, research-driven AI platform.
Temporal is hiring a Senior Software Engineer on the Release Engineering team to design, build, and operate fully automated release and deployment pipelines using Temporal and modern cloud-native tooling.
Drive the technical vision and hands-on delivery of a global payments platform as a Principal Backend Engineer, building scalable, secure systems that move money for millions of businesses.
Disney's Product Engineering team is hiring Fullstack Software Engineering Interns to build and ship scalable frontend and backend features for Disney+, Hulu, and ESPN.
Lead end-to-end feature development for an AI/ML-driven narrative intelligence platform as a Senior Full-Stack Engineer, owning UI, APIs, search, and production services.
dLocal is hiring a Senior DevOps Engineer to build self-service platforms, automate CI/CD and infrastructure, and lead cloud-native, secure deployments in a remote-first fintech environment.
Work as a ReactJS UI Developer crafting modular, high-performance user interfaces for a remote US-based product team.
Agile Six is hiring a frontend-focused Fullstack Engineer to build accessible, performant web and mobile features that improve Veterans' access to healthcare and benefits.
Software Engineer II to design, build, and operate reliable, scalable crypto payment backend systems using Java/Spring and cloud-native technologies at Ripple.
SpringRole is the first professional reputation network powered by artificial intelligence and blockchain to eliminate fraud from user profiles. Because SpringRole is built on blockchain and uses smart contracts, it's able to verify work experienc...
595 jobs