Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Backend Software Engineer (ML Infra) image - Rise Careers
Job details

Backend Software Engineer (ML Infra)

Rockstar is recruiting for a mobile-first digital product studio that turns ideas into extraordinary experiences. They are a team of dynamic and savvy professionals who know how to create killer digital products. Our lean structure and remote team mean we can move fast while still delivering top-notch technology and design.

Our client is building the AI backbone for the next generation of intelligent products. They help fast-growing AI startups design, fine-tune, evaluate, deploy, and maintain specialized models across text, vision, and embeddings.

Think of them as “AWS for AI models”—not data or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference, and long-term model maintenance.

Their customers are Series A–C AI companies building enterprise-grade products. Their promise is simple: they make your AI system better.

They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core systems that power large-scale model training and deployment.

The candidate will work on distributed training pipelines, cloud-native infrastructure, and internal developer platforms that support fine-tuning, reinforcement learning, and inference at scale. This role sits at the intersection of backend engineering and ML systems—the candidate will collaborate closely with ML engineers while owning production-grade infrastructure.

This is an ideal role for an early-career engineer who wants to work on real distributed systems, GPU workloads, and modern ML infrastructure—not dashboards or CRUD apps.

What You’ll Do

Build & Scale Core Infrastructure

- Design and implement backend systems that support large-scale ML workloads, including fine-tuning and reinforcement learning.

- Build distributed training and inference pipelines that are efficient, fault-tolerant, and observable.

- Develop internal developer tools and platforms that make it easier for ML engineers to train, evaluate, and deploy models.

Cloud & Systems Engineering

- Work on cloud-native systems using containers and orchestration (e.g., Kubernetes).

- Optimize systems for performance, reliability, and cost efficiency, especially for GPU-heavy workloads.

- Implement monitoring, logging, and observability for long-running training jobs and production services.

Collaborate with ML Engineers

- Partner closely with ML engineers to support evolving model architectures, training workflows, and evaluation needs.

- Translate ML requirements into scalable backend and infrastructure solutions.

Who You Are

Required

- 1–3 years of backend engineering experience, ideally working on production systems.

- Strong fundamentals in distributed systems, networking, and backend architecture.

- Experience building systems that scale under real load.

- Comfortable working in Python and/or Go (or similar backend languages).

- Excited to work on-site in San Francisco with a fast-moving early-stage team.

Strongly Preferred

- Experience with or exposure to ML infrastructure or ML platforms.

- Familiarity with GPU workloads, training pipelines, or inference systems.

- Experience with containerization and orchestration (Docker, Kubernetes).

- Contributions to or deep familiarity with ML infrastructure libraries such as:

  - Ray

  - vLLM

  - SGLang

  - or similar distributed ML systems

Bonus

- Computer science background from a top-tier program or equivalent demonstrated excellence.

- Open-source contributions, research projects, or side projects in systems or ML infrastructure.

- A track record of high ownership and technical curiosity.

Average salary estimate

$160000 / YEARLY (est.)
min
max
$140000K
$180000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Posted 19 hours ago

CDW is hiring a Senior Software Engineer I to build and maintain large-scale business applications using .NET, APEX/Visualforce, SQL Server and modern web and cloud technologies.

Photo of the Rise User
Posted 16 hours ago

Work closely with a small, technical team in San Mateo to build core AI technology and dynamic graphical user interfaces that redefine how people interact with AI.

Photo of the Rise User
Posted 16 hours ago

WorkWhile seeks a product-oriented Backend Software Engineer to design and build scalable Python services that improve reliability and performance for our workforce platform.

Photo of the Rise User

BetterUp is hiring a Senior Site Reliability Engineer to scale and automate AWS/Kubernetes infrastructure while integrating AI-powered observability and incident response.

Photo of the Rise User
Zone IT Solutions Hybrid No location specified
Posted 18 hours ago

Zone IT Solutions is hiring a Blue Prism Developer to build and maintain enterprise RPA solutions in California City.

Photo of the Rise User
Posted 6 hours ago

Relativity Space seeks an early-career Embedded Software Engineer to develop and test high-reliability firmware for Terran R avionics in Long Beach, CA.

Photo of the Rise User

Virtue AI seeks an Inference Engineer to design and operate high-performance, production-ready inference systems for LLMs and embeddings in San Francisco.

Photo of the Rise User
Posted 18 hours ago

Software Engineer (Data Platform) role focused on building and optimizing high-performance, large-scale data systems in Santa Clara that support millions of users.

Photo of the Rise User
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Lead the architecture and delivery of large-scale AIOps and observability platforms at NVIDIA to monitor, diagnose, and optimize millions of assets across cloud and on-prem environments.

Photo of the Rise User

Sandisk is hiring a Staff Platform Engineer to lead the design and implementation of Python-based developer tooling and platform solutions that scale CI/CD, security automation, and developer productivity.

Photo of the Rise User
AbbVie Hybrid Salt Lake City, UT
Posted 9 hours ago

Allergan Aesthetics (AbbVie) is hiring a Staff Software Engineer to lead architecture, mentor engineers, and build scalable consumer-facing web and mobile services for the Allē platform.

Photo of the Rise User
Posted 14 hours ago

Lead the design and delivery of full-stack and mobile tools at Doxel to turn large-scale video, image, 3D and IoT data into fast, reliable workflows used by major contractors and owners.

Photo of the Rise User
StubHub Hybrid New York, New York, United States
Posted 19 hours ago

Lead the engineering efforts to scale and enhance StubHub’s marketplace systems, delivering reliable, high-performance features that connect fans with live events worldwide.

Rockstar games was founded in 1998. This company manufactures complex living world games such as grand theft auto. Their headquarters are located in New York City, New York.

3 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
December 22, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!