Job details

Senior/Staff Backend Engineer - Distributed System

About Us

At Zettabyte, we’re on a mission to make AI compute ubiquitous, seamless, and limitless. We’re building a cloud where AI just works—anywhere, anytime. “AI Power. Everywhere.” Be part of the team designing the infrastructure for the AI-first world.

Why this role exists

We need a Backend Engineer to build the systems that orchestrate GPU clusters for AI workloads. You'll create APIs that handle GPU allocation, memory management, compute scheduling, and multi-tenant isolation: challenges unique to AI infrastructure that go far beyond typical backend engineering. As part of our backend team, you'll solve problems like: How do we efficiently share expensive GPU resources across users? How do we handle GPU memory constraints for large AI models? How do we ensure quality of service when workloads compete for compute? This is an opportunity to build infrastructure where every API call could allocate thousands of dollars' worth of compute per hour, and where your optimizations directly impact whether AI startups can afford to train their models.

What you’ll do

  • Design APIs that abstract complex GPU operations into simple developer experiences

  • Build scheduling algorithms that maximize GPU utilization while ensuring SLA compliance

  • Develop resource management systems for GPU lifecycle—provisioning, allocation, scheduling, and release

  • Create usage tracking and billing systems for GPU-hours, memory usage, and compute utilization

  • Implement monitoring for GPU-specific metrics, health checks, and automatic failure recovery

  • Build multi-tenancy systems with resource isolation, quota management, and fair scheduling

  • Optimize cold starts for model serving and implement efficient model loading strategies

  • Collaborate with frontend engineers to expose complex infrastructure through intuitive interfaces

  • Leverage AI-assisted coding tools (GitHub Copilot, Claude Code, Cursor IDE, etc.) to boost productivity and code quality

You’ll thrive here if you

  • Have 5+ years of backend engineering experience with distributed systems

  • Are strongly proficient in Go, Python, or a similar backend language

  • Have experience with resource scheduling, orchestration, and API design (REST, GraphQL, gRPC)

  • Understand hardware constraints and system optimization

  • Know Linux systems and have containerization experience (Docker, Kubernetes)

  • Are comfortable working with expensive resources where efficiency directly impacts costs

  • Are excited about solving novel problems in AI infrastructure (not just another CRUD app)

  • Have a startup mindset: comfortable with ambiguity and rapid iteration

Bonus qualifications

  • GPU or HPC cluster management experience

  • Understanding of ML/AI workload patterns and requirements

  • Experience with high-value resource allocation systems

  • Background in performance optimization for compute-intensive workloads

  • Familiarity with GPU virtualization and sharing technologies

  • Experience building billing or metering systems

Details

  • We provide competitive salary and meaningful equity, based on your experience and skillset.

  • This is a hybrid role: 3 days in office, 2 days WFH. You must be located in Palo Alto and able to commute to the local office.

  • Please note that this position is open to U.S. citizens and permanent residents only; visa sponsorship is not available.

Average salary estimate

$215,000 / year (est.)
Min: $170,000
Max: $260,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.


Zettabyte is a software development company that focuses on the education sector. We work together with our multicultural team from our offices in Singapore, Bali, Yogyakarta, Pune, and Paris to create and produce tools that increase the quality o...

Employment type: Full-time, hybrid
Date posted: October 27, 2025