At Zettabyte, we’re on a mission to make AI compute ubiquitous, seamless, and limitless. We’re building a cloud where AI just works—anywhere, anytime. “AI Power. Everywhere.” Be part of the team designing the infrastructure for the AI-first world.
We need a Backend Engineer to build the systems that orchestrate GPU clusters for AI workloads. You'll create APIs that handle GPU allocation, memory management, compute scheduling, and multi-tenant isolation: challenges unique to AI infrastructure that go far beyond typical backend engineering. As part of our backend team, you'll solve problems like: How do we efficiently share expensive GPU resources across users? How do we handle GPU memory constraints for large AI models? How do we ensure quality of service when workloads compete for compute? This is an opportunity to build infrastructure where every API call could allocate thousands of dollars' worth of compute per hour, and where your optimizations directly affect whether AI startups can afford to train their models.
Design APIs that abstract complex GPU operations into simple developer experiences
Build scheduling algorithms that maximize GPU utilization while ensuring SLA compliance
Develop resource management systems for GPU lifecycle—provisioning, allocation, scheduling, and release
Create usage tracking and billing systems for GPU-hours, memory usage, and compute utilization
Implement monitoring for GPU-specific metrics, health checks, and automatic failure recovery
Build multi-tenancy systems with resource isolation, quota management, and fair scheduling
Optimize cold starts for model serving and implement efficient model loading strategies
Collaborate with frontend engineers to expose complex infrastructure through intuitive interfaces
Leverage AI-assisted coding tools (GitHub Copilot, Claude Code, Cursor IDE, etc.) to boost productivity and code quality
5+ years of backend engineering experience with distributed systems
Strong proficiency in Go, Python, or similar backend languages
Experience with resource scheduling, orchestration, and API design (REST, GraphQL, gRPC)
Understanding of hardware constraints and system optimization
Linux systems knowledge and containerization experience (Docker, Kubernetes)
Comfortable working with expensive resources where efficiency directly impacts costs
Excited about solving novel problems in AI infrastructure (not just another CRUD app)
Startup mindset—comfortable with ambiguity and rapid iteration
GPU or HPC cluster management experience
Understanding of ML/AI workload patterns and requirements
Experience with high-value resource allocation systems
Background in performance optimization for compute-intensive workloads
Familiarity with GPU virtualization and sharing technologies
Experience building billing or metering systems
We provide competitive salary and meaningful equity, based on your experience and skillset.
This is a hybrid role: 3 days in the office and 2 days working from home. You must be located in Palo Alto and able to commute to the local office.
Please note that this position is open to U.S. citizens and permanent residents only; visa sponsorship is not available.