It's an exciting time to join the NVIDIA Cloud Native Engineering (NVCNE) group’s backend software team! As a Cloud Platform Software Engineer, you will work alongside architects, designers, frontend engineers, SREs and others to deliver a software platform that supports the lifecycle of Artificial Intelligence (AI) super compute infrastructure on Kubernetes. Together we will enable AI services across the cloud.
The engineer will write software that aligns with the broad architectural vision for the NVIDIA Cloud Platform, working with architects to develop a robust, scalable system. The engineer owns their code, from development to commit to test to production. They will be responsible for providing development support to SRE teams and for collaborating with internal product teams on sophisticated distributed systems problems at scale. This engineer will be encouraged to foster NVIDIA’s approach to Cloud Native development, especially Kubernetes.
What you will be doing:
Develop software systems to support large scale deployments of cloud infrastructure
Design, develop and distribute APIs to support Infrastructure as Code (IaC) automation and deployment workflows.
Contribute to multiple source code projects to fulfill NVIDIA requirements with software services
Work and collaborate with engineering managers, architects, designers, and frontend engineers to deliver high quality software
Automate the validation of software solutions with unit and integration tests
Innovate with other engineers on proposed designs and product direction
Openly share successes and failures in a no blame environment
What we need to see:
BS in Computer Science, Information Systems, Computer Engineering (or equivalent experience) and at least 12 years of overall experience
5-7 years of proven experience in large scale software development
Experience building and delivering services on Kubernetes
Proficiency with cloud-native infrastructure (AWS, GCP, Azure, OCI).
Experience collaborating with teams to write software that supports cloud services at scale
Ability to troubleshoot issues across multiple layers: infrastructure, Kubernetes, application runtime.
Strong proficiency in Golang for building Kubernetes operators, controllers, and custom tooling.
Experience designing and managing Kubernetes Custom Resource Definitions (CRDs).
Knowledge of managed Kubernetes services and scaling strategies across cloud and on-prem environments.
Experience developing auto-scaling infrastructure components, and with incident response and root cause analysis.
Ways to stand out from the crowd:
Experience with Kubernetes Cluster API, Terraform, CSP API and other infrastructure tooling
Background with using and contributing to open-source projects
Solid experience with Kustomize or other Kubernetes packaging tools.
Capable of refactoring software to run in systems such as Kubernetes
Ability to discuss and work with CSI, CNI, and CRI as well as familiarity with the CNCF and the tooling across the ecosystem
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're a creative, curious, and driven technical leader, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.