The NVIDIA DGX Cloud organization is looking for software engineering talent to build NVIDIA’s accelerated compute infrastructure. This includes software to assist in the rapid bring-up, operation, configuration, and trouble-shooting of compute hardware and networking equipment.
As a Principal Systems Software Engineer, you will work with other software engineers, product architects, and product managers as a collaborative team to deliver and support end-to-end software solutions to manage complex cloud infrastructure deployments. You will write services and software that aligns with the broad architectural vision for the NVIDIA Cloud Platform, working with other teams to develop a robust and scalable system. You own your code - from development to commit to test to production, including operational support. We expect you to be passionate about code quality, testing, deployment efficiency/simplicity and bringing amazing products to market.
What you will be doing:
Work with NVIDIA internal customers.
Design and build scalable software systems to manage NVIDIA’s cloud infrastructure.
Participate in responses to real-time operational events.
Build network and systems automation software for managing a multi-tenant cloud infrastructure.
Participate in open-source communities of software we leverage and build.
Present to internal stakeholders and NVIDIA leadership on roadmaps, vision, & demos.
What we need to see:
15+ years of experience with designing and building distributed software systems.
Track record of directly supporting systems with external customers, or demanding internal customers.
BS/MS degree in Computer science or related areas (or equivalent experience).
Demonstrated ability to write code in a mainstream systems programming language such as C, C++, Golang, or Rust.
Demonstrated ability to design and implement maintainable APIs for consumers.
Practical experience with asynchronous programming, type safety, threading models, state machines and data structures.
Background of data persistence (SQL or similar).
Understanding of secure communication protocols (mutual-TLS, IPsec, or similar).
Knowledge of SRE principles (observability, SLOs, logging, etc.)
Ways to stand out from the crowd:
Experience in a Hyperscale Cloud Service Provider (public facing or not).
Understanding of networking protocols such as IP, IPv6, BGP, HTTP, ICMP, tunneling protocols (VXLAN, Geneve, FoU, GRE), etc.
Familiarity with Infiniband networking.
Background with Host management systems (DHCP, Redfish, UEFI) and host security services such as TPM, TXT, and SecureBoot.
Kubernetes and/or distributed task scheduling.
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.
NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence. NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and talented people in the world working for us. If you're creative and passionate about developing cloud services we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 425,500 USD.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Experienced SAP FICO professional needed to lead configuration, integration, and delivery of finance systems at NVIDIA, supporting global finance transformation and production support.
Lead development of high-performance compiler features for GPUs and multicore CPUs at NVIDIA, focusing on MLIR/LLVM-based optimizations and parallelism for C++, Fortran, and Python.
Momentus Technologies is hiring a Senior DevOps Engineer to lead cloud infrastructure, CI/CD, and observability for a fast-growing event management SaaS platform.
Acorns is hiring a Software Engineer II for its Risk Platform to build and maintain secure, scalable microservices and APIs that protect customer assets and support compliance.
Lead a new cross-functional product engineering team at Swiftly to deliver cloud-based transit products that drive real-world impact for agencies and riders.
Lead development and deployment of advanced vision and perception software for high-speed, dynamic mobile robots, collaborating closely with controls, hardware, and research teams.
NVIDIA seeks a Verification Software Engineer to build and maintain verification tools and test automation for NVLink and InfiniBand switch operating systems.
Experienced software engineer needed to design, code, and evolve healthcare-oriented software products while supporting testing and DevOps in a remote capacity for Sentara Health.
Work within LG Ad Solutions' TV Labs in Denver to build full-stack CTV advertising applications using the MERN stack and GoLang, contributing to high-performance, scalable ad tech systems.
Senior Cloud Software Engineer needed to develop, integrate, and sustain large-scale distributed data and cloud solutions for federal customers under an active TS/SCI requirement.
Lead and mentor remote engineering teams to deliver secure, resilient desktop (C++) and cloud backend solutions for a globally deployed, security-focused product.
Senior Software Engineer (Hardware Test) to build Python-driven test frameworks, drivers, and scalable test infrastructure that validate aircraft components and increase hardware reliability.
Senior Power Apps Developer needed to architect and implement enterprise Power Platform solutions (Power Apps, Power Automate, Dataverse, SharePoint, and Power BI) to automate workflows and deliver operational analytics for VA programs.
Senior Salesforce Full Stack Developer needed to lead architecture, integrations, and custom development across Quench's Salesforce-centric platform to improve scalability and operational efficiency.
Lead multiple engineering teams at a remote-first, AI-driven SaaS company as Senior Director of Engineering to drive technical excellence, operational rigor, and scalable product delivery.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
226 jobs