Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA is widely recognized as one of the most desirable employers, with some of the most talented people in the world working for us. If you're passionate about building scalable, efficient systems to power cloud operations, we invite you to join our team.
We are looking for a Lead Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a technical lead role in designing scalable cloud services that integrate with diverse systems including GPU telemetry in datacenters, and enabling operational automation across global cloud operations.
What You'll Be Doing:
Act as technical lead for a team of software engineers designing cloud services backed by databases and data warehouses.
Design and develop RESTful APIs to ingest telemetry from AI datacenters.
Build scalable cloud services for high-volume ingestion, processing, and storage of large datasets.
Build and manage data pipelines for online and offline data storage.
Collaborate across teams to codify business processes into scalable, self-measuring systems.
Optimize the reliability and efficiency of cloud services and operations.
Lead and ship impactful technical projects, ensuring quality and scalability at every stage.
What We Need To See:
At least 12+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience); PhD degree preferred.
Expertise in building scalable REST APIs backed by PostgreSQL-compatible data stores.
Proficiency in programming languages such as Go, Java, or Python.
Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).
Expertise in cloud infrastructure (AWS, GCP, Azure, etc) and container technologies like Docker and Kubernetes.
Expertise with high-scale distributed systems, including architectural patterns for APIs and data pipelines.
Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.
A passion for delivering scalable and efficient cloud services.
Familiarity with Linux operating systems.
Ways to Stand Out from the Crowd:
A track record of leading engineers to successful delivery and operations of high-performance cloud services at Internet scale.
Experience operating NVIDIA datacenter GPUs.
Strong debugging and problem-solving skills in distributed environments.
NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on cutting-edge technology that powers the future of AI and cloud computing.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 425,500 USD for Level 6.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA seeks an experienced Senior Network Operations Engineer to operate and troubleshoot large-scale datacenter and cloud networks supporting AI and graphics workloads.
Lead NVIDIA's technical marketing for AI Factory scale-out networking, demonstrating how Ethernet, InfiniBand, DPUs, and SuperPOD designs deliver predictable performance, efficiency, and compelling TCO for hyperscale datacenters.
Lead development and productization of large-scale deep learning models for digital biology at NVIDIA, building scalable microservice-based AI solutions for life sciences.
Merlin Labs is hiring a Simulation Software Engineer to develop and integrate scalable simulation systems that power autonomous aviation testing, training, and validation.
Booz Allen is hiring a Senior Application Developer to lead back-end/.NET development and database-integrated solutions for mission-critical grant processing applications.
Work on Starlink’s wireless software stack to design, implement, and test PHY and MAC layer software that operates on large-scale satellite networks.
Pylon is hiring a Software Engineer, Security to own product security and compliance at a rapidly scaling B2B post-sales platform.
DXC Technology is hiring a ServiceNow Senior Developer to lead development and administration of ITSM modules, integrations, CMDB, and platform enhancements in a hybrid work model.
DXC Technology seeks a Mainframe Application Developer in Plano, TX to work hybrid (2 days onsite) on COBOL/CICS mainframe development, debugging, and documentation as part of a collaborative engineering team.
Lead reliability and automation initiatives to keep critical enterprise SaaS systems performant and highly available in a remote-friendly, fast-paced environment.
Lead design and delivery of scalable, production-ready systems for an AI-driven healthcare revenue-cycle platform, working end-to-end across product, operations, and engineering.
Lead NBCUniversal's developer platforms and AI-enabled SDLC initiatives as a Principal Software Engineer driving cloud control plane, API governance, observability, and developer tooling at enterprise scale.
ServiceNow is hiring a Senior Staff Fullstack Software Engineer to lead full-stack development and technical direction for the Connected Customer Experience team, delivering scalable, configurable web solutions.
Help build and maintain critical Python-based test and calibration tools for Formlabs' manufacturing lines, working onsite with cross-functional engineering teams.
Mercor is hiring a Senior Software Engineer to design and scale production-grade systems that power workflows for leading AI labs and support rapid company growth.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
170 jobs