NVIDIA is looking for outstanding software engineers to help us expand our enterprise GPU management and monitoring tools. In this role, you will work closely with the broader NVIDIA team to design and build cloud-native management agents, Kubernetes integrations, and end-to-end integration solutions that combine GPUs with the rest of the datacenter software management ecosystem. We are focused on supporting NVIDIA products across HPC, cloud, and enterprise on both bare metal and virtualized platforms as the role of GPUs in all of these environments expands. Your contributions will span many aspects of GPU system integration, including telemetry and metrics, health checks, diagnostics, configuration, and system management. These tools fill roles of both passive background monitoring and active online management with a core emphasis on operational transparency and seamless integration in customer environments. Your code will support single-node developer systems through large clusters with thousands of nodes.
To succeed, you must have a strong Linux background, familiarity with modern cloud-native systems, and a proven work ethic. You will be expected to jump in quickly and provide valuable contributions from day one. This is a dynamic work environment with many exciting opportunities awaiting. NVIDIA GPUs are central to many hot enterprise, cloud, and datacenter trends. Come join us as we craft the future of accelerated computing and AI.
What you'll be doing:
Develop and maintain distributed, robust and scalable Go programs deployed to Kubernetes environments that manage large datacenters
Develop and maintain user-space applications, containers, Go-bindings, and CLI tools.
Enable GPU management integration with the state-of-the-art open-source ecosystem, including Kubernetes and Docker.
Support internal and external users through bug fixes, documentation, and feature improvements.
Maintain high-quality products through robust test coverage.
What we need to see:
BS or higher in Computer Science or equivalent experience. 5+ years of meaningful industry experience with a strong Go and Kubernetes development background
User space development and debugging expertise in Linux environments
Experience with APIs and interface design
Outstanding written and verbal interpersonal skills. Business level English
Strong motivation and commitment to learn new skills
Ability to execute all aspects of the software development lifecycle. Ability to manage time in a fast, heavily multitasked environment
Development experience with Rust, Python and/or C, C++. Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment
Experience developing and maintaining enterprise software. Experience deploying, managing, and debugging applications in a Kubernetes environment
Ways to stand out from the crowd:
Background with containers (e.g. Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks with tools such as Prometheus, Loki and Grafana
Experience with modern UI development in React and Node.js or similar frameworks. Experience developing Kubernetes operators or Helm charts
Experience with HPC job schedulers like Slurm or Run.AI Familiarity with Kubernetes internals. Exposure to GPU programming with CUDA. Experience with Jenkins and GitHub/GitLab CI/CD pipelines
You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA invites experienced engineers to architect and optimize large-scale GPU-accelerated HPC and networking systems, contributing to pioneering AI and computing breakthroughs.
Lead the successful introduction of NVIDIA’s next generation GPU based products as a Software Program Manager driving cross-functional collaboration and software system readiness.
Lead frontend development at Mindera, crafting responsive user interfaces with cutting-edge JavaScript frameworks in a supportive and flexible work culture.
Software Engineer II role at LexisNexis Risk Solutions, contributing to designing, coding, and delivering cutting-edge risk assessment software.
Drive impactful full stack solutions at Alloy, a fintech innovator, by enhancing client data integration and streamlining product connectivity.
Datadog is seeking a Senior Software Engineer with expertise in AI and LLMs to develop next-generation developer tools that amplify productivity.
Contribute to cutting-edge backend infrastructure at Apple, ensuring robust software testing platforms for millions of users worldwide.
Parsons is seeking an experienced Software Engineer 3 to design, develop, and lead complex software solutions supporting critical US government missions.
Lead Meta's AI Infrastructure team in building advanced AI technologies and driving innovative AI-powered experiences.
Seeking an experienced Senior Full-Stack Software Engineer to architect and lead complex projects at a healthcare technology company passionate about improving patient care.
Experienced PeopleSoft Software Developer needed to manage system upgrades, performance tuning, and troubleshooting in a hybrid work environment.
Yahoo seeks a Senior Software Development Engineer to advance its User Data Platforms with expertise in distributed databases, cloud technologies, and scalable infrastructure.
Experienced full stack Software Developer needed at Booz Allen to develop and deploy scalable systems leveraging cloud and container technologies.
Articul8 AI is hiring a seasoned Frontend Engineer skilled in React.js to develop and optimize advanced frontend solutions for their GenAI platform in a hybrid work setting.
Lead software engineering initiatives within JPMorgan Chase's Data Platform team to deliver innovative, secure, and scalable solutions.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
144 jobs