NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are looking for an outstanding System Performance Engineer to work on at-scale AI system performance and datacenter applications. Be a key player on the most exciting computing hardware and software, and contribute to the latest breakthroughs in artificial intelligence and GPU computing! Provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated Computing and Deep Learning software and hardware platforms, and with many researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms.
What you'll be doing:
Provide engineering solutions to enable deployment of world-class GPU computing products at scale, lead technical relationships with engineering teams, and assist system administrators, software and hardware engineers, and machine learning/deep learning engineers in building creative solutions.
Lead aspects of performance analysis and scalable practices to support large-scale infrastructure, and deliver powerful tools, methodologies, and workflows to validate expectations.
Build engineering solutions that deliver continuous insights into the performance of AI workloads across evolving environments, quickly surfacing improvements and regressions over time.
Decompose multi-faceted issues into minimal reproduction cases, working toward the root cause of the underlying problems.
Work with multiple team members to develop best practices for understanding trends in test results and for presenting data clearly to drive data-driven actions.
What we need to see:
5+ years of experience running multi-node workloads, identifying bottlenecks, and implementing improvements.
Proven understanding of high-performance computing based architectures and GPU accelerated computing software stacks and DL Frameworks (CUDA, PyTorch).
Experience with CPU architectures.
Experience with C/C++/Python/Bash programming/scripting.
Strong teamwork and communication skills.
Ability to multitask in a dynamic environment.
Action-driven with strong analytical skills.
BS in Engineering, Mathematics, Physics, or Computer Science (or equivalent experience); MS or PhD desirable.
Ways to Stand Out From the Crowd:
Experience tuning memory, storage, and networking settings for performance on Linux systems.
Knowledge of modern Cloud and container-based architectures.
Hands-on experience deploying and debugging systems with NVIDIA NVLink and InfiniBand.
Experience with multiple monitoring stacks such as Prometheus+Grafana, Elasticsearch+Kibana, Splunk, Zabbix, etc.
Demonstrated work with Open-Source software: building, debugging, patching and contributing code.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, with a genuine passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.