NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are looking for outstanding High Performance AI Engineer to build groundbreaking multi-agent systems for the CUDA ecosystem. We build innovative agentic runtimes and compiler-integrated orchestration that work together with NVIDIA's software stack to provide comprehensive acceleration for modern agent workloads powered by foundational models. As a member of the team, you will develop new agent abstractions, GPU-centric runtimes, and compiler- or runtime-driven system solutions to accelerate agent planning, tool-use, code generation, and other high-impact AI workloads. You will collaborate closely with internal NVIDIA software and hardware teams to push the latest developments into NVIDIA products.
What you'll be doing:
Design, build and optimize agentic AI systems for the CUDA ecosystem.
Co-design agentic system solutions with software, hardware and algorithm teams; influence and adopt new capabilities as they become available.
Develop reproducible, high-fidelity evaluation frameworks covering performance, quality and developer productivity.
Collaborate across the AI stack—from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving—and with model/agent teams.
What we need to see:
Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); MS or PhD preferred.
3 years+ industry or academia experience with AI systems development; exposure to building foundational models, agents or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks.
Strong C/C++ and Python programming skills; solid software engineering fundamentals.
Experience with GPU programming and performance optimization (CUDA or equivalent).
Ways To Stand Out From The Crowd:
Strong experience in building/evaluating deep learning models, coding agents and developer tooling.
Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms.
Demonstrated ability in GPU performance optimizations, evidenced by benchmark wins or published results.
Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repos or standards.
With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD for Level 2, and 148,000 USD - 235,750 USD for Level 3.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is hiring a Senior ASIC Design Engineer (Clocks IP) to design and deliver robust, high-performance clocking solutions for next-generation GPU and CPU chips.
Lead a team at NVIDIA building and optimizing accelerated ML and vector search libraries on GPUs, driving performance, benchmarks, and community impact.
Help build and scale internal tooling and AI-driven automation at Ramp to empower operations teams and improve customer experiences.
Lead a team at NVIDIA building and optimizing accelerated ML and vector search libraries on GPUs, driving performance, benchmarks, and community impact.
SanDisk is hiring a Senior Technologist, Firmware Engineering to lead development and optimization of SSD firmware on embedded multi-core platforms and support customer design‑in and qualification.
Be the founding backend engineer at Known, architecting scalable TypeScript/Node.js services, real-time communication, and infrastructure to power a consumer AI product at scale.
AnswersNow is hiring a remote Lead Full Stack Engineer to lead a small team and drive architecture, delivery, and quality for its React/Node.js telehealth platform.
Brellium is hiring a Senior Software Engineer to architect scalable AWS-backed systems and advance AI workflows for its clinical review platform that improves care quality across the U.S.
Join Classroom Mosaic as a Full-Stack Software Engineer to lead end-to-end projects that improve instructional practice for K-12 schools through elegant, high-performance software.
Lead the BI-layer development for ServiceNow's FinOps platform, building dashboards and visualizations with Lightdash, React, and modern data stack technologies to enable enterprise-scale cloud cost governance.
Replit is hiring a Staff Site Reliability Engineer to lead observability, incident response, and infrastructure automation for its large-scale, Kubernetes-based platform.
Parabola is hiring a Senior Full Stack Software Engineer to design and ship scalable TypeScript and Python features that advance its flow-based automation platform.
Krazy Coupon Lady is hiring a Full-Stack Developer to help build and optimize responsive, accessible web features for a fast-growing, remote-first shopping publisher.
Senior backend engineer (Member of Technical Staff) to design and ship scalable backend systems powering Ideogram’s generative design products across a fast-moving, collaborative team.
Experienced Senior Software Engineer needed to lead design and delivery of map services for autonomous driving at Woven by Toyota’s Palo Alto office.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
281 jobs