We are seeking a Software Engineering Manager to lead the engineering team behind Dynamo, NVIDIA’s high-performance, low-latency inference platform for serving generative AI and reasoning workloads at scale. The team accelerates deployment of cutting-edge models across diverse engines and architectures, enabling breakthroughs from real-time LLM serving to complex multi-GPU, multi-node pipelines. The ideal candidate is strong in software development and in designing and building fault-tolerant distributed systems, and can implement a well-thought-out long-term maintenance strategy.
What you'll be doing:
Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution of projects and workflows.
Work across several teams and orgs to build platforms that use the latest developments in LLM inferencing. In this role, you will collaborate with research and development teams and serve a large user base (software teams both internal and external to NVIDIA).
Align priorities across collaborators and define metrics for measuring the success of the product/team.
Stay updated with the latest trends in AI, ML, and infrastructure, proactively seeking opportunities to integrate advancements into NVIDIA's LLM and AI infrastructure solutions.
What we need to see:
Master's or PhD or equivalent experience in Computer Science, Computer Architecture, or a related field.
10+ years of overall experience in developing large distributed systems.
2+ years of experience managing AI and software development teams.
Experience in developing and maintaining LLM or GenAI infrastructure.
Excellent communication, collaboration and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.
Hands-on experience developing large-scale distributed systems.
Ways to stand out from the crowd:
Strong technical background in cloud/distributed systems.
Experience working in a globally distributed organization.
Good knowledge of CPU and/or GPU hardware architecture.
Background in developing LLM inference systems.
Experience with LLM frameworks like vLLM & TRT-LLM.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most expert and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field of Deep Learning and Artificial Intelligence.
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.