We are seeking a Software Engineering Manager to lead the development for the Dynamo engineering team, NVIDIA’s high-performance, low-latency inference platform for serving generative AI and reasoning workloads at scale. The team accelerates deployment of cutting-edge models across diverse engines and architectures, enabling breakthroughs from real-time LLM serving to complex multi-GPU, multi-node pipelines. Ideal candidate is strong in software development, designing and creating fault-tolerant distributed systems, and has the ability to implement well thought out long term maintenance strategy.
What you'll be doing:
Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution of projects and workflows..
Work across several teams and orgs to build platforms that use the latest developments in LLM inferencing. In this role, you will be collaborating with research and development teams and serve a large user base (software teams both internal and external to NVIDIA).
Align priorities across collaborators and define metrics for measuring the success of the product/team.
Stay updated with the latest trends in AI, ML, and infrastructure, proactively seeking opportunities to integrate advancements into NVIDIA's LLM and AI infrastructure solutions.
What we need to see:
Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
10+ years of overall experience in developing large distributed systems.
2+ years of experience managing of AI and SW development teams.
Experience in developing and maintaining LLM or GenAI infrastructure
Excellent communication, collaboration and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.
Hands-on experience developing large-scale distributed systems
Ways to stand out from the crowd:
Strong technical background in cloud/distributed systems.
Experience working in a globally distributed organization.
Good knowledge of CPU and/or GPU hardware architecture
Background in developing LLM inference systems.
Experience with LLM frameworks like vLLM & TRT-LLM.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most expert and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field Deep Learning and Artificial Intelligence.
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Booz Allen is seeking an experienced embedded software engineer to lead development and integration of secure C/C++ solutions for mission-forward systems in a hybrid work environment.
Experienced full-stack engineer needed to build and maintain Polly's C#/.NET backend and React-based, mobile-first frontends for embedded auto insurance products.
Architect and build the multichain backend systems for a VC-backed DeFi yield engine, owning secure, production-grade on-chain integrations and transaction flows.
Help accelerate developer productivity on Poe by building tooling, reusable components, and APIs as a Senior Software Engineer focused on developer experience.
Lead the design and evolution of Attentive’s next-generation event streaming platform to improve throughput, observability, and developer self-service.
Expeditors is hiring a seasoned COBOL Developer III to support, enhance and migrate the eTMS Export mainframe system that powers the company’s global operations.
Augment is hiring an Engineering Manager to lead a core engineering team building AI-first systems that automate logistics operations across freight and 3PL customers.
Senior CIS Application Software Developer (SME) needed to design and implement secure, scalable Java-based enterprise solutions for federal clients, with strong experience in Spring Boot and Drools.
Product-focused Software Engineer to build AI-driven, customer-centered interfaces and ship high-impact features at a fast-moving NYC wealth-tech startup.
Lead a core product engineering team at Wise in Austin, driving technical direction, roadmap delivery, and engineering growth for products used by millions globally.
Graphite seeks a hands-on Software Engineer to help architect and build a real-time collaborative code review platform and influence the company’s technical direction.
Palo Alto Networks seeks a Director of Software Engineering to lead the Prisma AIRS organization and drive technical strategy for a scalable, cloud-native AI security platform.
Fullscript seeks a Senior Fullstack Developer to design and implement backend services, developer SDKs, and fullstack tooling that power identity, profile, and event-driven data flows across its platform.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
74 jobs