NVIDIA is the platform upon which every new AI‑powered application is built! We are seeking a deeply technical, hands‑on Senior Engineering Manager to lead the NVIDIA Inference Microservices (NIM) Factory team. You will lead and scale a world‑class engineering organization that delivers day‑0 model launches and follows through with enterprise‑grade software—to delight customers with reliable, performant, and secure AI services at massive scale. You will partner closely with product, research, SRE, and security to define strategy, drive execution across multiple workstreams, and safeguard the long‑term technical health of the platform.
What you'll be doing:
Lead the NIM Factory engineering team (containers, orchestration, workflow, observability, platform APIs); attract, hire, onboard, and grow top talent.
Define vision, strategy, and roadmap for how we build, ship, and operate NIM from day‑0 launch through enterprise‑grade hardening (security, reliability, performance, compliance).
Own end‑to‑end delivery of cross‑functional programs; align stakeholders and manage dependencies.
Drive predictable delivery across multiple programs; manage priorities, resourcing, schedules, and dependencies.
Establish engineering excellence: code health and reviews, documentation, CI/CD, testing.
Collaborate with research and platform teams on inference architecture and scalable deployment patterns.
What we need to see:
10+ years overall building and delivering production software systems, including 5+ years leading engineering teams as a manager; experience leading multiple teams or managing managers is a plus.
Proven track record driving complex, cross‑functional programs from inception to successful production launch and scale.
Strong foundation in cloud‑native engineering (containers, Kubernetes, microservices) and modern SDLC practices (CI/CD, testing, observability).
Proficiency with cloud languages such as Python; ability to read code, guide designs, and drive high‑quality engineering outcomes.
Demonstrated ability to hire, coach, and develop senior engineers/tech leads; build inclusive teams and a culture of ownership and excellence.
Excellent communication and stakeholder management; ability to influence across product, research, security, and operations.
A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.
Ways to stand out from the crowd:
Led teams that built and operated large‑scale LLM inference or model‑serving platforms (Triton, TensorRT‑LLM, vLLM) in production.
Experience architecting next-generation container build systems or CI/CD platforms at scale.
Built organizations across multiple time zones; established durable engineering processes that improved quality and velocity.
Proven success building talent pipelines, mentoring managers/tech leads, and increasing team engagement and retention.
Contributions to open‑source ecosystems, technical publications, or talks in containers, Kubernetes, GPU, or inference communities.
We are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward‑thinking and creative people in the world working for us. If you're creative and autonomous with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 425,500 USD. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.