Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Engineering Manager - AI DevOps image - Rise Careers
Job details

Engineering Manager - AI DevOps

NVIDIA is looking for an outstanding AI DevOps Engineering Manager to lead and expand our next-gen inference operations infrastructure. Join us in transforming AI inference delivery, supporting NVIDIA's innovative products like Dynamo, Triton, NIXL, and our quickly growing range of AI inference solutions. This role is essential for our GitHub First initiative, enabling public CI/CD infrastructure with GPU and Kubernetes capabilities to deliver high-throughput, low-latency inferencing solutions in distributed environments. Lead a team ensuring our AI products achieve outstanding performance and reliability worldwide.

What you'll be doing:

  • Supervise a team of DevOps engineers with expertise in AI inference infrastructure, test automation (SDET), and Infrastructure as Code (IaC)

  • Architect and implement scalable test automation strategies for AI inference workloads, including performance benchmarking and automated quality gates

  • Lead the maintenance of our GitHub First public CI infrastructure, focusing on single/multi-GPU testing, Kubernetes multi-node GPU testing, and CSP validation

  • Drive Infrastructure as Code efforts by employing Terraform, Ansible, and Kubernetes to support scaling across multiple clouds and lead GPU clusters effectively.

  • Attain operational proficiency encompassing 24x7 on-call rotations, SRE methodologies, automated monitoring, and self-repairing systems to guarantee uptime exceeding 99.9%

  • Lead release coordination, cost optimization, and management of multi-cloud deployments

What we need to see:

  • Bachelor's/Master's degree in Computer Science, Engineering, or equivalent experience

  • 4+ years leading DevOps/SRE organizations with direct SDET leadership experience

  • 8+ years hands-on experience in software development, test automation, or infrastructure engineering with AI/ML or GPU-intensive workloads

  • Proficiency in Infrastructure as Code (IaC) platforms: Terraform, Ansible, or CloudFormation with exposure to multiple cloud environments (AWS, GCP, Azure, OCI)

  • Strong technical leadership in test automation frameworks, CI/CD pipeline development, and quality engineering practices

  • Familiarity with containerization and orchestration tools such as Docker and Kubernetes for leading AI/ML workloads and GPU resources

  • Proven success building and scaling teams in fast-paced, high-growth environments

  • Effective interpersonal skills to collaborate with remote teams and build agreement

  • Proficiency in Python, Rust, or related programming languages along with the capability to engage in architecture conversations

  • Demonstrated history of operational proficiency encompassing 24x7 on-call oversight, SRE methodologies, and robust high-availability infrastructures

Ways to stand out from the crowd:

  • Experience with CI/CD (specifically GitHub Actions), releasing Open-source AI software

  • Proficient in Deep AI/ML infrastructure with expertise in NVIDIA technologies such as CUDA, TensorRT, Dynamo and Triton Inference Server, including coordinating GPU cluster operations and GPU workload performance benchmarking

  • Background in DevOps, system software testing, and previous experience leading teams on inference engines, model serving platforms, or AI acceleration frameworks

  • Track record with monitoring tools (Prometheus, Grafana), security scanning, static/dynamic analysis tools, and license compliance automation for critical AI inferencing frameworks.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 3, and 272,000 USD - 425,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 29, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Glassdoor Company Review
4.6 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
NVIDIA DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of NVIDIA
NVIDIA CEO photo
Jensen Huang
Approve of CEO

Average salary estimate

$324750 / YEARLY (est.)
min
max
$224000K
$425500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Posted 6 hours ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Technical and business-minded capacity planning lead needed to translate GPU roadmaps and tenant demand into actionable global data center capacity strategies.

Photo of the Rise User
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Lead product strategy and teams to design developer tools, SDKs, and platforms that improve developer efficiency across AI, HPC, and graphics at NVIDIA.

Photo of the Rise User
Posted 8 hours ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Dare to be Different
Reward & Recognition
Fast-Paced
Maternity Leave
Paternity Leave
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Learning & Development
Social Gatherings

Work on high-impact backend services at Robinhood to enable scalable, secure financial products across brokerage, futures, and crypto.

Photo of the Rise User
Posted 3 hours ago

Teledyne is hiring a Senior Software Developer focused on identity management to build secure, scalable IAM solutions and translate business requirements into technical implementations.

Photo of the Rise User
Workday Hybrid USA, CA, Pleasanton
Posted 1 hour ago

Workday is looking for a motivated Test Automation Engineer Intern to contribute to automation frameworks and testing strategies during a 12-week, in-person summer internship in Pleasanton, CA.

Photo of the Rise User
Facet Hybrid No location specified
Posted 3 hours ago

Senior Software Engineer role focused on building and testing full-stack features with Golang and React for a remote-first FinTech product team.

Photo of the Rise User

Zoox seeks a Perception Software Engineer to design and implement real-time perception pipelines for autonomous vehicles using advanced computer vision, probabilistic techniques, and large-scale sensor data.

Photo of the Rise User

Join Cofounder as a Fullstack Engineer - Cofounder to build the core autonomous agent platform, shipping full-stack features across integrations, flows, and UX with high ownership.

Photo of the Rise User
Posted 1 hour ago

Help scale Conversion's core platform by building reliable cloud infrastructure and platform features that enable product engineers to deliver mission-critical automations for thousands of customers.

Photo of the Rise User

Lead cross-functional engineering teams in Bellevue to modernize and scale Visa's VAS platform using generative AI, cloud-native patterns, and modern integration approaches.

Photo of the Rise User
Posted 9 hours ago

Experienced Release and Environments Manager needed to lead multi-environment release planning, environment provisioning and CI/CD automation for a global digital solutions company.

Posted 3 hours ago

Lead the design and development of intelligent, native iOS and macOS applications at Rox, blending deep OS-level integrations with cutting-edge AI to transform how revenue teams work.

Photo of the Rise User
Posted 3 hours ago

Help Zoox advance autonomous robotaxi pickup and dropoff by building and deploying motion-planning algorithms that operate safely and reliably in complex, real-world environments.

Posted 8 hours ago

Yeet is hiring a Backend Engineer to help scale its Python microservices and build high-throughput, fault-tolerant streaming and ingestion systems for a real-time observability platform.

Posted 4 hours ago

Build scalable full-stack systems and AI-powered automation at a high-growth AI platform, shipping production features rapidly to directly impact enterprise revenue outcomes.

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

163 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Diversity ChampionBadge Family FriendlyBadge Global CitizenBadge Work&Life Balance
CULTURE VALUES
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
September 29, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!