NVIDIA is searching for a passionate Software Engineering Manager to lead the Triton Inference Server team. Triton is cutting-edge, open-source inference software that powers AI deployment across cloud, data center, edge, and embedded devices, supporting models from TensorRT, TensorFlow, PyTorch, ONNX, and more. Join us to shape the future of scalable, production-ready AI solutions used by innovators around the globe.
What You Will Be Doing:
Guide, mentor, and develop an inclusive and collaborative engineering team focused on delivering robust model serving solutions.
Drive planning, prioritization, and execution for projects that improve Triton’s scalability, performance, and reliability in non-generative AI deployments.
Foster partnerships with Product and Program Management to create feature roadmaps, manage cross-team dependencies, and balance project resources for both cloud and on-premises platforms.
Collaborate with internal teams and external customers to understand use cases and convert their needs into product features.
Promote engineering excellence through modern, agile development practices and a culture of quality and accountability.
What We Need to See:
Master’s or PhD, or equivalent experience, in Computer Science, Computer Engineering, or a related field.
Eight or more years of overall hands-on software development experience in customer-facing environments.
At least three years building, mentoring, and leading software engineering teams delivering production-grade solutions.
Deep background in scalable serving architectures, with direct experience building cloud-native inference APIs, REST/gRPC/protobuf-based services, or similar technologies.
Advanced C/C++ and Python development skills, demonstrating clean, object-oriented design, as well as proficiency in debugging, performance optimization, and testing.
Track record of contributing to or leading large open-source projects—using GitHub for code reviews, bug tracking, and release management.
Strong knowledge of agile methodologies and tools such as JIRA and Linear.
Ability to communicate technical topics with clarity and empathy to colleagues, partners, and diverse audiences.
Ways to stand out from the crowd:
Experience working within distributed, global teams.
Practical knowledge of machine learning model deployment with frameworks such as TensorRT, TRT-LLM, PyTorch, ONNX, Python, or similar platforms.
Understanding of CPU and GPU architectures.
Skills in GPU programming (for example, CUDA or OpenCL).
NVIDIA sets industry standards for innovation, collaboration, and workplace empowerment. Team members are creative, driven, and dedicated to building responsible, real-time solutions that power AI worldwide. If leading scalable AI serving software excites you, you will thrive in a flexible and inclusive work environment with opportunities for growth and impact.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.