NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Software Architect to define and own the technical vision for the NVIDIA Inference Microservices (NIM) Factory. You will set the architectural direction for how we build, deploy, and scale enterprise-grade AI services to delight customers, while staying hands-on to guide our most critical implementations. The scope spans day-0 launches and the follow-through to harden them into enterprise-grade software, ensuring reliability, performance, and security across thousands of GPUs. You will shape our strategy for emerging challenges like disaggregated LLM inference and safeguard the long-term technical health of the platform.
What you'll be doing:
Define the end-to-end technical architecture for the NIM Factory, from container build systems and CI/CD to Kubernetes deployment patterns and runtime optimization.
Drive technical strategy and roadmap, making high-impact decisions on frameworks, technologies, and standards that empower dozens of engineering teams.
Architect and influence the design of workflow orchestration systems that underpin the NIM factory.
Guide and support senior engineers throughout the organization in building a culture centered on technical excellence and innovation.
Advocate for guidelines in software development, encompassing API composition, automation, observability, and secure supply chain management.
Collaborate with leadership across research, backend, SRE, and product to align technical vision with product goals and influence technical roadmaps.
What we need to see:
15+ years of experience building large-scale, production distributed systems.
Consistent track record in a technical leadership or architect role, setting technical direction, and implementing.
Deep architectural expertise in cloud-native technologies, including Kubernetes, containers, and microservices.
Exceptional ability to mentor, and grow senior engineers with a passion for raising the technical bar of the entire organization.
Proficiency in languages like Python for building tooling and services.
Experience architecting solutions for GPU-accelerated or other high-performance computing workloads.
Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to diverse audiences and drive consensus.
A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.
Ways to stand out from the crowd:
Hands-on with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM).
Experience optimizing large-model serving (KV cache sharding/paging, tensor/sequence parallelism, speculative decoding, dynamic batching).
Experience architecting next-generation container build systems or CI/CD platforms at scale.
Background with workflow orchestration engines (e.g., Temporal, Airflow) for complex, distributed processes.
Expertise in designing multi-tenant, multi-cluster, or edge/air-gapped deployment architectures.
With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and. due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 425,500 USD.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Build scalable security-first services and developer tooling at NVIDIA to enable secure, efficient chip and AI development as a Senior System Software Engineer.
Lead engineering efforts on NVIDIA Mission Control to build resilient, AI-enabled cluster automation and observability for DGX Cloud and Blackwell data centers.
Lead the design and implementation of secure software features and autonomous agents for an AI enterprise platform, bringing deep security expertise and strong Python engineering experience.
NVIDIA is hiring a Senior Frontend Web Applications Software Engineer to craft performant, accessible, and data-driven web interfaces for infrastructure and developer tooling.
Lead and enable a small, purpose-driven software team as a people-first Software Development Manager for an employee-owned MSP building nonprofit-focused solutions.
Capital One seeks early-career software engineers for an 18-month rotational Technology Development Program starting August 2026, focused on cloud-native engineering, full-stack development, and rapid skill building.
Early-career software engineer wanted to help build Eventual’s distributed query engine and cloud service, working primarily from the San Francisco office.
Lead frontend architecture and build performant, accessible TypeScript/React web experiences for Credit Genie's mobile-first financial wellness platform.
Design and own the context and orchestration layer for an AI‑first revenue platform, turning messy enterprise data into agentic actions that drive measurable revenue outcomes.
Agtonomy is hiring a Senior Software Engineer (C++) to design and ship production-grade cloud-to-edge and on-vehicle software for autonomous agricultural systems.
AeroVironment is hiring an entry-level Software Engineer I (Applications) in Melbourne, FL to develop, test, and maintain application software supporting UAS and tactical systems within an Agile team.
NVIDIA is hiring a Senior Machine Learning Engineer to develop real-time GenAI and graphics solutions for media and gaming on NVIDIA GPUs.
Senior Full-Stack Engineer to own end-to-end product features and integrate cutting-edge LLM and AI capabilities into Qualified’s B2B marketing platform.
Lead frontend architecture and hands‑on development of F2’s AI-driven enterprise UI platform, shaping the roadmap and delivering high-performance React/TypeScript interfaces for private markets customers.
Skyways is hiring an Embedded Linux Software Engineer in Austin to develop and debug kernel-level software and drivers for autonomous UAV hardware.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
196 jobs