Are you passionate about generative AI and building agentic workflows to solve real problems? Are you interested in learning more about computer hardware architecture? NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our chips and systems power applications within artificial intelligence, computer graphics, autonomous vehicles, robotics, gaming, virtual reality, and high-performance computing.
We are now looking for a senior SW AI architect to help redefine our engineering flows. Come join the NVIDIA hardware architecture team to build agentic systems to improve our future chip designs, or put differently, use AI to design our next-generation AI systems!
What you’ll be doing:
Serve as an expert in implementing and deploying AI applications based on large language models (LLMs), internal and external agentic frameworks, and custom models.
Work with hardware architects to identify how to best design, customize, and deploy AI-based solutions to their specific problem domains.
Collaborate with infrastructure engineers to improve existing automated workflows by incorporating LLMs and establishing best practices for future solutions.
Develop and optimize retrieval and generation algorithms for enterprise data (text, code, and images) to build advanced AI applications.
Interact with internal research groups on how to solve complex chip design problems in new ways by leveraging machine learning (ML) and deep learning (DL).
Research emerging AI technologies and engineering best practices to continuously evolve our development ecosystem and maintain a competitive edge.
What we need to see:
MSc or PhD in Data Science, Computer Science/Engineering, Electrical Engineering, or equivalent experience.
5+ years of industry or research experience.
Deep practical knowledge of LLMs, DL/ML, and Agent development.
Well versed in agentic literature and eager to continue learning.
Strong background in implementing AI solutions to solve real-world engineering problems.
Experience with training/fine-tuning custom models, building multi-agent systems, retrieval augmented generation (RAG) pipelines, and vector databases.
Strong analytical, communication, and interpersonal skills.
Ways to stand out from the crowd:
Background in computer architecture or hardware development.
Good understanding of distributed systems and microservice architecture.
Hands-on experience with NVIDIA Inference Microservices (NIMs).
NVIDIA engineers the most advanced chips, systems, and software for the AI factories of the future and is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and ambitious people in the world working for us. Are you a creative and autonomous engineer who wants to make a difference? If so, we want to hear from you!
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Senior Learning Developer role producing professional training videos, animations, and AI-enhanced e-learning at NVIDIA’s Global Education Services team in Santa Clara.
Provide senior-level administrative support within the Autonomous Vehicle organization, coordinating complex schedules, travel, events, and projects for multiple executives at NVIDIA.
Work at the intersection of software and hardware as a Product Engineer at Voltai, building scalable web apps, generative UI powered by LLMs, and browser extensions that redefine semiconductor design workflows.
Senior Software Engineer needed to lead CI/CD, release tooling, and GenAI-driven developer productivity initiatives for a remote robotics-focused engineering organization operating across the US and Canada.
WHOOP is hiring a Software Engineer II on the Business Systems team to build integrations, APIs, and internal tools that power critical business operations from the Boston office.
Modal is hiring a systems-focused engineer to evolve multi-cloud workload scheduling, cost modeling, and GPU pricing for its serverless AI compute platform.
Experienced Backend Java Developer needed to build and maintain scalable Java services, collaborate with engineers and data scientists, and contribute to data-driven backend solutions in a fully remote role.
Lead a remote engineering team to shape system architecture, mentor engineers, and reliably ship scalable backend features for a fast-growing US-based platform.
Lead and grow a compact engineering team at Voltai to deliver AI-driven tooling for next-generation semiconductor and electronics design in a fast-moving startup environment.
Experienced DevOps/IaC engineer needed to design and automate secure, multi-classification infrastructure and CI/CD pipelines in support of USSTRATCOM at Offutt AFB.
Mercor is hiring a mid-level Backend Software Engineer in San Francisco to design and ship scalable backend APIs and services that power training and evaluation workflows for leading AI labs.
Become a core engineer at Dialogue AI to build full-stack systems and AI integrations that accelerate customer research and define the product from day one.
Work alongside experienced engineers at a mission-driven startup to build and test real-time telemetry features as a Software Engineering Intern in El Segundo.
Experienced full-stack engineer needed to build and maintain a hybrid web application with rich data visualizations and robust backend systems.
Lead the design and implementation of LLM-powered conversational UX and Python backend systems for a WhatsApp remittance bot at a global payments and communications company.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
196 jobs