NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on ecosystem partner enablement for Generative AI. In this role, you will lead by example, acting as both a strategic technical expert and a hands-on developer. You will directly build innovative proof-of-concept solutions and reference architectures for innovative AI agents, demonstrating the full power of the NVIDIA full-stack accelerated Generative AI platforms. By developing these foundational solutions, you will provide partners with the technical blueprints and expert guidance needed to architect and deploy their own transformative applications using NVIDIA full AI stack, from GPU systems and CUDA to NeMo and Nemotron.
The Generative AI Partners Enablement Solutions Architect team is committed to leveraging advanced technologies to address and expedite the deployment of solutions for customers' real-world challenges. We act as trusted technical advisors and partners to our ecosystem. As a member of NPN Generative AI Solution Architecture team, you will be immersed in a diverse, supportive environment where everyone is inspired to do their life’s work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing AI and solve category defining systems and production grade AI solutions at scale.
What you will be doing:
Building an end-to-end agentic AI applications that solve real-world enterprise problems across various industries.
Serve as the primary technical domain expert for pre- and post-sale for partners, embedding deeply with them to design and deploy Generative AI solutions at scale. Maintain strong relationships with leadership and technical teams to drive adoption, and successful utilization of NVIDIA GenAI platforms.
Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to productions.
Establish the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring alignment to standardized and reproducible GPU-accelerated workflows.
Enable strategic partners to build their own Professional Services, platforms and products by integrating and accelerating using NVIDIA technologies for high-impact customer workloads. You will proactively find opportunities to drive deeper adoption and utilization of NVIDIA's Generative AI products.
Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
What we need to see:
MS or PhD degree in Computer Science/Engineering, Machine Learning, Data Science, Electrical Engineering or a closely related field (or equivalent experience).
5+ years of meaningful work experience in deploying AI models at scale as a Software Engineer or Deep Learning engineer.
Consistent track record of building enterprise-grade agentic AI systems using open-source models and solid foundation in deep learning, with a particular emphasis on LLM and VLM.
Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen) and evaluation and observability platforms. Comfortable building prototypes or proofs of concept
Strong coding development and proficiency in Python, C++ and Deep Learning frameworks (PyTorch, or TensorFlow).
Excellent communication and presentation skills to effectively collaborate with both internal executives, partners and customers.
Ways to stand out from the crowd:
Demonstrate expertise in building applications and systems using NeMo Framework, Nemotron, Dynamo, TensorRTLLM, NIMs, AI Blueprints. And actively contribute to the open-source community.
Take end-to-end ownership of projects, proactively acquiring new skills or knowledge as needed to drive success.
Excel in fast-paced environments, adeptly managing multiple workstreams and prioritizing for the highest customer impact.
Understanding of different advanced agent architectures and emerging communication protocols (MCP, OpenAI Agentic SDK, or Google A2A).
NVIDIA GPUs and system software stacks (e.g. NCCL, CUDA), as well as HPC technologies such as InfiniBand, MPI, NVLink and others.
You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Build and maintain executable SDK examples, CI/CD-backed reference implementations, and docs-as-code to help ISVs integrate with NVIDIA Omniverse.
Lead the design and optimization of large-scale AI inference systems at NVIDIA, developing high-performance kernels, compilers, and orchestration for state-of-the-art models.
Experienced Staff/Senior Staff Software Engineer to lead development of a scalable ML platform that empowers data scientists and drives production-grade AI/ML in an energy-focused, remote-first company.
Lead development of scalable, secure robotic fleet management software as a Principal Enterprise Software Engineer working remotely across the US.
Lead development of UpdraftPlus by applying deep WordPress plugin expertise to guide a small global engineering team and deliver robust, secure features for millions of users.
Vendelux is hiring a Fullstack Engineer in New York to build and scale features on an AI-driven event-marketing platform and drive infrastructure improvements.
Lead architecture and technical strategy for enterprise web applications as a remote Software Engineer Architect, guiding design, implementation, and team mentorship.
Lead engineering efforts to design and scale ProRata Attribution systems—covering content understanding, distributed serving, knowledge systems, and agentic backend workflows—at ProRataAI's Bellevue office.
Fully remote Software Engineer II role building scalable full‑stack applications with .NET/PHP and modern front-end and cloud tooling for a collaborative, fast-paced team.
Lead and grow a mobile and backend engineering team to deliver high-quality live and on-demand video experiences for sports fans, coaches, and families across iOS and Android platforms.
A hands-on Senior Software Engineer who will design and build data pipelines, analytics APIs, and LLM-driven features to power Scrunch's AI product suite for customers across the US.
TENEX, an AI-driven MDR startup backed by Andreessen Horowitz, is hiring a Senior Software Engineer in San Jose to build scalable, secure backend and frontend systems and drive technical excellence.
Lead architecture and engineering efforts for large-scale, AI-enabled systems as a senior technical leader working remotely across the contiguous United States.
A partner company of Jobgether is hiring a Senior Fullstack Product Software Engineer to lead end-to-end product development, mentor engineers, and deliver scalable, AI-enabled corporate IT solutions in a fully remote US role.
Metova is hiring a hands-on Senior Software Developer to lead engineering efforts, mentor teammates, and deliver maintainable web and mobile applications for clients.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
197 jobs