
Senior AI Engineer, NeMo Retriever - Model Optimization and MLOps

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to co-pilots and more. Join us at the forefront of technological advancement in intelligent assistants and information retrieval.

NVIDIA NIM provides containers to self-host GPU-accelerated inferencing microservices for pre-trained and customized AI models across clouds, data centers, RTX™ AI PCs, and workstations. NIM microservices expose industry-standard APIs for simple integration into AI applications, development frameworks, and workflows. Built on pre-optimized inference engines from NVIDIA and the community, including NVIDIA TensorRT and TensorRT-LLM, NIM microservices optimize response latency and throughput for each combination of foundation model and GPU.


NVIDIA NeMo Retriever is a collection of NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and maximum data privacy. It delivers quick, context-aware responses for AI applications like advanced retrieval-augmented generation (RAG) and agentic AI workflows. The NeMo Retriever team is looking for an AI Engineer to work at the intersection of machine learning development, performance optimization, and MLOps. This role requires a unique blend of technical expertise in ML model development, system optimization, and operational excellence. We are looking for someone with a passion for working on the world's most complicated problems in the Generative AI, LLM, MLLM, and RAG spaces using our innovative hardware and software platforms. You will leverage and augment existing tools that enable building NIMs, which power flexible, multimodal retrievers and agents. If you're creative and passionate about solving real-world conversational AI problems, come join us.

What You'll Be Doing:

  • Develop and maintain NIMs that containerize optimized models following OpenAPI standards, using Python or an equivalent performant language.

  • Work closely with partner teams to understand requirements, build and evaluate POCs, and develop roadmaps for production-level tools.

  • Enable development of integrated systems (AI Blueprints) that provide a unified, turnkey experience.

  • Help build and maintain our Continuous Delivery pipeline, with the goal of moving changes to production faster and more safely while upholding key operational standards.

  • Provide peer reviews to other specialists, including feedback on performance, scalability, and correctness.

What We Need To See:

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 8+ years of demonstrated experience in a similar or related role.

  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.

  • Experience delivering software in a cloud context and familiarity with the patterns and processes of handling cloud infrastructure.

  • Knowledge of MLOps technologies such as Docker Compose, containers, Kubernetes, Helm, and data center deployments.

  • Familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.

  • Excellent, in-depth, hands-on understanding of NLP, LLM, MLLM, Generative AI, and RAG workflows.

  • Self-starter with a passion for growth, enthusiasm for continuous learning, and a habit of sharing findings across the team.

  • Extremely motivated, highly passionate, and curious about new technologies.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Due to unprecedented growth, our exclusive engineering teams are rapidly growing.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 20, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Glassdoor Company Review: 4.6 stars
NVIDIA DE&I Review: No rating
CEO of NVIDIA: Jensen Huang

Average salary estimate

$270,250 / year (est.)
min: $184,000
max: $356,500

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Posted 22 hours ago

NVIDIA is seeking an experienced Cluster Deployment Operations Engineer to lead validation, deployment guidance, and field enablement for large-scale AI and HPC cluster solutions.

Posted 6 hours ago

Lead a distributed datacenter team to plan, build, and operate NVIDIA’s private cloud infrastructure supporting AI and HPC workloads across global sites.

Guidehouse Hybrid US - VA, Springfield
Posted 49 minutes ago

Guidehouse is hiring a senior Software Engineer with an active TS/SCI clearance to architect, develop, and sustain classified-network software and knowledge-management tools for the International Affairs office.

Posted 22 hours ago

Blue Origin's Engines team is hiring a Software Engineer III to develop scientific solvers, CI/CD-enabled tools, and integrations that support structural and loads analysis for next-generation rocket engines.

Experienced backend engineer needed to lead design and delivery of secure, scalable crypto custody systems for a North America–remote blockchain team.

Posted 18 hours ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Paid Holidays

Kiddom seeks an experienced Senior Full Stack Engineer to own and deliver end-to-end features across React/TypeScript frontends and Go/Python backends with AI integrations.

Senior technology leader needed to head a global engineering organization and deliver scalable, AI-enabled web and mobile solutions while partnering closely with executive leadership and clients.

WorkOS Hybrid No location specified
Posted 13 hours ago

WorkOS is hiring a Product Engineer - Enterprise to design and build scalable, developer-friendly identity and authorization features used by fast-growing SaaS companies.


Kiddom is hiring a Staff Software Engineer (Frontend) to lead frontend architecture and deliver scalable, data-driven product features for their education platform.

Posted 19 hours ago

Lead a remote engineering team at Platform Science building scalable event-driven web services, guiding architecture, delivery, and the professional growth of engineers.

Posted 10 hours ago

McKesson is hiring a Sr. Associate Software Engineer to help build and maintain B2B commerce microservices using Java Spring Boot and React.

RSM is seeking a Development Capability Leader to shape and lead its software development discipline, driving technical excellence, scalable delivery practices, and team growth across cloud and low-code environments.

Posted 8 hours ago

Lead DevOps technical strategy and execution at Atria, driving cloud infrastructure, CI/CD, and observability for a fast-growing preventive healthcare platform.

Posted 18 hours ago

Hudl is seeking an Engineering Manager to lead a full‑stack team building scalable, cloud‑based products that support coaches, athletes, and fans in the Competitive market.

Lead the design and delivery of high-performance, accessible React front-ends at a fast-paced, product-driven company working across distributed teams.

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

88 jobs
BADGES
Changemaker
Diversity Champion
Family Friendly
Global Citizen
Work&Life Balance
CULTURE VALUES
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
December 18, 2025