Job details

Deep Learning Software Engineer, Inference - New College Grad 2026

NVIDIA seeks a Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications. Our team is responsible for developing and maintaining high-performance open-source frameworks, which are at the forefront of efficient large-scale model serving and inference. You will play a central role in improving these platforms, facilitating smooth deployment and serving of groundbreaking language models.

You’ll work closely with the deep learning community to implement the latest algorithms for public release in inference frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA accelerators, from datacenter GPUs to edge SoCs. You'll bring to bear open-source tools and plugins—including CUTLASS, OAI Triton, NCCL, and CUDA kernels—to implement and optimize model serving pipelines.

What you'll be doing:

Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI.
Scale performance of DL models across different architectures and types of NVIDIA accelerators.
Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions.
Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions.

What we need to see:

Pursuing a Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).
C/C++ programming and software design skills. SW Agile skills are helpful and Python experience is a plus.
Experience with training, deploying or optimizing the inference of DL models in production is a plus.
Modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.
GPU programming experience (CUDA, OAI TRITON or CUTLASS) is a plus.

Ways to Stand out from The Crowd

Contribute to deep learning software projects, such as PyTorch, vLLM, and SGLang to drive advancements in the field.
Experience with Multi GPU Communications (NCCL, NVSHMEM)

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD for Level 2, and 148,000 USD - 235,750 USD for Level 3.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 5, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Deep Learning Inference GPU CUDA C++ Triton CUTLASS NCCL LLM Model Optimization PyTorch vLLM Profiling Generative AI Multimodal

NVIDIA Glassdoor Company Review

4.6

NVIDIA DE&I Review

No rating

CEO of NVIDIA

Jensen Huang

Approve of CEO

Average salary estimate

$177875 / YEARLY (est.)

min

max

$120000K

$235750K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Software Engineer, Storage

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 11 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

NVIDIA is hiring a Software Engineer to develop user-space applications and Linux kernel storage drivers for next-generation storage solutions.

Software Engineer, Deep Learning Libraries - New College Graduate 2026

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 10 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Develop agentic AI systems at NVIDIA that use LLMs and systems programming to build safe, autonomous software integrated with GPU platforms.

Senior Backend Enginer

EchoMark Hybrid Bellevue

VIEW

Posted 23 hours ago

EchoMark is looking for a Senior Backend Engineer to design scalable, multi-tenant backend systems and infrastructure-as-code to support secure document fingerprinting across commercial, government, and on-prem environments.

Forward Deployed Engineer (AI Solutions) - Atrix

Pear VC Hybrid New York City

VIEW

Posted 2 hours ago

Atrix seeks a technically fluent, customer-obsessed Forward Deployed Engineer to embed with enterprise life-sciences teams and deliver accurate, trusted AI workflows from onboarding through go-live.

Staff Machine Learning Platform Engineer

Faire Hybrid San Francisco, CA

VIEW

Posted 3 hours ago

Lead the architecture and delivery of Faire's machine-learning platform, building scalable feature stores, model serving, and inference infrastructure to power production ML across the marketplace.

Sr Staff Engineer Software (Prisma Access)

Palo Alto Networks Hybrid Santa Clara, CA

VIEW

Posted 21 hours ago

Palo Alto Networks is hiring a Sr Staff Software Engineer to design and build scalable backend services for Prisma Access, enabling secure cloud-delivered networking for global customers.

C++ Fintech Developer, Onsite

Parallel Partners Hybrid 205 West Randolph Street, Fairfield, NJ, United States

VIEW

Posted 22 hours ago

Experienced C++ developer needed to design and optimize low-latency data processing and analytics components for a Fairfield, NJ fintech platform in a fully onsite role.

Senior Backend Engineer

Verkada Hybrid San Mateo, CA United States

VIEW

Posted 6 hours ago

Mission Driven

Inclusive & Diverse

Take Risks

Collaboration over Competition

Growth & Learning

Lead backend development for Verkada's Core Command systems, designing and scaling authentication and user infrastructure to support millions of users and devices.

Sr. Full Stack Engineer

Awesome Motive Hybrid Remote

VIEW

Posted 23 hours ago

An experienced full-stack engineer to design and ship scalable, high-performance features across frontend and backend systems for a creative, AI-enabled collaborative canvas.

Sr Principal Software Engineer (Posture Security)

Palo Alto Networks Hybrid Santa Clara, CA

VIEW

Posted 21 hours ago

Palo Alto Networks is hiring a Sr. Principal Backend Engineer to lead architecture and development of scalable, high-performance cloud posture security services for the Cortex Cloud platform.

Engineering Manager

Kaizen Labs Hybrid New York

VIEW

Posted 17 hours ago

Kaizen Labs is hiring an Engineering Manager to lead and build reliable, scalable payments and accounting systems that support government customers nationwide.

Full Stack Software Engineer (Front-End Focus)

Jobgether Hybrid US

VIEW

Posted 16 hours ago

Full Stack Software Engineer (front-end focus) sought to design and deliver accessible, high-performance React/TypeScript applications integrated with enterprise systems for a large-scale mission-critical platform.

Software Engineer, Storage

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 11 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

NVIDIA is hiring a Software Engineer to develop user-space applications and Linux kernel storage drivers for next-generation storage solutions.

Cloud Services, Engineering Manager

Replit Hybrid Foster City

VIEW

Posted 2 hours ago

Inclusive & Diverse

Mission Driven

Work/Life Harmony

Diversity of Opinions

Friends Outside of Work

Empathetic

Collaboration over Competition

Fast-Paced

Transparent & Candid

Medical Insurance

Dental Insurance

Vision Insurance

Disability Insurance

Learning & Development

401K Matching

Paid Time-Off

WFH Reimbursements

Paid Holidays

Equity

Flex-Friendly

Lead Replit's Cloud Services team to build and operate first-party cloud infrastructure that powers Replit Agent and enables scalable, user-friendly app hosting and deployment.

Software Engineer

Air Space Intelligence Hybrid Boston

VIEW

Posted 2 hours ago

Air Space Intelligence is hiring a Software Engineer to develop and scale high-impact systems that power airspace decision-making for airlines and government customers.

NVIDIA

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

74 jobs

MATCH

Calculating your matching score...

BADGES