Job details

Senior Deep Learning Software Engineer, Inference - job 1 of 2

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications. Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM, which are at the forefront of efficient large-scale model serving and inference. You will play a central role in improving these platforms, facilitating smooth deployment and serving of groundbreaking language models.

You’ll work closely with the deep learning community to implement the latest algorithms for public release in frameworks like SGLang and vLLM, as well as other DL frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA accelerators, from datacenter GPUs to edge SoCs. You'll bring to bear open-source tools and plugins—including CUTLASS, OAI Triton, NCCL, and CUDA kernels—to implement and optimize model serving pipelines.

What you'll be doing:

Optimize, analyze, and tune the performance of DL models in domains such as LLM, multimodal, and generative AI.
Scale the performance of DL models across different architectures and types of NVIDIA accelerators.
Contribute features and code to NVIDIA’s inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions.
Work with cross-functional teams across frameworks, NVIDIA libraries, and innovative inference optimization solutions.

What we need to see:

Master's or PhD, or equivalent experience, in a relevant field (Computer Engineering, Computer Science, EECS, AI).
5+ years of relevant software development experience.
Excellent C/C++ programming and software design skills. Agile software development skills are helpful, and Python experience is a plus.
Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.
Prior background in performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs, is a plus.

Ways to stand out from the crowd:

Contributions to deep learning software projects, such as PyTorch, vLLM, and SGLang, that drive advancements in the field.
Experience with multi-GPU communications (NCCL, NVSHMEM).
Experience building and shipping products to enterprise customers.
GPU programming experience (CUDA, OAI Triton, or CUTLASS).

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 7, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
