Job details

Senior Software Development Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and Generative AI that have put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
Perform benchmarking, profiling, and system-level programming for GPU applications.
Closely follow academic developments in the field of artificial intelligence and feature update TensorRT
Provide code reviews, design docs, and tutorials to facilitate collaboration among the team.
Conduct unit tests and performance tests for different stages of the inference pipeline.
Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams
Write safe, scalable, modular, and high-quality (C++/Python) code for our core backend software for LLM inference.
Improve the usability of the TensorRT-LLM library and build systems (CMake)

What we need to see:

Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)
4+ years of relevant software development experience.
Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models
Experience working with deep learning frameworks like TensorFlow and PyTorch
Self-starter who consistently takes initiative to drive projects forward
Excellent written and oral communication skills in English

Ways to stand out from the crowd:

Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation
Prior experience with performance modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application
Architectural knowledge of CPU and GPU
GPU programming experience (CUDA or OpenCL)

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 4, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

TensorRT LLM C++ Python CUDA GPU Deep Learning PyTorch TensorFlow Performance Inference CMake Software Engineer

NVIDIA Glassdoor Company Review

4.6

NVIDIA DE&I Review

No rating

CEO of NVIDIA

Jensen Huang

Approve of CEO

Average salary estimate

$217750 / YEARLY (est.)

min

max

$148000K

$287500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Senior Software Technical Program Manager - GPU Communication Libraries

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 15 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Lead cross-functional programs for NVIDIA's GPU communication libraries to deliver high-performance compute software and customer-facing releases for HPC and deep learning workloads.

Director, Technical Program Management - AI and ML Platforms

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 15 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Lead TPMs to deliver a resilient, high-performance DGX Cloud AI/ML platform that accelerates NVIDIA research by integrating hardware, orchestration, and developer productivity.

Lead Senior Software Engineer, Agentic AI Applications

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 7 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

NVIDIA seeks a Lead Senior Software Engineer to design and deliver industry-leading agentic AI blueprints and scale GenAI applications for enterprise deployment.

Senior Software Engineer

DomainTools Hybrid No location specified

VIEW

Posted 1 hour ago

DomainTools is hiring a Senior Software Engineer to build and operate cloud-native, near-real-time data systems that power leading security analytics and investigations.

Frontend AI UI Engineer

Freshworks Hybrid Bellevue, WA, USA

VIEW

Posted 5 hours ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Take Risks

Casual Dress Code

Emails over Meetings

Collaboration over Competition

Fast-Paced

Growth & Learning

Open Door Policy

Transparent & Candid

Customer-Centric

Passion for Exploration

Dare to be Different

Child Care stipend

Onsite Child Care

Family Medical Leave

Maternity Leave

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Health Savings Account (HSA)

Conferences Stipend

Learning & Development

Paid Time-Off

Equity

Donation Matching

Mixe-Ability Accomodations

Work Visa Sponsorship

Commuter Benefits

Employee Resource Groups

401K Matching

Freshworks is hiring a senior Frontend AI UI Engineer to create interactive visual editors, debugging tools, and multi-channel deployment interfaces for its Agentic AI Platform in Bellevue, WA.

Senior Back-end Engineer

NeoWork Hybrid No location specified

VIEW

Posted 7 hours ago

NeoWork is hiring a Senior Back-end Engineer to improve tool-use reliability by debugging systems, building test frameworks, and producing clear technical documentation for a remote, contractor-based engineering team.

Software Engineer (Full Stack)

N1 Hybrid New York City

VIEW

Posted 24 hours ago

Build high-performance web and blockchain tooling at N1 as a Full-Stack Software Engineer responsible for frontend, backend, and database systems.

Lead Full Stack Engineer, Payments

Collective Health Hybrid Lehi, UT | Plano, TX

VIEW

Posted 22 hours ago

Lead architecture and full-stack delivery for Collective Health's payments platform, building scalable, secure services that manage employer-sponsored benefits transactions.

Sr. Software Engineer (AI)

Nava Hybrid Remote

VIEW

Posted 22 hours ago

Senior Software Engineer needed to architect and build scalable, user-facing systems and generative AI integrations that improve government services and public programs.

Senior Software Engineer - Backend

Parker Group Inc Hybrid New York

VIEW

Posted 6 hours ago

Parker is hiring a Senior Backend Engineer to lead design and scaling of cloud-native backend systems powering its financial platform for eCommerce merchants.

Senior Engineering Manager - Pro

Kraken Hybrid No location specified

VIEW

Posted 13 hours ago

Lead a distributed engineering team building high-performance Rust microservices for Kraken Pro’s trading platform and help drive the roadmap for secure, scalable trading infrastructure.

Director of Engineering

ThalamusGME Hybrid Remote - US

VIEW

Posted 8 hours ago

Experienced, hands-on engineering leader needed to scale and own Core platform architecture and reliability while directly contributing to code, design, and team growth at a remote-first healthcare recruitment company.

Lead Application Engineer, Zuora Billing and Revenue

Jobgether Hybrid No location specified

VIEW

Posted 8 hours ago

Lead the design and deployment of Zuora Billing & Revenue solutions for a global organization, driving I2C and revenue process optimization while ensuring audit and SOX compliance.

Senior Backend Engineer – FinTech (Remote)

Plural (NY) Hybrid United States

VIEW

Posted 6 hours ago

Experienced backend engineer sought to architect and ship secure, scalable APIs and backend systems for a climate-focused fintech building tokenized energy infrastructure.

Engineering Manager, Frontend Platform

Harvey Hybrid San Francisco

VIEW

Posted 14 hours ago

Lead and grow Harvey's Frontend Platform team to design, build, and operate a shared component library and tooling that powers consistent, performant front-end experiences across the product.

NVIDIA

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

263 jobs

MATCH

Calculating your matching score...

BADGES