Job details

GPU Kernel Compiler Engineer, AI Inference

NVIDIA’s AI and GPU software is at the forefront of computing fueling breakthroughs across deep learning, LLMs, and intelligent applications. Our team is building solutions for rapid development and deployment of GPU kernels for AI systems. We take the latest AI models, rigorously analyze them, develop and deploy high-performance GPU kernels that define model performance and integrate the derived techniques and methodologies into the tools that automate this process.

This role is a unique opportunity to shape the next generation of AI performance and efficiency. You will work hands-on with emerging AI models, collaborating across compiler, AI inference, and model performance teams. The focus is on building programming solutions that can be applied to concrete AI inference use cases to deliver real-world performance and development efficiency wins.

What you will be doing:

Analyze state-of-the-art AI models, identifying key performance bottlenecks and opportunities at the kernel level.
Develop, optimize, and evaluate both hand-tuned and compiler-generated kernels for inference workloads, balancing speed and flexibility.
Design and build high-level DSLs and innovative compiler infrastructure to increase kernel developer productivity while achieving near peak performance.
Collaborate with model AI inference and compiler teams to iterate on kernel fusion, auto tuning, and sophisticated GPU programming techniques.
Benchmark performance across real workloads, diagnose root causes, and rapidly deploy optimizations that maximize hardware utilization on NVIDIA platforms.

What we need to see:

Bachelor’s, master’s or PhD degree in Computer Science, Computer Engineering or related field, or equivalent experience.
At least 3+ years Strong C++ and/or Python programming skills for system and performance engineering.
Understanding of GPU architecture and proficiency in CUDA programming.
Intellectual curiosity and interest to solve exciting problems and deliver practical results in production environments.

Ways to stand out from the crowd:

Experience designing, developing and optimizing high-efficiency GPU kernels for modern AI workloads.
Experience building compilers, domain-specific languages, or automatic optimization systems
Familiarity with popular compiler, GPU programming and AI frameworks such as MLIR, LLVM, PyTorch, XLA, Triton or Cutlass.
Experience with AI/ML inference workloads and model performance analysis.
Strong communication skills and ability to collaborate in a cross-team environment.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 27, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

GPU CUDA C++ Python compiler MLIR LLVM Triton Cutlass inference kernel auto-tuning performance DSL AI inference kernel compiler engineer

NVIDIA Glassdoor Company Review

4.6

NVIDIA DE&I Review

No rating

CEO of NVIDIA

Jensen Huang

Approve of CEO

Average salary estimate

$217750 / YEARLY (est.)

min

max

$148000K

$287500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Director, XR Product and Partner Management

NVIDIA Hybrid US, CA, Santa Clara

GPU Kernel Compiler Engineer, AI Inference

Average salary estimate

Sign up for our weekly newsletter of fresh jobs

Sign up for our weekly
newsletter of fresh jobs