Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront of technology, contributing to high-performance AI inference solutions for specialized platforms and applications. Your fresh perspective and technical skills will help shape the performance and functionality of our products, ensuring NVIDIA remains synonymous with innovation. If you're ready to tackle challenging projects, push the boundaries of AI performance, and make a significant impact in a company that values creativity, excellence, and teamwork, we want to hear from you!
What you'll be doing:
Contribute to the design and development of high-performance deep learning inference software using modern C++
Collaborate with teams across the hardware and software stack to understand and leverage new technologies to improve TensorRT's functionality and performance
Participate in the development of robust, high-quality C++ code in alignment with modern C++ standards
Support systematic reasoning about test plans from unit to integration level
Assist in documenting the properties of functions, classes, and systems to improve robustness
Contribute to performance optimization and benchmarking efforts
Help develop new features and capabilities for TensorRT to serve specialized customer needs
What we need to see:
Master's or PhD in a relevant field (Computer Engineering, Computer Science, Electrical Engineering, AI) or equivalent experience
Strong foundational C++ skills, including familiarity with C++11 and C++14 or newer standards
Familiarity with the C++ Standard Template Library (STL)
Familiarity with modern deep learning models and inference frameworks
Interest in performance optimization and systems programming
Demonstrated ability to take initiative and see projects through to completion
Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems
Ways to stand out from the crowd:
Experience with Python and/or CUDA through coursework, internships, or personal projects
Exposure to systems programming, embedded systems, and/or compiler concepts
Experience in software performance analysis, profiling, or optimization techniques
Knowledge of C++17 or later standards
Understanding of computer architecture, memory management, or parallel computing concepts
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, come join our team and help us build the future of high-performance AI inference technology!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD. You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.