NVIDIA has been transforming computer graphics and accelerated computing for more than 25 years. In the AI era, it’s a unique legacy of innovation that’s fueled by great technology and amazing people. Our team builds state-of-the-art AI models for video streaming and broadcasting. Our models are deployed on the NVIDIA Maxine platform for real-time video communication and content creation (https://developer.nvidia.com/maxine). Our AI technologies are also offered in NVIDIA Broadcast App to enhance the video and voice in live streaming and conference calls.
We are now looking for outstanding engineers to join the NVIDIA AI for media team. You will work alongside brilliant engineers on core technologies to solve ambitious computer vision and deep learning problems, especially building and optimizing real-time AI solutions that could run anywhere on cloud or premise.
What You’ll Be Doing:
Develop highly efficient and low cost AI models and algorithms for computer vision and video AI
Optimize the performance, latency and power consumption of AI models on low power processors for deep learning acceleration
Deploying deep learning models and optimize the inference stack for real-time performance
Deliver the benefits of NVIDIA’s latest hardware and platform software innovations to the Deep Learning
Closely collaborate with different deep learning software and hardware teams across NVIDIA to influence roadmaps and deliver solutions
What We Need To See:
Strong experience of building and optimizing innovative AI model architectures for video use cases
Strong experience of developing efficient models with model pruning, distillation, post-quantization and quantization aware training
Experience with analyzing and fine-tuning deep learning pipeline performance
Experience with building real-time AI models for laptop and cloud use cases
Hands-on development skills using deep learning libraries and frameworks such as PyTorch/TensorFlow/ONNX, TensorRT/Triton/WinML and other neural processing SDKs
Collaboration ability to define project scope and roadmap together with the team while independently drive development effort with strong self-motivation
8+ years of relevant engineering or research background in deep learning and/or computer vision
BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, or related fields (or equivalent experience)
Ways to stand out from the crowd:
Experience with AI inference accelerating hardware and building/optimizing models on them
Background with performance and latency analysis, profiling and tuning of AI workloads
Experience with CUDA programming, as well as a real passion for optimizing AI system performance
Experience of building platforms for computer vision such as real-time tracking of human face, gaze and body, as well as avatar animation and modeling.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. As a part of NVIDIA, we have the opportunity to influence the future with your vision and expertise. Are you creative? We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is seeking a Manager, ML Engineering to lead teams building GPU-accelerated ML libraries and high-performance vector-search tooling for community and enterprise users.
NVIDIA seeks a Principal Systems Software Engineer to lead development of CUDA/C++ libraries that accelerate Apache Spark I/O and DataFrame processing for multi-node GPU deployments.
Work on core automation systems at Jerry.ai to build scalable backend pipelines and user-facing automations that enable self-service insurance workflows for millions of customers.
AnaVation is hiring a Junior Software Developer to design, test, and maintain secure full-stack web applications for mission-critical programs in Washington, DC.
AnswersNow is hiring a remote Lead Full Stack Engineer to lead a small team and drive architecture, delivery, and quality for its React/Node.js telehealth platform.
Help build and scale internal tooling and AI-driven automation at Ramp to empower operations teams and improve customer experiences.
Senior Backend Engineer needed to build scalable, Kotlin-based financial tools and shape backend architecture at a rapidly growing B2B SaaS startup in New York.
Aviagen is hiring a Full-Stack Developer to build and maintain production web applications at its Huntsville global headquarters, working across front-end and back-end systems to deliver business-driven solutions.
Lead a talented engineering team at Visa's RaIS group to design, build, and operate scalable consumer authentication services that power global card-not-present transactions.
Seeking a seasoned Software Engineer with deep experience in algorithm design, large-data processing, database interfaces, and TS/SCI clearance to develop and lead complex software solutions for mission-critical systems.
Experienced backend engineer needed to develop and scale core Cortex server-side systems, work cross-functionally on complex distributed systems, and drive production reliability at Palo Alto Networks.
Experienced .NET developer needed to design and maintain internal full-stack applications for U-Haul’s manufacturing and operations teams in Tempe, AZ.
Palo Alto Networks is hiring a Principal Machine Learning Platform Engineer to architect and scale a high-performance ML inference platform for the Prisma AIRS AI security product.
Whatnot is hiring an experienced Fullstack Engineer on Seller Merchandising to design and build high-performance listing, checkout, and seller tools using modern web technologies.
Lead the Web Core Product & Chrome Extension engineering efforts at Speechify, owning ML inference deployments, production reliability, and performance improvements for a fast-growing, remote-first text-to-speech product.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
280 jobs