NVIDIA is a world-leader in artificial intelligence and computer vision. Our team builds hardware-accelerated computer vision pipelines, cloud services and SDKs bringing the latest AI innovations to data centers, gaming rigs, cars, robots, buildings, medical devices, and more. We are looking for an engineering expert to help us productize and optimize the latest Vision Language Models (VLMs) and their pipelines. Together, we will democratize the use of these amazing models, unlocking all sorts of innovative applications the world is barely dreaming of.
What you'll be doing:
Develop, profile and optimize inference pipelines for VLMs and other AI CV models: improve throughput and latency, data loading, pre- and post-processing.
Improve the efficiency of VLM models themselves: kernel optimization in CUDA
Upstream improvements to SDKs and libraries across NVIDIA and beyond to deliver accelerated computer vision at scale.
Promote high-performance AI computer vision across NVIDIA teams and functions (Engineering, Product Management, Marketing, and more).
What we need to see:
Master's of Science in Computer Science or Electrical engineering or equivalent experience.
8 years practical experience or equivalent
Expertise in AI computer vision (VLMs, Vision Transformers, Diffusion models). Proven track record using its software ecosystem (PyTorch, HuggingFace, vLLM) to develop and release production-grade software.
Excellent software engineering fundamentals (source control, CI/CD, testing/validation, packaging, containerization, release).
Proficiency with Python, C++ and CUDA (kernel optimization)
Experience developing cloud applications (REST APIs, gRPC).
Excellent written, visual, and verbal communication to present performance challenges, tradeoffs, and architectural alternatives.
Curiosity and drive to learn new technologies and partner across teams and functions.
Ways to Stand Out from the Crowd:
Expertise in classical, non-ML computer vision
Strong fundamentals with system-level performance: multi-threaded, multi-process and distributed software development.
Grounding in mathematical fundamentals such as linear algebra, numerical methods, statistics, and exploratory data analysis.
History of creativity and innovation around performance in multiple problem domains.
You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Drive partner adoption of NVIDIA Omniverse and AI-enabled simulation by providing deep technical guidance, integration support, and developer enablement across engineering and executive stakeholders.
NVIDIA is hiring a Senior Manager to lead Robotics QA, driving test engineering, automation, and sim-to-real validation across embedded systems, simulation, and AI models.
Commure seeks a Full Stack Software Engineer on the Scribe Growth team in Mountain View to build and scale AI-driven clinical documentation, infrastructure for audio processing, and EHR integrations.
Vanguard is hiring a paid, full-time summer intern in Malvern for application development—ideal for CS/IT students seeking hands-on web, mobile, and software engineering experience with mentorship in a hybrid, Agile setting.
Experienced Senior Software Engineer needed to build cloud-native, TypeScript-based IoT applications using React and NestJS with serverless Azure deployments and strong automation/testing practices.
As a Production Engineer II on Yahoo's Media Platform team, you'll architect, automate, and operate large-scale cloud infrastructure and observability tooling to improve reliability and developer velocity.
Lead the development and operation of Attentive’s ML platform to enable high-velocity, reliable training and low-latency serving for production ML applications.
Work on cutting-edge, low-level blockchain systems in Rust at Tempo to help scale stablecoin and payments infrastructure across Ethereum execution and consensus layers.
Billd is looking for a Staff Full-Stack Developer to lead full-stack feature delivery, integrate AI tooling, and manage cloud-based production systems across a modern tech stack.
Senior-level Software Integration Engineer to lead integration, automation, and testing of COTS/GOTS software in high-performance computing environments for federal customers.
Planet Fitness is hiring a Senior Principal Engineer to define technical strategy, lead multi-team delivery, and build scalable cloud-native and AI-enhanced products from our Hampton, NH office.
Be part of NVIDIA’s performance engineering team to architect, tune, and validate large-scale GPU-accelerated systems and workflows for AI and datacenter workloads.
At Jasper, a leading AI marketing platform, this Senior Frontend Software Engineer will design and deliver scalable React/TypeScript features that enable advanced AI-driven content and collaboration experiences.
AnaVation seeks a Senior Agentic-AI Software Engineer to architect and deploy production-grade autonomous AI agents for mission-critical intelligence applications.
Experienced Python engineer needed to build scalable APIs and cloud services (GCP) for a consortium-driven higher-education platform in a remote, mission-focused role.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
159 jobs