Browse 83 exciting jobs hiring in Cuda now. Check out companies hiring such as EchoTwin AI, Waabi, Poolside in Oxnard, Laredo, Omaha.
Apply computer vision expertise to design, optimize, and deploy high-performance edge AI solutions on NVIDIA Jetson platforms for EchoTwin AI's urban sensing systems.
Work on cutting-edge online mapping research and engineering to help Waabi’s self-driving stack adapt to real-world changes across Toronto, San Francisco, and remotely across the US & Canada.
Poolside seeks an experienced engineering member to improve fault tolerance, checkpointing, and recovery across large-scale LLM training and inference infrastructure.
Lead system-level performance engineering for NVIDIA's computer vision pipelines, optimizing data-center and edge workloads using Python, CUDA, and C++ to deliver production-grade, high-throughput solutions.
NVIDIA is hiring a Senior Deep Learning Frameworks Sustaining Engineer to integrate, back-port, and stabilize TensorFlow, PyTorch and TensorRT for enterprise LTS releases.
Senior engineer role to optimize and extend NVIDIA's GPU-accelerated inference stacks (vLLM, SGLang, FlashInfer) for LLMs and generative AI across datacenter and edge accelerators.
Be part of NVIDIA’s performance engineering team to architect, tune, and validate large-scale GPU-accelerated systems and workflows for AI and datacenter workloads.
Lead performance engineering for Vision Language Models at NVIDIA, optimizing end-to-end inference pipelines, CUDA kernels, and SDK integrations to deliver accelerated computer vision at scale.
Drive partner adoption of NVIDIA Omniverse and AI-enabled simulation by providing deep technical guidance, integration support, and developer enablement across engineering and executive stakeholders.
Rescale is hiring a Simulation Engineer, AI & Data to apply cutting-edge AI techniques to CAE/CFD/FEA workflows and deliver scalable cloud-based simulation solutions for engineering customers.
Rescale is hiring a Simulation Engineer, AI & Data to embed cutting-edge AI into simulation workflows and deliver scalable cloud-based HPC solutions for engineering customers.
Rescale seeks an experienced HPC Engineer to drive end-to-end delivery, automation, and optimization of multi-cloud HPC workloads and platform features for enterprise customers.
Lead customer-facing AI infrastructure deployments and performance optimization for groundbreaking large-scale training and inference workloads on NVIDIA's platform.
NVIDIA is seeking a Senior Developer Technology Engineer to develop and optimize GPU-accelerated techniques for high-performance databases, ETL, and data analytics workloads.
Contribute to cutting-edge computer vision and deep learning research at NVIDIA as a PhD intern, developing novel methods and working closely with product and research teams to deliver prototypes and publications.
Lead a high-impact team accelerating LLM inference performance at NVIDIA by combining deep systems expertise, GPU profiling, and cross-functional collaboration.
Contribute to NVIDIA’s Graphics and Simulation research as a PhD intern, developing cutting-edge algorithms and prototypes in rendering, simulation, and GPU-accelerated systems.
NVIDIA invites motivated undergraduate and graduate students to participate in a 12-week Deep Learning Computer Architecture internship, contributing to GPU, deep learning, and high-performance computing projects.
NVIDIA is hiring Computer Architecture interns for paid, hands-on 12-week projects focused on GPU/CPU architecture, VLSI, parallel programming, and high-performance computing.
TensorWave is hiring an AI Infrastructure Engineer to design, operate, and optimize high-performance GPU clusters that power its AI cloud services.
NVIDIA is recruiting PhD researchers for 2026 internships to advance large language and multimodal model research, offering hands-on experience with top LLM teams and opportunities to translate research into products and publications.
NVIDIA seeks motivated Bachelor's, Master's, and PhD students for a 12-week Autonomous Vehicles and Robotics internship to contribute to perception, planning, simulation, and robotics infrastructure projects.
NVIDIA seeks PhD candidates for 2026 hardware research internships to advance cutting-edge VLSI, ASIC, and EDA technologies used in AI and accelerated computing.
NVIDIA is hiring 2026 Systems Software Engineering interns to work full-time for 12 weeks on systems, graphics, compiler, firmware, or security projects that power next-generation GPUs and platforms.
An opportunity to work with NVIDIA’s deep learning teams on algorithm development, framework optimization, and GPU performance tasks during a 12-week paid internship in Santa Clara.
NVIDIA is recruiting PhD students for paid 2026 research internships focused on advancing GPU, architecture, systems, and related AI technologies.
NVIDIA is hiring university students for 12-week Software Engineering internships to contribute to projects in development tools, cloud infrastructure, infrastructure tooling and MLOps using modern software and GPU-accelerated technologies.
Zoox is hiring a Senior Manager to lead perception teams building multi-modal detection and sensor-fusion systems for its autonomous vehicle platform.
Silvaco seeks an Optics & Electromagnetics Intern to work remotely on computational lithography simulations, model calibration, and GPU‑accelerated tool development.
Lead the design and implementation of an LLVM-based JIT backend for NVIDIA GPUs, delivering high-performance code generation and optimizations for next-generation architectures.
Lead the development of scalable simulation and learning pipelines for robotics within NVIDIA’s Isaac team, focusing on Isaac Lab, Mimic, and Omniverse-based workflows.
Lead development of advanced manipulation systems that let Chef Robotics' kitchen robots handle diverse food ingredients reliably at scale.
Be part of WindBorne's Deep Learning team as a Forward-Deployed ML Engineer, developing WeatherMesh models and leading external technical collaborations to ensure real-world impact.
Lead developer and partner engagement to drive adoption of NVIDIA GPUs and SDKs across power generation, creating blueprint reference solutions, SDK integrations, and high-impact developer content for industrial and regulated environments.
NVIDIA is hiring an Enterprise ISV Account Manager to drive AI platform adoption and joint go-to-market success with strategic enterprise software partners.
Lead the development of distributed runtime and orchestration systems (Rust, Kubernetes, Slurm) to enable large-scale, low-latency GPU inference for NVIDIA's Dynamo/Inference Server ecosystem.
A new-graduate software engineer role on NVIDIA's TensorRT team to help design and optimize high-performance deep learning inference software for specialized platforms.
Contribute to aion's inference infrastructure as an ML Inference Platform Intern, learning and implementing high-performance optimization techniques for production GPU systems.
Serve Robotics is hiring an ML Performance Engineer to optimize and deploy real-time ML models on NVIDIA Jetson-based delivery robots in Los Angeles.
Hayden AI is seeking a Staff Software Engineer to produce robust C++ edge applications and optimize ML/vision pipelines for real-time vehicle detection and tracking on Nvidia Jetson devices.
Encord is hiring a seasoned full-stack software engineer to drive end-to-end product development for an enterprise AI platform that prioritizes data quality and scale.
Lead and scale a specialized GPU kernels team at Modular to design, optimize, and ship high-performance compute kernels that power the MAX GenAI inference platform.
Contribute to NVIDIA’s embedded Linux and Jetson platform development as an Embedded Systems Software Intern, focusing on C/C++ software, kernel internals and performance optimization.
Lead developer relations for GPU-accelerated solutions across grid, industrial and data-center power domains, connecting OT systems to AI-enabled IT infrastructures.
Help engineer the inference backbone at Together AI, optimizing global request routing, autoscaling, and multi-tenant systems to serve cutting-edge generative models at scale.
Contribute to a cross-disciplinary team in Palo Alto building full-stack web applications and backend infrastructure to serve generative AI models for next-generation biological and medical research.
Voltage Park is looking for an experienced Infrastructure Engineer specializing in InfiniBand/NCCL to optimize and scale GPU networking for large AI/HPC clusters.
Lead customer-facing GPU performance benchmarking and trial validation at a seed-stage AI infrastructure startup focused on an open-access GPU marketplace.
Lead development of scalable, photorealistic simulation environments and pipelines to power robotics research and training at OpenAI's San Francisco site.
Help accelerate production LLM inference at .txt by optimizing multi-GPU pipelines, kernel performance, and deployment reliability for structured generation workloads.
Below 50k*
0
|
50k-100k*
10
|
Over 100k*
72
|