Browse 34 exciting jobs hiring in Gpu Infrastructure now. Check out companies hiring such as Anthropic, NVIDIA, USAA in Virginia Beach, Ontario, Garland.
Anthropic seeks a Strategic Sourcing Business Partner to lead procurement of compute, cloud, and R&D services, building strategic supplier relationships that accelerate AI research and innovation.
NVIDIA seeks a Senior Software Engineer to design and implement scalable GPU cluster solutions and AIOps-driven operational tools that accelerate internal AI research.
Lead the development of robust, high-performance deep learning training infrastructure for NVIDIA's Autonomous Vehicles group to enable multi-thousand-GPU training and rapid experimentation on massive datasets.
Help architect and scale a production ML inference platform at Tamarind Bio to serve hundreds of biological models and support rapid customer growth.
Lead high-impact cross-functional programs to optimize OpenAI’s global compute infrastructure—improving cost, capacity, and reliability across compute, storage, and networking.
NVIDIA seeks an experienced Senior Director of Product Management to lead product strategy and execution for AI infrastructure platforms that power large-scale ML and HPC workloads.
NVIDIA is hiring a Senior Software Engineer to build and maintain scalable, high-performance GPU cluster platforms that accelerate AI research and reduce operational toil.
NVIDIA seeks a Senior Software Engineer to design and operate scalable ML productivity tooling, CI/CD, and visualization systems that accelerate research and GPU utilization for humanoid robotics initiatives.
Lead a team to design and operate scalable cloud services and telemetry pipelines for NVIDIA's DGX Cloud GPU infrastructure.
Lead the development of scalable compute infrastructure for robotics foundation-model research at NVIDIA, optimizing GPU clusters, job orchestration, and observability for large training and evaluation workloads.
Lead the design and operation of hybrid cloud and bare-metal GPU infrastructure to power high-performance simulation, ML, and factory automation at Atomic Industries.
Lead customer-facing AI infrastructure deployments at NVIDIA, advising on GPU servers, networking, cluster bring-up, and performance debugging to enable large-scale AI systems.
NVIDIA seeks a Senior Solutions Architect to help hyperscale cloud customers design and optimize GPU-based AI/ML and HPC solutions at scale, providing technical leadership, performance analysis, and customer-facing engineering support.
Technical Program Manager for Capacity Tooling to lead development of scalable data pipelines, forecasting dashboards, and allocation solvers that drive compute planning at OpenAI’s San Francisco office.
Lead product strategy and go-to-market for NVIDIA's AI Infrastructure, focusing on inference software, Kubernetes integrations, and customer-driven AI Factory solutions.
Technical and business-minded capacity planning lead needed to translate GPU roadmaps and tenant demand into actionable global data center capacity strategies.
FAR.AI is looking for an Infrastructure Engineer to own and evolve its bare-metal Kubernetes GPU cluster to enable bleeding-edge AI safety research across a growing team.
Bjak seeks an MLOps Engineer to run and scale open-source LLMs into production, optimizing for cost, latency, and reliability while working in a flexible hybrid model.
Lead engineering efforts on LinkedIn's AI Platform to scale model training, feature engineering, and high-performance model serving for large language and recommendation models.
Modal is looking for an Account Executive to build and expand relationships with AI-native startups, guiding founders and engineering teams to adopt and scale on its serverless AI compute platform.
Build scalable, resilient cloud services at Eventual to power multimodal AI data pipelines and production workloads as part of a small, fast-moving SF engineering team.
Eventual is hiring a Software Engineer to design and operate GPU-optimized infrastructure and scalable production systems for multimodal AI workloads.
Lead architecture and operation of scalable ML platform infrastructure at NVIDIA to empower researchers and engineers to train and deploy large-scale models on powerful GPU systems.
Lambda Labs is hiring a Staff Product Manager, Commerce & Operations to drive pricing, billing lifecycle, and commercial strategy for its AI infrastructure products while ensuring operational readiness for scale.
Lead global GPU capacity sourcing and strategic ecosystem partnerships for Novita AI to expand distribution and accelerate GTM growth for a fast-scaling AI infrastructure platform.
Lead the end-to-end architecture and technical strategy for the NIM Factory to deliver enterprise-grade, GPU-accelerated inference services at scale.
Lead engineering for NVIDIA Mission Control to build and operate resilient, AI-enabled cluster automation for large-scale GPU and CPU infrastructure.
Early-career software engineer wanted to help build Eventual’s distributed query engine and cloud service, working primarily from the San Francisco office.
Help power AGI research by building ML infrastructure and tools that make researchers and GPUs dramatically more productive in a hybrid San Francisco role.
Agtonomy is hiring a Senior Software Engineer to scale ML infrastructure and productionize distributed training, data pipelines, and model deployment for autonomy on heavy equipment.
Lead enterprise sales for a public AI cloud provider by driving adoption of AI Studio’s GPU-accelerated infrastructure and GenAI services across large customers.
Flexcompute is recruiting outstanding recent STEM graduates into a multi-year Rising Stars Program to work on GPU, AI, and HPC systems as part of its Boston engineering team.
An experienced systems and ML-inference engineer is needed to lead development of low-latency, high-throughput inference pipelines spanning on-device and cluster deployments.
Maintain and troubleshoot TensorWave's bare-metal Kubernetes clusters to ensure reliable, high-performance infrastructure for cutting-edge AI workloads.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
2
|