Browse 10 exciting jobs hiring in Nccl now. Check out companies hiring such as Periodic Labs, NVIDIA, FM in Tucson, San Diego, New York.
Periodic Labs seeks an experienced Distributed Training Engineer to optimize and operate frontier-scale LLM training systems that power AI-driven scientific research.
Lead technical engagements with top customers to architect, benchmark, and optimize large-scale AI and HPC solutions using NVIDIA GPU platforms.
NVIDIA seeks a Senior Solutions Architect to help hyperscale cloud customers design and optimize GPU-based AI/ML and HPC solutions at scale, providing technical leadership, performance analysis, and customer-facing engineering support.
Provide expert-level diagnostics and engineering support for NVIDIA's multi-GPU datacenter platforms, helping customers resolve hardware and workload issues while contributing tooling and product improvements.
Lead the optimization and scaling of distributed training infrastructure for foundation models, improving wall-clock convergence by tuning data pipelines, kernels, and multi-node systems.
Provide advanced technical support and troubleshooting for AI infrastructure customers, specializing in InfiniBand, NVLink, and GPU cluster technologies at NVIDIA.
NVIDIA is hiring a Solutions Architect to drive design and deployment of next-generation GPU cloud infrastructure with strategic cloud partners and customers.
Lead the design and implementation of next-generation test frameworks and stress tests to validate and improve performance, reliability, and data integrity of NVIDIA datacenter GPU systems.
Poolside seeks an experienced engineering member to improve fault tolerance, checkpointing, and recovery across large-scale LLM training and inference infrastructure.
Senior engineer role to optimize and extend NVIDIA's GPU-accelerated inference stacks (vLLM, SGLang, FlashInfer) for LLMs and generative AI across datacenter and edge accelerators.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
7
|