NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, a deep understanding of distributed systems, familiarity with software testing and deployment, and excellent communication and planning abilities. We also welcome out-of-the-box thinkers who can provide new ideas with strong at execution bias. Expect to be constantly challenged, improving, and evolving for the better. You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science. What are you waiting for if you're creative, passionate about what you do, and love having fun apply today!
We’re looking for a highly motivated, creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will lead the architecture, design and implementation of our next generation DGX cloud clusters using latest technologies. On this team, you will do full stack deployment including hardware architecture, workload orchestration and application performance tuning. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.
What you’ll be doing:
Lead technical activities for data centers with focus on hybrid deployments between cloud and on-prem
Providing expertise in infrastructure workflows, including hardware, software release, workload orchestration and application tuning
Provide fast and creative solutions for complex problems and write effective, clear and reliable architecture specification
Translate requirements to vision, architecture and roadmap
Work with engineering teams across NVIDIA to ensure your software integrates seamlessly from the hardware all the way up to the AI training applications.
What we need to see:
Masters or PhD in Computer Science, Computer Engineering, Physics or equivalent experience.
9+ years of experience in this field.
Data Sciences, Deep Learning, or Machine Learning coursework
Ability to seamlessly shift between Linux system environments to Python programming
Programming skills in 1 or more high-level languages (C, C++, Go, Rust, etc)
System-level experience with both hardware and software
Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills
Strong design, coding, analytical, debugging and problem-solving skills
Passion for continuous learning and knowledge transfer. Ability to work concurrently with multiple groups locally and abroad in the organization
Ways to stand out from the crowd:
Experience with GPU deep learning and data sciences. Experience using TensorFlow, PyTorch or other DL framework. Experience working with Docker containers, Slurm, Terraform and Kubernetes
CUDA programming and NCCL experience. HPC programming experience including MPI, OpenACC, or other parallel programming tools. Hands-on experience with DGX Cloud, NVIDIA AI Enterprise AI Software, Base Command Manager, NEMO and NVIDIA Inference Microservices.
Interest in crafting, analyzing and fixing large-scale distributed systems.
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is hiring a Principal Software Engineer to design and build scalable, GPU-accelerated database platforms and automation frameworks for high-performance AI workloads.
NVIDIA is hiring a Senior Signal & Power Integrity Engineer to drive 3D EM modeling, system-level SI simulations and lab correlation for next-generation high-speed interfaces.
Lead a growing engineering team at Assured to deliver reliable, scalable claim-processing products while coaching engineers and shaping the roadmap.
Experienced C++/Java software engineer to build and evolve Altera’s FPGA debug and IDE tools, focusing on desktop GUI, multithreaded systems, and tight toolchain integrations.
Shield AI seeks a Senior Perception Engineer to design and deliver real-time object detection, sensor fusion, and state-estimation algorithms for autonomous aircraft operating in challenging environments.
Lead a development team building high-performance, multi-threaded .NET applications and drive best practices across software delivery at Tyson Foods' Springdale site.
Join a fast-growing seed-stage team building a dedicated CI cloud, working on large-scale orchestration and bare-metal infrastructure to speed up CI for hundreds of startups.
Ottimate is hiring a Senior Backend Engineer to drive Django-based fintech and payments features that scale to tens of thousands of customers.
BlackRock is hiring a Director-level System Engineer to architect and deliver scalable, cloud-native data and application platforms for its studio products in Princeton, NJ.
Xplor is seeking a Salesforce Technical Architect to design and deliver scalable Sales Cloud and Service Cloud solutions, integrations and telephony strategies that align with business goals.
Lead and grow an engineering team to design and deliver a centralized pricing and packaging platform that enables scalable monetization across OpenAI's product portfolio.
Experienced Staff Software Engineer needed in Austin (hybrid) to architect and implement backend systems using Java/Micronaut and distributed technologies to support mission-driven government software.
Lead a technical team building real-time perception and sensor-fusion algorithms for autonomous aircraft at Shield AI, a venture-backed defense technology company.
Lead the design and delivery of high-performance React-based full-stack visualization tools for large-scale AD/ADAS data streaming at Woven by Toyota while mentoring junior engineers.
Experienced UI engineer needed to build and maintain high-quality React and Angular front ends integrated with Java/Spring backends at a mature financial-services company.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
70 jobs