Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a hardworking Solution Architect (SA) to join the DGX Cloud SA Segment Team. The mission of the DGX Cloud Segment team is to guide and enable the successful adoption at scale of DGX Cloud and NVIDIA AI Enterprise Software in production.
NVIDIA DGX Cloud is an AI platform for developers, researchers, and enterprises, optimized for the demands of Generative AI. The DGX Cloud SA team is dedicated to shaping the future of DGX Cloud by actively gathering and incorporating partner feedback and product requirements. Our team will help optimize the onboarding process for NVIDIA Cloud Partners, ensuring fast time to insights and exceptional user experience. Additionally, we will collaborate with internal teams to scale expertise and knowledge through training and the creation of repeatable guides. Our focus on building reliable infrastructure, partner qualifications, and assets will streamline onboarding, ultimately increasing adoption of DGX Cloud.
What you’ll be doing:
Work closely with DGX Cloud Partners, become their trusted technical advisor, advocate for their needs, and ensure they are successful in accomplishing their business goals with the platform.
Accelerate NVIDIA Cloud Partner onboarding time, cluster manageability and reliability.
Scale knowledge, reach, and opportunities by building and educating vertical teams and communities on DGX Cloud and NVIDIA Reference Architectures.
Communicate to our Reference Architecture teams findings gathered from the field.
Provide technical education and facilitate field product feedback to improve DGX Cloud.
Enable partners to participate in the DGX Cloud Ecosystem with the goal of end-user satisfaction and increased sales.
What we need to see:
Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science (or equivalent experience)
5+ years of proven experience with one or more Cloud Service Providers (AWS, Azure, GCP or OCI), NVIDIA Cloud Partners (CoreWeave, Lambda Labs, Crusoe, etc) and cloud-native architectures and software.
Demonstrated experience in technical leadership, strong understanding of NVIDIA technologies, and success in working with customers.
Expertise with parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni Path, RoCE, and Gig-E).
Strong coding and debugging skills, and demonstrated expertise in one or more of the following areas: Machine Learning, Deep Learning, Slurm, Kubernetes, MPI, MLOps, LLMOps, Ansible, Terraform, and other high-performance AI cluster solutions.
Proficient in deploying GPU applications in Slurm, Kubernetes, docker, helm, registries
Linux-based configuration management and monitoring solutions, system administration, OS installation, configuration, and troubleshooting
Networking technologies (e.g. router, firewall, load balancer, DNS, VPN) for complex infrastructure configuration
Ways to stand out from the crowd:
Experience using DGX Cloud, NVIDIA AI Enterprise AI Software including Base Command Manager, NeMo, and NVIDIA's Inference Microservices.
Experience with AI application development and deployment
Background with deploying and configuring observability tooling including Grafana, Prometheus, W&B, Nagios, Zabbix
Experience with high performance or large-scale computing environments.
You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is hiring a Senior Software Security Compiler Engineer to design and implement compiler-level security and hardening across LLVM/GCC and proprietary toolchains.
Drive cross-team alignment and timely delivery for NVIDIA’s Digital Human Technology initiatives as a Technical Program Manager coordinating research-to-release programs in generative AI and 3D/animation domains.
Lead the alternative investments consulting effort within Wealth & Investment Management, advising financial advisors and portfolio managers on private investment solutions and platform strategy.
Join Cryptio as an Enterprise Solutions Consultant to lead enterprise onboarding and act as a trusted advisor helping institutions adopt crypto-grade finance and reporting at scale.
Experienced financial examiner (EIC) needed to lead complex insurance financial examinations and regulatory consulting engagements for a national regulatory services firm.
SOAIS is hiring an Oracle Cloud OTM Functional Consultant (local to TN or MN) to lead OTM Cloud configurations, implementations, and integration efforts across transportation and logistics modules.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
137 jobs