Our technology has no boundaries! Nvidia is building the most modern and groundbreaking compute platforms globally for widespread use. It’s because of our work that scientists, researchers and engineers can advance their ideas. At its core, our visual computing technology not only enables an amazing computing experience, but it is also energy efficient! We pioneered a supercharged form of computing loved by the most demanding computer users in the world - scientists, designers, artists, and gamers. It’s not just technology though! It is our people, some of the brightest in the world, and our diverse company culture make NVIDIA one of the most fun, innovative and dynamic places to work in the world! At the center of NVIDIA's culture are our core values like innovation, excellence and determination and team, that guide us to be the best we can be.
We are looking for a Senior Network Validation Engineer to lead & hands on contribute to Network validation activities in the Datacenter Systems Engineering team. You'll work closely with solutions, Network & Storage architects, HW system engineers, validation engineers, OEM/ODMs, and AE teams to ensure product validation and test coverage are optimal for Data Center scale AI products. The ideal candidate is self-motivated, works well with different teams, very comfortable in a lab environment and demonstrates passion towards product level validation. They should have strong debug analysis fundamentals as well as automation and scripting experience. They must be capable of thriving in fast paced environment with evolving product definitions.
What you’ll be doing:
Design validation plans from bare metal to at scale data center integration tests.
Debug, triage issues, perform root cause analysis, verify fixes, define new tests, and improve product test plans.
Configure, administer, troubleshoot, and oversee the qualification of Ethernet and InfiniBand networks in large-scale datacenter environments.
Perform server function & network validations including Ethernet & InfiniBand protocol & system level reliability test end to end application tests.
Design, develop, and maintain automation frameworks and test automation suites, including automated reporting, while consistently increasing end-to-end automation coverage with each release cycle.
Track and coordinate all validation activities from bring up to production release.
Collaborate with multi-functional teams including application teams, HW designers, networking team, FW, security etc. to debug any HW/SW product issues.
Provide inputs to architecture teams for next generation Data Center networking design.
What we need to see:
M.S. degree in Engineering/Computer Science/related field (or equivalent experience).
10+ years of experience.
Over 5 years of proven experience in Software Quality Engineering and Network Testing, including significant contributions to QA strategies and test documentation.
Strong skills in Python (preferred) or other scripting languages like Perl, Shell and hands-on experience with Jenkins or similar CICD based pipelines
Strong technical abilities, problem solving, designing, coding and debugging skills
Extensive hands-on experience in configuring and troubleshooting data center networking, including Layer 2/Layer 3 protocols such as VLAN, BGP, EVPN, and spine-leaf topology & InfiniBand networks experience desired.
Experience with using test tools from Ixia or Spirent and working experience in test management
Hands on experience working on Unix or Linux based OS
Great team player with multi-tasking ability and good interpersonal & documentation skills
Solid foundation in and understanding of software engineering practices
Excellent design, debugging and problem-solving skills, with a strong bias for action, quality and engineering excellence.
Ways to stand out from the crowd:
Certificate in CCIE (Routing & Switching / Service Provider / Data Center).
Demonstrated experience with RDMA (Remote Direct Memory Access) technologies and related protocols such as InfiniBand or RoCE.
Knowledge or experience of AI Data Center validation with GPU clusters.
Experience in REST API & Kubernetes and background in network automation tools like Ansible, Jenkins & Robot framework.
Experience in IPv6 & Telemetry at a Data Center scale with Observability tools like Grafana & Prometheus preferred
NVIDIA is widely considered one of the technology world’s most desirable employers. We employ some of the most forward-thinking and talented people in the world. Are you passionate about joining our life work to amplify human imagination and intelligence? If you are creative, collaborative, and have a passion for creating custom silicon solutions that power forward-looking computing systems, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 160,000 USD - 253,000 USD.You will also be eligible for equity and benefits.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Senior marketing leader needed to lead global esports and competitive gaming programs for GeForce, driving partnerships, campaigns, and GTM for competitive PC gaming.
Lead software validation and automation for datacenter-scale storage and infrastructure to ensure high-quality AI/HPC platforms at NVIDIA.
Sargent & Lundy seeks a senior Licensing Engineering Consultant with deep nuclear licensing experience to lead regulatory submissions and safety evaluations for new-build and fuel-cycle projects.
Peraton is hiring an Enterprise Services Engineering Manager to lead cloud and enterprise infrastructure engineering efforts supporting national security missions.
AECOM is hiring a mid-level Civil Engineer with 6+ years of aviation experience to lead and support airfield and landside design projects in the Seattle region.
Sargent & Lundy is hiring a Transmission Line Engineer 3 to perform design, analysis, and PLS-CADD modeling for 69–765 kV transmission lines in a hybrid Peachtree Corners role.
AECOM is hiring an entry-level Mechanical Engineer in Arlington, VA to assist with HVAC design, BIM-based modeling, and construction documentation on building projects.
A hands-on summer internship at Moog focused on failure investigation and root-cause analysis for students in mechanical, aerospace, or related engineering programs.
SGS is looking for a Senior Project Engineer to lead and manage electrical product safety compliance testing and certification activities at the Suwanee, GA laboratory.
AtkinsRéalis is hiring a Principal Geotechnical Tailings Engineer to lead technical design and multidisciplinary teams for tailings, heap leach and mine waste projects in North America.
SCS Engineers seeks an Associate Professional in Greenwood Village, CO to provide engineering support for landfill and solid waste projects, including design, permitting, field sampling, and construction oversight.
CACI seeks an Embedded Systems Technician in San Antonio to develop and integrate TPS and automated aviation test equipment for military aviation customers.
Early-career Water Resources Engineer (EIT) needed to perform hydrologic/hydraulic modeling, GIS support, and stormwater design for HEI's Maple Grove, MN office.
Lead and scale cross-functional AI engineering efforts to deliver production-grade LLM, VectorDB, and cloud ML solutions while mentoring top-tier ML talent and driving organizational AI strategy.
Saronic Technologies seeks a Marine Auxiliary Machinery Engineer to design, model, and validate propulsion and fluid power systems for next-generation autonomous surface vessels.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
220 jobs