Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Senior Service Reliability Operations Administrator image - Rise Careers
Job details

Senior Service Reliability Operations Administrator

NVIDIA's NGC team is looking for highly motivated System Administrator/DevOps engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary levels of support for our Cloud products and services. As a key member of the CIS Team (Compute Infrastructure Support), you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other partners to help make our services capable of providing near 100% availability. On the rare occasion that an incident occurs, you will be our front line to decrease the frequency and duration of any issue. Working in partnership with the development community the CIS team will develop monitors, alarms, and alerts to help make the service more reliable and improve our customer experience.

What you will be doing:

  • The team will provide their services 24/7 with a follow-the-sun environment which will span continents. You will report directly to a manager in the United States.

  • Some CIS shifts require either a Saturday or Sunday each week. The hours worked may include an early or late start (10hrs-per-day x 4 days-per-week schedule) to ensure that the combination the US and India teams provide 24/7 coverage.

  • Every CIS team member will use alerts and alarms to help prevent issues and incidents when possible. You may also work with the developer community to develop and implement predictive support or diagnostic routines.

  • Perform systems administration tasks, network administration tasks, security incident monitoring to drive our actions.

  • CIS team members will work with developers to learn how the service works, then translate that understanding into runbooks which the entire team will use. As new features and functionality are added, you will also update and evolve the runbooks as needed.

  • Help discover incidents and issues, including initiating the incident management procedure. Bring in subject matter authorities or service owners as needed to resolve issues. Feedback will help us continually improve our service.

  • Your interpersonal skills will help keep the team engaged through resolution and ensure our clients believe we value their time and effort.

  • May perform other tasks that will help us provide extraordinary service levels for our customers.

What we need to see:

  • 5+ years of experience administering open system servers in a Production environment. 3+ years of experience working in demanding Internet, Cloud, or Telecommunications environments in a Systems Administration, DevOps, SRE, or NOC role.

  • B.S. in relevant disciplines or equivalent experience.

  • Expertise using monitoring tools and problem ticketing systems.

  • Strong problem-solving, analytical, and troubleshooting abilities.

  • Strong server administration experience. Shell scripting, automation, DNS, DHCP, storage concepts, basic networking, IP Tables, etc. RHCE or equivalent level of knowledge.

  • Experience scripting in Python preferred, but not required. Prior experience running virtual machines under open source or commercial hypervisors. Experience operating services running on public or private clouds.

  • Knowledge and understanding of application containers and container orchestration systems. Basic understanding of Git.

  • Experience performing system administration tasks using Ansible. Prior experience analyzing system and network performance using monitoring alerts, data, and graphs.

  • Demonstrate ability to master and maintain complicated environments.

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and innovative compute platforms in the world. It’s because of our work that scientists, researchers and engineers can advance their ideas. At its core, our visual computing technology not only enables an amazing computing experience, it is energy efficient! We pioneered a supercharged form of computing loved by the most demanding computer users in the world - scientists, designers, artists, and gamers. It’s not just technology though! It is our people, some of the brightest in the world, and our company culture make NVIDIA one of the most fun, innovative and dynamic places to work in the world! At the center of NVIDIA's culture are our core values like innovation, excellence and determination and team, that guide us to be the best we can be.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 3, and 140,000 USD - 224,250 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 26, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Glassdoor Company Review
4.6 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
NVIDIA DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of NVIDIA
NVIDIA CEO photo
Jensen Huang
Approve of CEO

Average salary estimate

$174125 / YEARLY (est.)
min
max
$124000K
$224250K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Posted 22 hours ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

NVIDIA's DGX Cloud is hiring a Senior Customer Success Engineer to help internal teams adopt and optimize cloud AI platforms across compute, storage, and networking.

Photo of the Rise User
NVIDIA Hybrid US, CA, Santa Clara
Posted 18 hours ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

NVIDIA is hiring a System Validation Engineer to lead post-silicon validation of GPUs and SoCs, focusing on functional, stress, PVT, and power/performance testing across Windows and Linux environments.

UMD Hybrid University of Maryland College Park
Posted 7 hours ago

The Robert H. Smith School of Business is hiring a Senior AV Specialist to lead AV system support, programming, and project coordination for classrooms and event spaces on the College Park campus.

Photo of the Rise User
Posted 21 hours ago

Lead the technical and cybersecurity architecture for a high-impact federal modernization program, consolidating legacy systems into a secure, cloud-native environment.

Photo of the Rise User

Senior technical leader needed to architect and govern enterprise COTS/SaaS solutions, integrations, and vendor relationships for a healthcare genomics company operating in regulated environments.

Posted 18 hours ago

Lead design, troubleshooting, and operations for a multi-cloud, multi-site enterprise network as a Senior Network Engineer supporting UltraViolet Cyber's delivery team.

Photo of the Rise User
Posted 18 hours ago

Experienced cloud security engineer needed to design secure AWS environments, integrate ICAM and Zero Trust, and drive ATO achievement for a DoD-aligned modernization program.

Photo of the Rise User
Posted 5 hours ago

Novanta is hiring a Senior Network Engineer to design and operate its global WAN, cloud interconnects, and enterprise security infrastructure for a distributed, mission-critical technology company.

Posted 15 hours ago

nVent’s Digital Leadership Development Program is a three-year rotational early-career program seeking recent graduates to build ERP and business-process expertise across global digital teams.

TAMUS Hybrid College Station, TX
Posted 8 hours ago

Support ARCMTS as a Security Engineer I to improve endpoint and infrastructure security, conduct assessments, and help research teams meet regulatory and grant-related security requirements.

Photo of the Rise User

Senior Cloud Architect needed to lead cloud-first architecture, security, automation and multi-tenant managed-services strategy for Smile Digital Health’s FHIR-based platform across Azure/AWS.

Photo of the Rise User
NBCUniversal Hybrid 904 Sylvan Ave, Englewood Cliffs, NEW JERSEY
Posted 12 hours ago

Lead the design and delivery of enterprise-wide infrastructure and content security services for Versant/NBCUniversal, ensuring resilient, compliant, and modern defenses across cloud, broadcast, and corporate environments.

Photo of the Rise User
Posted 5 hours ago

Serve as the in-office technical liaison for Hexagon US Federal in Huntsville, bridging remote application SMEs and on-site infrastructure to resolve complex support issues.

Photo of the Rise User
Posted 6 hours ago

Lead AHEAD's Managed Security delivery organization to oversee SOC operations, incident response, and service strategy for enterprise clients.

Photo of the Rise User
Posted 19 hours ago

Harvard University is hiring a Business Systems Analyst to develop and support Oracle Planning/Hyperion models, calc scripting, and Groovy business rules while collaborating with stakeholders to deliver scalable planning solutions.

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

191 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Diversity ChampionBadge Family FriendlyBadge Global CitizenBadge Work&Life Balance
CULTURE VALUES
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
September 24, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!