For more than 25 years, NVIDIA has changed the landscape of digital imaging, personal gaming, and high-performance computing. Our success depends on reliable, informative telemetry and data systems that provide real-time insight into our sophisticated, distributed infrastructure. As an engineer on our team, you will play a key role in building the next generation of observability for a diverse set of complex workloads. You will transform raw telemetry data into actionable insights, and you will architect, develop, and maintain infrastructure that monitors workload health, performance, and usage in critical engineering systems, so that our global teams can work at peak efficiency. This role offers an outstanding mix of core software engineering, data management, and workload observability.
What you'll be doing:
Collaborate closely with internal chip design teams to understand their workflows and determine observability needs to help improve the overall efficiency of our chip development process.
Design, build, and maintain robust and scalable platforms and infrastructure for capturing, storing, processing, and visualizing the data collected from chip build workflows.
Maintain and update the observability tools and systems to meet the needs of new/evolving chip design workflows.
Keep up to date with recent developments in observability tools, frameworks, and strategies, and advocate for their integration within the organization.
What we need to see:
A BS or higher degree in Computer Science, or equivalent experience.
4+ years of professional experience developing and managing observability infrastructure.
Familiarity with EDA (Electronic Design Automation) workflows and tools used in the semiconductor industry.
Proficiency in programming and scripting using Python and Perl. Familiarity with databases, containerized applications, and observability stack components. Experience building data pipelines for a compute cluster using open-source technologies, and building custom components as needed. Experience with C++ is a plus.
Solid grasp of software engineering principles and methodologies such as OOP and CI/CD. Ability to break ambiguous problems down into concrete, solvable pieces.
Excellent communication and collaboration skills. Ability to adapt to a fast-paced environment with evolving requirements.
Ways to stand out from the crowd:
Background knowledge in accelerated computing (parallel programming) or experience running CPU-vectorized or GPU-based workloads, even if not directly tied to observability.
Hands-on experience in developing user interfaces using technologies such as HTML, CSS, JS, ReactJS or VueJS.
A passion for improving engineering productivity and efficiency with a data-driven philosophy.
You will also be eligible for equity and benefits.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.