We are now looking for a Senior System Software Engineer to work on Dynamo. NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building back-end services and software to make design and deployment of new AI models easier and accessible to all users.
What you'll be doing:
In this role, you will develop open source software to serve inference of trained AI models running on GPUs. You will balance a variety of objectives: build robust, scalable, high performance software components to support our distributed inference workloads; work with team leads to prioritize features and capabilities; load-balance asynchronous requests across available resources; optimize prediction throughput under latency constraints; and integrate the latest open source technology.
What we need to see:
Master's or PhD, or equivalent experience
3+ years in Computer Science, Computer Engineering, or related field
Ability to work in a fast-paced, agile team environment
Excellent Rust/Python/C++ programming and software design skills, including debugging, performance analysis, and test design.
Experience with high scale distributed systems and ML systems
Ways to stand out from the crowd:
Prior work experience improving performance of AI inference systems.
Background in deep learning algorithms and frameworks, especially experience with Large Language Models and frameworks such as PyTorch, TensorRT, and ONNX Runtime.
Experience building and deploying cloud services using HTTP REST, gRPC, protobuf, JSON and related technologies.
Familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most expert and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field of Deep Learning and Artificial Intelligence.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 218,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.