Job details

Staff Software Engineer, Inference (Bay Area / Paris / Remote)

What You’ll Do

Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics
Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization
Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks
Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)
Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks

What You’ll Bring

Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years)
Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go)
Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling
Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments
System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness

CUDA Triton Python C++ Rust Inference Low-latency Quantization Kernel GPU Distributed systems ML infrastructure Profiling Batching Scheduling On-device Robotics Staff engineer Kernel development Throughput

Average salary estimate

$240000 / YEARLY (est.)

min

max

$180000K

$300000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Staff Software Engineer, Controls (Bay Area / Paris)

Genesis AI Hybrid No location specified

VIEW

Posted 10 hours ago

Lead the design and deployment of advanced control, state estimation, and trajectory optimization systems for general-purpose robots, working closely with hardware and algorithm teams.

Staff Software Engineer, Training (Bay Area / Paris / Remote)

Genesis AI Hybrid No location specified

VIEW

Posted 10 hours ago

Lead the optimization and scaling of distributed training infrastructure for foundation models, improving wall-clock convergence by tuning data pipelines, kernels, and multi-node systems.

Site Reliability Engineer (SRE) - CloudVision

Arista Networks Hybrid Remote, OR, Ireland

VIEW

Posted 10 hours ago

Arista is hiring a Senior Site Reliability Engineer to manage and scale the global CloudVision service fleet running on Kubernetes, ensuring reliability, observability, and automated operations.

Software Engineer (Professional Services Team)

Instructure Hybrid Salt Lake City, UT

VIEW

Posted 11 hours ago

Health Savings Account (HSA)

Dental Insurance

Vision Insurance

Disability Insurance

Flexible Spending Account (FSA)

Family Medical Leave

Paid Holidays

Instructure is hiring a Software Engineer on the Professional Services team to build custom integrations, deliver customer-focused solutions, and mentor fellow engineers.

Engineering Manager

Smalls Hybrid No location specified

VIEW

Posted 16 hours ago

Smalls is hiring an Engineering Manager who will split time between hands-on engineering and team leadership to scale product and systems for a fast-growing DTC subscription business.

Software Engineer - Full Stack Robotics Intern Summer 2026

Reframe Systems Hybrid Andover

VIEW

Posted 4 hours ago

Work onsite with Reframe Systems' robotics and engineering team to develop software for robotic manipulation workcells and production automation in our Andover micro-factory during Summer 2026.

2026 Associate Software Engineer

Demiurge Studios Hybrid Boston, MA

VIEW

Posted 1 hour ago

Demiurge Studios is hiring an Associate Software Engineer in Boston to implement, test, and iterate on game systems alongside cross-disciplinary teams for console, PC, and mobile projects.

Staff Software Engineer, Training (Bay Area / Paris / Remote)

Genesis AI Hybrid No location specified

VIEW

Posted 10 hours ago

Lead the optimization and scaling of distributed training infrastructure for foundation models, improving wall-clock convergence by tuning data pipelines, kernels, and multi-node systems.

Software Engineer, IoT Reliability (remote in North America)

MLabs Hybrid No location specified

VIEW

Posted 2 hours ago

Lead the reliability and observability efforts for a large IoT fleet at MLabs, improving device health through monitoring, tooling, and cross-functional collaboration.

Software Engineer, Ray Data

Anyscale Hybrid No location specified

VIEW

Posted 10 hours ago

Work on Ray Datasets to improve large-scale data processing, performance, and stability at Anyscale, contributing to an open-source platform used by teams running production ML workloads.

Platform Engineer

Standard Template Labs Hybrid New York City

VIEW

Posted 12 hours ago

Help build and operate the core platform powering an AI-first enterprise SaaS at a fast-moving, venture-backed startup based in Midtown Manhattan.

Director, Programmable Vision Accelerator Software

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 24 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Senior technical leader needed to steer PVA system software and DSP SDK development for NVIDIA’s Tegra mobile SoC platform, driving architecture, roadmap, and team execution.

Developer Technology Engineer, AI - New College Graduate 2025

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 9 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

NVIDIA is hiring a new college graduate AI Developer Technology Engineer to develop, optimize, and deploy high-performance deep learning solutions on GPUs while collaborating with research, architecture, and software teams.

Junior Software Engineer/Developer

Saalex Hybrid No location specified

VIEW

Posted 17 hours ago

Spalding, a Saalex Company, is hiring a Junior Software Engineer to support DoD-focused web and cloud modernization efforts with a hybrid schedule in Patuxent River, MD.

Audio Software Engineering Intern (Summer 2026)

Motorola Solutions Hybrid Plantation, FL

VIEW

Posted 13 hours ago

Motorola Solutions is hiring a Summer 2026 Audio Software Engineering Intern to help prototype and evaluate mission-critical audio algorithms and lab automation in a hybrid Plantation, FL role.

G Genesis AI

4 jobs

MATCH

Calculating your matching score...

FUNDING

Other

DEPARTMENTS

Software Engineering

SENIORITY LEVEL REQUIREMENT

Senior Level

TEAM SIZE

No info