Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Staff Software Engineer, Inference (Bay Area / Paris / Remote) image - Rise Careers
Job details

Staff Software Engineer, Inference (Bay Area / Paris / Remote)

What You’ll Do

  • Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics

  • Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization

  • Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks

  • Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)

  • Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks

What You’ll Bring

  • Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years)

  • Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go)

  • Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling

  • Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments

  • System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness

Average salary estimate

$240000 / YEARLY (est.)
min
max
$180000K
$300000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Lead the design and deployment of advanced control, state estimation, and trajectory optimization systems for general-purpose robots, working closely with hardware and algorithm teams.

Lead the optimization and scaling of distributed training infrastructure for foundation models, improving wall-clock convergence by tuning data pipelines, kernels, and multi-node systems.

Photo of the Rise User

Arista is hiring a Senior Site Reliability Engineer to manage and scale the global CloudVision service fleet running on Kubernetes, ensuring reliability, observability, and automated operations.

Photo of the Rise User
Health Savings Account (HSA)
Dental Insurance
Vision Insurance
Disability Insurance
Flexible Spending Account (FSA)
Family Medical Leave
Paid Holidays

Instructure is hiring a Software Engineer on the Professional Services team to build custom integrations, deliver customer-focused solutions, and mentor fellow engineers.

Photo of the Rise User
Smalls Hybrid No location specified
Posted 16 hours ago

Smalls is hiring an Engineering Manager who will split time between hands-on engineering and team leadership to scale product and systems for a fast-growing DTC subscription business.

Work onsite with Reframe Systems' robotics and engineering team to develop software for robotic manipulation workcells and production automation in our Andover micro-factory during Summer 2026.

Photo of the Rise User

Demiurge Studios is hiring an Associate Software Engineer in Boston to implement, test, and iterate on game systems alongside cross-disciplinary teams for console, PC, and mobile projects.

Lead the optimization and scaling of distributed training infrastructure for foundation models, improving wall-clock convergence by tuning data pipelines, kernels, and multi-node systems.

Lead the reliability and observability efforts for a large IoT fleet at MLabs, improving device health through monitoring, tooling, and cross-functional collaboration.

Photo of the Rise User
Anyscale Hybrid No location specified
Posted 10 hours ago

Work on Ray Datasets to improve large-scale data processing, performance, and stability at Anyscale, contributing to an open-source platform used by teams running production ML workloads.

Posted 12 hours ago

Help build and operate the core platform powering an AI-first enterprise SaaS at a fast-moving, venture-backed startup based in Midtown Manhattan.

Photo of the Rise User
Posted 24 hours ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Senior technical leader needed to steer PVA system software and DSP SDK development for NVIDIA’s Tegra mobile SoC platform, driving architecture, roadmap, and team execution.

Photo of the Rise User
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

NVIDIA is hiring a new college graduate AI Developer Technology Engineer to develop, optimize, and deploy high-performance deep learning solutions on GPUs while collaborating with research, architecture, and software teams.

Photo of the Rise User
Posted 17 hours ago

Spalding, a Saalex Company, is hiring a Junior Software Engineer to support DoD-focused web and cloud modernization efforts with a hybrid schedule in Patuxent River, MD.

Photo of the Rise User

Motorola Solutions is hiring a Summer 2026 Audio Software Engineering Intern to help prototype and evaluate mission-critical audio algorithms and lab automation in a hybrid Plantation, FL role.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
September 10, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!