Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Staff Software Engineer, Training (Bay Area / Paris / Remote) image - Rise Careers
Job details

Staff Software Engineer, Training (Bay Area / Paris / Remote)

What You’ll Do

  • Drive down wall-clock time to convergence by profiling and eliminating bottlenecks across the foundation model training stack stack, from data pipelines to GPU kernels

  • Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU clusters, ensuring scalability, robustness, and high utilization

  • Implement efficient low-level code (CUDA, cuDNN, Triton, custom kernels) and integrate it seamlessly into high-level training frameworks

  • Optimize workloads for hardware efficiency: CPU/GPU compute balance, memory management, data throughput, and networking

  • Develop monitoring and debugging tools for large-scale runs, enabling rapid diagnosis of performance regressions and failures

What You’ll Bring

  • Deep experience in distributed systems, ML infrastructure, or high-performance computing (8+ years)

  • Production-grade expertise in Python

  • Low-level performance mastery: CUDA/cuDNN/Triton, CPU–GPU interactions, data movement, and kernel optimization

  • Scaling at the frontier: experience with PyTorch and training jobs using data, context, pipeline, and model parallelism

  • System-level mindset with a track record of tuning hardware–software interactions for maximum utilization

Average salary estimate

$255000 / YEARLY (est.)
min
max
$180000K
$330000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Lead the design and deployment of advanced control, state estimation, and trajectory optimization systems for general-purpose robots, working closely with hardware and algorithm teams.

Photo of the Rise User
Posted 19 hours ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Customer-Centric
Social Impact Driven
Rapid Growth
Maternity Leave
Paternity Leave
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Paid Holidays
Paid Time-Off

Senior Software Engineer to help design and ship scalable, AI-driven fleet safety features across web and API surfaces for Samsara’s Connected Operations Cloud.

Photo of the Rise User
Posted 16 hours ago

Build accessible, performant front-end experiences using modern JavaScript and React/Next.js while collaborating with design and backend teams in a remote-first environment.

Photo of the Rise User
Posted 12 hours ago

A technically fluent journalist role focused on building data-driven interactive applications and visualizations to support investigative reporting at ProPublica.

Photo of the Rise User
Pinterest Hybrid San Francisco, CA, US; Remote, US
Posted 3 hours ago

Experienced engineering manager wanted to lead Pinterest’s User Understanding backend team building large-scale data pipelines and ML-serving infrastructure to power personalization for hundreds of millions of users.

Photo of the Rise User
Posted 13 hours ago

Experienced engineering leader sought to manage and mentor backend engineers building scalable, observable microservices systems for a growing energy technology company.

Photo of the Rise User
Posted 7 hours ago

As a Software Engineering Co-op at VIAVI Solutions, you will gain hands-on experience developing and testing C/C++ software for network validation systems, contributing to design reviews and product improvements.

Photo of the Rise User
Posted 10 hours ago

Tomo is hiring a Senior Back End Software Engineer to lead design and implementation of scalable Python microservices and shape platform architecture for a fully remote U.S. engineering team.

Photo of the Rise User
Posted 21 hours ago

Alphatec Spine seeks a Senior Site Reliability Engineer to improve uptime, automation, and observability for its Informatix cloud platform.

Photo of the Rise User

Help the Ethereum Foundation lower barriers to adoption for ERC-4337 and EIL by building developer tools, plugins, and multichain testing frameworks.

Photo of the Rise User

Lead the design and implementation of Go-based, containerized cloud-security services at Illumio to provide real-time visibility and breach containment across multi-cloud environments.

Photo of the Rise User
bem Hybrid San Francisco
Posted 5 hours ago

bem is hiring a Platform Engineer to design and operate multi-cloud data and GPU compute infrastructure that powers a high-accuracy AI platform for enterprise workflows.

Photo of the Rise User
Posted 18 hours ago

Experienced full-stack developer needed to design and maintain cloud-native services and front-end applications for enterprise reporting and workflow solutions at PowerPlan.

Posted 16 hours ago

Toyota is hiring a Product Security Development Engineer to manage CI/CD, deployment pipelines, and security-focused automation for connected-vehicle software at our Plano site.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
September 10, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!