ML Infrastructure Engineer (Staff / Principal)

This position is posted by Jobgether on behalf of a partner company. We are currently looking for an ML Infrastructure Engineer (Staff / Principal) in California (USA).

This role offers the opportunity to lead the development and optimization of cutting-edge ML infrastructure for large-scale generative and predictive AI models. You will work at the intersection of machine learning, physics, and computational chemistry, driving scalable, high-performance systems that accelerate AI research in molecular modeling. The position involves designing distributed training pipelines, optimizing GPU operations, and building robust MLOps frameworks that push the boundaries of AI performance. You will collaborate closely with researchers, engineers, and scientists, mentoring junior team members while contributing to long-term technical strategy. This is a hands-on, high-impact role where your work directly enables groundbreaking discoveries in molecular AI.


Accountabilities:
  • Lead engineering efforts for building and scaling distributed ML training and inference infrastructure across GPU clusters and cloud environments.
  • Optimize model efficiency in terms of throughput, latency, memory, and GPU utilization, pushing hardware to its performance limits.
  • Design and implement MLOps tools and frameworks for automated, reliable deployment and evaluation of AI models.
  • Collaborate with researchers and cross-functional teams to integrate infrastructure with generative and predictive AI workflows.
  • Drive long-term platform vision, contributing to architectural decisions, tooling improvements, and best practices.
  • Mentor junior engineers and research interns, fostering a culture of technical excellence and innovation.

Requirements:
  • Extensive experience in distributed ML training and inference on large-scale GPU clusters.
  • Proficiency in PyTorch, PyTorch Lightning, PyTorch Geometric, Ray, or similar frameworks.
  • Strong engineering skills with the ability to design, implement, and maintain robust, scalable systems.
  • Experience optimizing GPU workloads and performance engineering for high-throughput ML pipelines.
  • Independent thinker with a strong sense of ownership and ability to deliver from first principles to production-quality systems.
  • Curiosity and problem-solving mindset for working at the intersection of AI, physics, chemistry, and biology.

Nice to Have:
  • Experience building and maintaining cluster infrastructure with Kubernetes and Terraform.
  • Expertise in GPU programming, XLA, Triton, CUDA, or deep learning compiler stacks.
  • Familiarity with molecular systems (proteins, small molecules, 3D structures), ML force fields, or point cloud data.
  • Experience contributing to highly collaborative, cross-functional teams in research or production ML environments.

Benefits:
  • Competitive salary and equity package.
  • Comprehensive health benefits: medical, dental, and vision fully covered for employees.
  • 401(k) plan.
  • Open (unlimited) PTO policy and paid family leave (maternity and paternity).
  • Life, long-term, and short-term disability insurance.
  • Free meals at office locations and other employee perks.
  • Opportunities for growth, mentorship, and hands-on impact in cutting-edge molecular AI research.


Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.

🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.

📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.

🎯 Based on this analysis, we automatically shortlist the three candidates with the highest match to the role.

🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.

The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.

Thank you for your interest!

 



Average salary estimate

$265,000 / year (est.)
Min: $190,000
Max: $340,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Full-time, hybrid
Date posted: November 30, 2025