MLE Intern, ML Runtime & Optimization (Spring 2026, Master/PhD)

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities

The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment and monitoring.

As a Machine Learning Engineer Intern in ML Runtime & Optimization, you will develop technologies to advance the training and inference of AI models in autonomous driving systems.

This includes:

  • Perform in-depth analysis and optimization of model training and deployment to achieve state-of-the-art performance and efficiency in autonomous driving.
  • Work across the entire AI framework/compiler stack (e.g. Torch, CUDA and TensorRT), support model development, and prototype key deep learning algorithms.
  • Analyze the trade-offs between performance, cost and energy for autonomous driving.
  • Collaborate closely with diverse groups at Pony.ai to influence the next-generation compute platform HW and SW design.
  • Research the latest model architectures, programming models and hardware.

Qualifications:

  • Currently enrolled in a Master's or PhD program in a relevant discipline.
  • Strong programming skills in C/C++ or Python.
  • Solid understanding of CPU or GPU execution models (e.g. threads, registers, caches, memory) and of cost/performance trade-offs.
  • Experience in benchmarking, profiling and validating performance.
  • Strong communication skills and the ability to work cross-functionally between software and hardware teams.

Preferred Qualifications:

Experience in one or more of the following areas is preferred:

  • Experience with parallel programming: CUDA, ROCm, Triton, Cutlass, etc.
  • Experience in computer vision, image processing, machine learning and deep learning.
  • Experience in model optimization techniques such as quantization, pruning, etc.
  • Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks.
  • Strong knowledge of software design, programming techniques and algorithms.
  • Strong knowledge of common deep learning frameworks and libraries.
  • Strong knowledge of system performance, GPU optimization or ML compilers.

Note

  • This position is fully onsite in Fremont for at least 3 months.

Compensation

  • Master: $7,000/month
  • PhD: $10,000/month

Pony.ai, an autonomous driving startup based in China and Silicon Valley, has raised a $100 million extension to its last funding round. Pony.ai, founded in 2016, makes an autonomous driving system called PonyAlpha aimed at facilitating driverless...

Employment type: Internship, onsite
Date posted: December 20, 2025