Machine Learning Intern - Vision-Language-Action Pretraining

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team in Automated Driving, Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavior Models, and Robotics.


This is a paid 12-week internship for Fall 2025/Winter 2026 at TRI’s Los Altos headquarters. Please note that this is a hybrid role that includes in-office work.


The Mission

Make general-purpose robots a reality.


The Challenge

We envision a future where robots assist with household chores and cooking, aid the elderly in maintaining their independence, and enable people to spend more time on the activities they enjoy most. To achieve this, robots need to be able to operate reliably in messy, unstructured environments. Our mission is to answer the question “What will it take to create truly general-purpose robots that can accomplish a wide variety of tasks in settings like human homes with minimal human supervision?” We believe the answer lies in using large-scale datasets of physical interaction from a variety of sources and building on the latest advances in machine learning to learn general-purpose robot behaviors from this data.


The Team

Our goal is to revolutionize the field of robotic manipulation, enabling long-horizon dexterous behaviors to be efficiently taught, learned, and improved over time in diverse, real-world environments.

Within this broader mission, the Pretraining team focuses on building foundation models that can effectively bridge visual, language, and robotic domains. We combine large-scale model training with empirical validation in simulation and on physical robots, emphasizing both fundamental research advances and practical capabilities. Our work spans computer vision, multi-modal learning, and robotic control, with particular focus on scaling up model architectures, training data (both action-free data and data containing robot actions), and training approaches that generalize effectively to physical manipulation tasks.


The Internship

As a Research Scientist Intern, you will conduct research in robot foundation model pretraining alongside our core technical team. You'll work on developing and implementing large-scale multi-modal models that bridge visual, language, and robotic domains and validate them on our simulated and physical robot fleet.


For this internship specifically, the project will focus on pretraining autoregressive Vision-Language-Action (VLA) models, building on recent advances such as MolmoAct, with the goal of grounding multimodal reasoning in action spaces for robotics. In addition to large-scale autoregressive modeling, the work will involve co-training across multiple modalities -- including text, vision, depth, and action trajectories -- to enable richer representations and more robust cross-modal alignment. Explorations in this area will primarily focus on the data space (filtering web data, training on images and videos, etc.) while also leaving room for experimenting with novel architectural choices. The ideal candidate has some background in LLM/VLM training, as well as in evaluating models in robot simulation or on real-world tasks.


Responsibilities
  • Advance the state of the art in training large-scale robot foundation models, and validate the impact of that research on real-world benchmarks and robots.
  • Work as part of a dynamic, close-knit research team.
  • Implement high-performance machine-learning pipelines and optimize data and learning stacks for scalability, efficiency, and performance.
  • Present results in verbal and written communications at international conferences, internally, and via open-source contributions to the community.
  • Collaborate with internal research scientists, our engineering team, and our partner labs at top academic research universities including MIT, Stanford, Berkeley, CMU, Columbia, and Princeton to drive pioneering research at scale.


Qualifications
  • Currently pursuing a Ph.D. in Machine Learning, Robotics, or related fields.
  • Publications at high-impact conferences/journals (e.g., NeurIPS, ICML, ICLR, RSS, CoRL, CVPR, ACL) on some of the aforementioned topics.
  • Passionate about large-scale challenges in ML grounded in physical systems, especially in the space of robotics.
  • Proficiency with one or more coding languages and systems, preferably Python, Unix, and a Deep Learning framework (e.g., PyTorch).
  • Ability to work in collaboration with other researchers and engineers to invent and develop interesting research ideas.
  • Experience training large-scale foundation models (LLMs, VLMs, diffusion models, etc.) is desirable.
  • Familiarity with robots and the challenges inherent in conducting research on physical hardware platforms is desirable.


Please add a link to your Google Scholar profile and include a full list of publications when submitting your CV for this position.


The pay range for this position at commencement of employment is expected to be between $45 and $65/hour for California-based roles; however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. Note that TRI offers a generous benefits package including vacation and sick time. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.


Please refer to this Candidate Privacy Notice, which describes the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.


TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.


It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.




TRI's mission is to improve the quality of human life through advances in artificial intelligence, automated driving, and robotics.

EMPLOYMENT TYPE
Internship, hybrid
DATE POSTED
September 10, 2025