ABOUT BASETEN
Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market fast. Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re trusted by leading AI-driven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to deliver industry-leading performance, security, and reliability for their mission-critical workloads. With our recent $75M Series C funding, we’re growing fast to make AI accessible across all products.
THE ROLE
As an intern at Baseten, you’ll work on real projects and contribute to systems and products that help our users ship their ML products. You won't be off in a corner doing "intern work" – we're moving too fast and have too much to build for that. Instead, you'll receive hands-on support and mentorship to help you grow fast and make a real impact quickly.
Engineering interns can join one of our four teams:
Core Product: You’ll help build the core Baseten developer workflows that enable users to get value out of ML models. The Core Product team is at the forefront of new product development across a large surface area, including model APIs, training, and dedicated deployment.
Forward Deployed Engineering: You will partner closely with our customers to understand their problems and engineer ML solutions. This role provides a unique front-row view into the opportunities and challenges facing companies implementing ML and AI solutions at scale.
Model Performance: You will implement, refine, and productionize cutting-edge techniques (e.g. quantization, speculative decoding, KV cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
Infrastructure: You'll architect and support development of our ML inference platform that powers production AI applications. You'll make technical decisions for the infrastructure enabling developers to deploy, scale, and monitor ML models with high performance and reliability.
RESPONSIBILITIES
Own small projects end-to-end, functioning as both an engineer and a project manager, with a focus on user empathy, project specification, and end-to-end execution
Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
Fix bugs and resolve customer issues with urgency
Help drive long-term improvements to reliability of systems and velocity of development
Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates
REQUIREMENTS
2+ prior internships or research experiences
Minimum 3-month internship commitment
Working towards a Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or a related field. Advanced coursework in machine learning, systems, and infrastructure is a plus. Graduating in 2025 or 2026.
A 5-day workweek, during which you will be in-office in San Francisco or New York a minimum of 3 days a week
Familiarity with building tools for technical audiences
Proficient coding abilities in one or more popular programming or scripting languages
BENEFITS
Competitive compensation package.
This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.
An inclusive and supportive work culture that fosters learning and growth.
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.