Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
ML Inference Platform Intern (6 months) image - Rise Careers
Job details

ML Inference Platform Intern (6 months)

About AION

AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and full stack AI/ML lifecycle.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team across India, London and SF. 

Who You Are

You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.

Key Responsibilities

  • Learn and implement ML inference optimization techniques including KV-cache management, dynamic batching, and quantization under mentorship.
  • Contribute to GPU optimization projects using CUDA with hands-on learning of Triton kernel development and performance tuning.
  • Build model benchmarking and evaluation frameworks to assess performance across different models and optimization strategies.
  • Research and experiment with trending open-source models (DeepSeek R1, Qwen 3, Llama variants) to understand optimization opportunities.
  • Implement cost-performance analysis tools to understand tradeoffs between speed, quality, and resource usage.
  • Explore agent system implementations and multi-step reasoning workflows for future platform capabilities.
  • Document learning and create technical guides for internal team knowledge sharing and customer education.

Skills & Experience

  • High agency individual with strong willingness to experiment and learn with the team.
  • Previous internships or projects in ML infrastructure, contributions using PyTorch/ML frameworks, competitive programming achievements, research experience in ML systems, familiarity with agent systems or reasoning techniques.
  • Strong coding and implementation skills in Python and C++ with demonstrated ability to write performant, production-quality code.
  • Experience reading and contributing to large codebases with proof of open-source contributions (GitHub profile required).
  • Proof of technical work through projects like Google Summer of Code, hackathon wins, competitive programming, or significant open-source contributions.
  • Working knowledge of deep learning fundamentals including neural networks, transformers, and basic training/inference concepts.
  • Basic understanding of PyTorch including model development and tensor operations.
  • Fundamental knowledge of GPU computing or strong willingness to learn CUDA programming.
  • Working knowledge of at least one inference framework (vLLM, TensorRT-LLM, Hugging Face) through coursework or personal projects.
  • Understanding of distributed systems concepts and performance optimization principles.
  • Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
  • Learn from world-class engineers and gain hands-on experience with cutting-edge inference optimization techniques.
  • Work with a high-caliber, globally distributed team backed by major VCs.
  • Significant learning and growth opportunity in one of the fastest-moving areas of AI infrastructure.
  • Competitive internship compensation with potential for full-time conversion.
  • Fast-paced, flexible work environment with room for ownership and impact.

Average salary estimate

$90000 / YEARLY (est.)
min
max
$60000K
$120000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Posted 5 hours ago

Lead and grow Anrok's Infrastructure team to design, operate, and scale the cloud systems that power global, compliant digital commerce.

Photo of the Rise User

Design and implement embedded C++ software for Joby’s electric, fly-by-wire aircraft while collaborating with systems and hardware teams to deliver safety-critical flight software.

Photo of the Rise User
Brillio Hybrid Phoenix, Arizona, United States
Posted 10 hours ago

Senior Salesforce Order Management Architect wanted to lead the design and delivery of scalable OMS solutions, integrations, and orchestration flows for a leading digital transformation services provider.

Lead the design and implementation of internal AI agents on Basis's Atlas team to make the company agent-native and scale production-quality agentic systems for accounting.

Photo of the Rise User
Posted 21 hours ago

Lead cloud architecture and infrastructure automation at Coupang to design and operate highly available, scalable, and efficient platform services for global teams.

Photo of the Rise User

Lead architecture and full-stack development for data acquisition, time-series databases, APIs, and React UIs to accelerate pulser production and Demonstration System operations at Pacific Fusion.

Posted 20 hours ago

Strala, a seed-backed AI startup transforming insurance claims, seeks a Forward Deployed Engineer to build, deploy, and iterate production LLM and ML solutions in partnership with customers in San Francisco.

Photo of the Rise User
Posted 1 hour ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays

As a Forward Deployed Engineer at Handshake AI, you’ll design and ship custom full‑stack solutions for partner AI labs, owning projects from discovery through deployment in a fast-moving, customer-facing role.

Photo of the Rise User
Posted 20 hours ago

Lead architecture and scale distributed backend services for Lightspark's Spark, a Bitcoin-native Layer 2 for fast, low-cost global payments.

Posted 2 hours ago

Lead architecture and delivery for Toyota’s North American Supplier Quality software products, driving cloud-native development, production support and cross-functional alignment.

Photo of the Rise User

Genentech's Solutions team is hiring a software engineer to design and implement scalable cloud-native data pipelines and applications that accelerate computational biology and drug discovery.

Posted 17 hours ago

OutSystems is hiring a Software Engineer to build AI agent features in its low-code platform, working remotely and learning under senior engineers.

Photo of the Rise User

Senior Full Stack Software Engineer to develop React front-ends and Go back-end services for a combined Transcarent + Accolade healthcare platform.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Internship, remote
DATE POSTED
August 26, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!