
Member of Technical Staff — Inference

About the Role

As a Member of Technical Staff (Inference), you’ll push the limits of inference frameworks, refine our agent architecture, and build the benchmarks that define performance at scale. You’ll own the systems that take our frontier models from the lab into lightning-fast, production-ready services.

This is not a maintenance role — you’ll be experimenting with the latest serving research, optimizing for every millisecond, and shipping infrastructure that our researchers and products depend on daily.

Responsibilities

  • Architect and optimize high-performance inference infrastructure for large foundation models

  • Benchmark and improve latency, throughput, and agent responsiveness

  • Work with researchers to deploy new model architectures and multi-step agent behaviors

  • Implement caching, batching, and prioritization to handle high-volume requests

  • Build monitoring and observability into inference pipelines

Qualifications

  • Strong experience in distributed systems and low-latency ML serving

  • Skilled with performance optimization tools and techniques, with experience delivering critical performance gains

  • Hands-on with vLLM, SGLang, or equivalent frameworks

  • Familiarity with GPU optimization, CUDA, and model parallelism

  • Comfort working in a high-velocity, ambiguity-heavy startup environment

What makes us interesting

  • Small, elite team of ex-founders, researchers from top AI labs, top CS grads, and engineers from top companies

  • True ownership: You will not be blocked by bureaucracy, and you will ship meaningful work within weeks rather than months

  • Serious momentum: We're well funded by top investors, moving fast, and focused on execution

What we do

  • Ship consumer products powered by cutting-edge AI research

  • Build infrastructure that supports both research and product

  • Pursue cutting-edge research that will open up new consumer product forms

The Details

  • Full-time, onsite role in Menlo Park

  • Startup hours apply

  • Generous salary, with additional benefits to be discussed during the hiring process

Average salary estimate

$170,000 / year (est.)
Min: $120,000
Max: $220,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

DATE POSTED
August 12, 2025