vLLM Jobs

Browse 20 vLLM jobs hiring now. Check out companies hiring, such as Prime Intellect, Fiddler AI, and NVIDIA, in Miami, Baton Rouge, and Newport News.

Posted 7 hours ago

Work at the intersection of RL, post-training evaluation, and production agent infrastructure to shape and deploy agentic AI systems used by real customers.

Posted 4 days ago

Fiddler AI is hiring a Staff Backend Engineer to architect and build scalable backend systems and observability pipelines for LLMs and agentic applications at an early-stage, mission-driven company.

Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Technical Marketing Engineer needed to produce developer-focused technical content, samples, and benchmarks that demonstrate and improve NVIDIA's AI platform software usability.

Lead the architecture and delivery of a scalable, secure AI infrastructure platform while building and mentoring a high-caliber engineering organization at the Texas Institute for Electronics.

Posted 9 days ago

Senior Software Engineer to join LinkedIn's AI Platform team to design and optimize large-scale training, feature-engineering, and serving infrastructure for LLMs and recommendation systems.

Linux Foundation Hybrid 548 Market St PMB 57274, San Francisco, CA
Posted 9 days ago

Lead community strategy for the PyTorch Foundation by building relationships across projects like PyTorch, vLLM, and DeepSpeed to grow a collaborative open-source AI developer ecosystem.

Posted 9 days ago

Lead the zero-to-one design and implementation of a high-throughput, low-latency LLM inference stack as an early engineering hire at an SF-based AI startup.

Nebius Hybrid Amsterdam, Netherlands; Berlin, Germany; London, United Kingdom; Prague, Czech Republic; Remote - Europe; Remote - United States; United States
Posted 11 days ago

Join Nebius AI Studio to build and scale a high-performance inference platform that makes deploying foundation models fast, reliable, and effortless at massive scale.

Lead the design and implementation of cloud-native backend and AI model-serving infrastructure on GCP for a mission-driven cybersecurity team.

Lead development of scalable, high-performance GCP-based backend and model-serving infrastructure for the ATP Cloud team at Palo Alto Networks.

Senior technical leader partnering with sales to architect and deliver enterprise Red Hat solutions—spanning RHEL, OpenShift, Ansible, hybrid cloud, and AI/LLM GPU services—while mentoring teams and driving strategic customer outcomes.

NVIDIA Hybrid US, CA, Santa Clara
Posted 18 days ago

NVIDIA seeks a Senior Research Engineer to design, implement, and scale open-source post-training and RL algorithms for Nemotron generative AI models.

Posted 18 days ago

Join Tonic.ai as an NLP-focused Machine Learning Engineer to design, fine-tune, and productionize LLM-based systems that detect and redact sensitive data and power synthetic data products.

Drive extreme-performance LLM inference and industry benchmarking at NVIDIA by optimizing vLLM and MLPerf workloads on cutting-edge NVIDIA GPUs.

Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Accelerate and scale OpenAI’s inference stack on AMD GPUs by driving kernel performance, distributed execution, and communication-library integration across large GPU clusters.

Posted 23 days ago

Lead the architecture and build-out of scalable, fault-tolerant systems for Crusoe’s managed AI inference platform to serve LLMs at massive scale.

Posted 26 days ago

Work across frontend and backend systems at Compa to build scalable, production-grade software powering enterprise compensation intelligence.

Posted 28 days ago

Gimlet Labs is hiring a Software Engineer (AI Performance) to drive model and GPU-level performance improvements for production-scale inference in San Francisco.

Posted 28 days ago

BentoML seeks an Inference Optimization Engineer to accelerate LLM inference across GPUs and distributed serving stacks, reducing latency and GPU costs while contributing to open-source tooling.

Posted 28 days ago

Produce and scale safe, cost-efficient LLM inference for global AI products as an ML Ops Engineer on a hybrid, high-impact team at Bjak.

How much do vLLM jobs pay?

Below 50k*: 0 (0%)
50k-100k*: 0 (0%)
Over 100k*: 19 (100%)
*average yearly salary (USD)