Browse 16 jobs hiring in vLLM now. Companies hiring include dottxt, Awesome Motive, and NVIDIA, with roles in Chesapeake, Virginia Beach, and Irvine.
Work on distributed Python and Rust systems at .txt to build and maintain products like dotjson that guarantee structured LLM output and enable reliable AI applications.
Mid-level AI Engineer needed to design, fine-tune, and deploy LLM-based systems and RAG pipelines for mission-critical cyber capabilities at Twenty's Arlington office.
Senior engineer role to optimize and extend NVIDIA's GPU-accelerated inference stacks (vLLM, SGLang, FlashInfer) for LLMs and generative AI across datacenter and edge accelerators.
Lead performance engineering for Vision Language Models at NVIDIA, optimizing end-to-end inference pipelines, CUDA kernels, and SDK integrations to deliver accelerated computer vision at scale.
Lead a high-impact team accelerating LLM inference performance at NVIDIA by combining deep systems expertise, GPU profiling, and cross-functional collaboration.
Lead the Dynamo engineering team at NVIDIA to architect and deliver a high-performance, scalable LLM inference platform for real-time and multi-node AI workloads.
Palo Alto Networks is hiring a Sr Principal Software Engineer to lead backend and model-serving infrastructure development for ATP Cloud services in Santa Clara, focusing on scalable, high-performance cloud-native systems.
Lead the design and deployment of production AI systems at VORTO, focusing on LLM fine-tuning, RAG-based retrieval, and low-latency inference to optimize supply-chain operations.
Contribute to aion's inference infrastructure as an ML Inference Platform Intern, learning and implementing high-performance optimization techniques for production GPU systems.
Lead development of enterprise-grade generative and conversational AI systems that power Valence’s AI-first leadership coaching platform and drive product innovation at scale.
NVIDIA is hiring a Systems Software Engineer to develop and evaluate cloud-native AI inference systems, agentic workflows, and developer-focused content that leverage GPU-accelerated frameworks.
Help engineer the inference backbone at Together AI, optimizing global request routing, autoscaling, and multi-tenant systems to serve cutting-edge generative models at scale.
Lead the design and operation of scalable, observable, and secure ML compute infrastructure to ensure reliable, reproducible, and auditable deployments at Zyphra.
Help accelerate production LLM inference at .txt by optimizing multi-GPU pipelines, kernel performance, and deployment reliability for structured generation workloads.
NVIDIA seeks a new-graduate Deep Learning Software Engineer to design and optimize inference kernels, compilers, and runtimes that accelerate LLMs and other high-impact AI workloads.
Build and optimize high-performance inference infrastructure for large foundation models at a fast-moving, well-funded AI startup in Menlo Park.
Salary breakdown: Below $50k: 0 | $50k–$100k: 1 | Over $100k: 14