LLM Inference Jobs

Browse 24 exciting jobs hiring in LLM Inference now. Check out companies hiring such as webAI, Jobgether, and Modular (CA) in Milwaukee, Anaheim, and Mesa.

Posted 3 days ago

Lead the strategy and delivery of distributed inference, LLM integrations, and on-device ML features at webAI to enable privacy-first, enterprise-grade AI on the edge.

Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.

Modular (CA) Hybrid United States / Canada
Posted 5 days ago

Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.

Posted 5 days ago

Help design and operate scalable, multi-cloud LLM inference infrastructure at Modular as a Backend Engineer focused on distributed systems and ML inference.

Lead technical product strategy and execution for webAI’s distributed inference and on-device LLM platform, partnering closely with engineering and research to deliver enterprise-grade AI solutions.

Posted 7 days ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Senior Software Developer to drive low-level, high-performance AI networking and inference infrastructure using C/C++/Rust, GPU kernels and RDMA at NVIDIA.

Posted 7 days ago
Transparent & Candid
Collaboration over Competition
Inclusive & Diverse
Growth & Learning

Build secure, scalable infrastructure and governance systems for enterprise AI agents as a Software Engineer on Rubrik's Agent Cloud team.

d-Matrix is hiring a Senior Staff ML Researcher to develop and implement algorithmic and numerical techniques that optimize LLM inference on next-generation DNN accelerators at its Santa Clara hybrid headquarters.

Coinbase is hiring a Machine Learning Platform Engineer to design and operate low‑latency inference, streaming pipelines, and distributed training infrastructure that powers fraud detection, personalization, and blockchain analysis.

Posted 12 days ago

Lead Developer Relations on the West Coast to grow Featherless’s open-model community, create technical demos and content, and represent the platform at events and hackathons.

Thomson Reuters Hybrid USA-New York-3 Times Square
Posted 14 days ago

Lead end-to-end development of large-scale AI and deep learning solutions at Thomson Reuters Labs, driving production-grade LLM, retrieval, and data-pipeline capabilities across legal and news products.

Posted 17 days ago

Lead the Dynamo engineering team at NVIDIA to design, build, and operationalize high-performance, fault-tolerant LLM inference and GenAI serving infrastructure.

Posted 17 days ago

Lead the design and optimization of large-scale AI inference systems at NVIDIA, developing high-performance kernels, compilers, and orchestration for state-of-the-art models.

Mercor Hybrid San Francisco
Posted 18 days ago

Mercor is seeking an early-career Data Scientist to run experiments, build dashboards, and prototype models that improve matching and evaluation at its San Francisco headquarters.

Lead a talented engineering team to design, build, and operate large-scale LLM serving and model deployment infrastructure that powers personalized recommendations at scale.

Anduril Industries Hybrid Reston, Virginia, United States
Posted 19 days ago

Anduril is hiring a Software Engineer, AI in Reston to build, optimize, and deploy real-world ML/LLM systems that power mission-critical defense and intelligence capabilities.

Posted 20 days ago

Lead the GenAI Platform engineering team at Abridge to design, deliver, and operate LLM workflows, agentic systems, and retrieval/evaluation infrastructure for clinical AI products.

Capital One is hiring a Senior Lead AI Engineer to design and productionize foundational LLM, inference, and agentic AI systems that are scalable, cost-efficient, and responsible.

Help shape GPU-accelerated inference and AI infrastructure as a Spring intern working on CUDA, models, and scalable training/inference systems in San Francisco.

Posted 27 days ago

NVIDIA is hiring a Senior Software Development Engineer to build and optimize TensorRT-LLM inference software that powers large-scale generative AI on GPUs.

Unify Hybrid No location specified
Posted 28 days ago

Work on cutting-edge production AI systems at Unify, building agents, retrieval, and inference infrastructure to power the next generation of go-to-market products.

Work as a hands-on engineering intern building GPU-optimized AI infrastructure and inference systems with a San Francisco-based team.

Posted 28 days ago

Relace is hiring a hands-on Machine Learning Engineer to optimize GPU kernels, performance-tune large-scale ML systems, and productionize cutting-edge models from its SF FiDi office.

Palo Alto Networks is hiring a Principal Machine Learning Platform Engineer to architect and scale a high-performance ML inference platform for the Prisma AIRS AI security product.

How much do LLM inference jobs pay?

Below 50k*: 0 (0%)
50k-100k*: 0 (0%)
Over 100k*: 2 (100%)
*average yearly salary (USD)