Browse 24 jobs hiring in LLM Inference now. Check out companies hiring such as webAI, Jobgether, and Modular (CA) in Milwaukee, Anaheim, and Mesa.
Lead the strategy and delivery of distributed inference, LLM integrations, and on-device ML features at webAI to enable privacy-first, enterprise-grade AI on the edge.
Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.
Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.
Help design and operate scalable, multi-cloud LLM inference infrastructure at Modular as a Backend Engineer focused on distributed systems and ML inference.
Lead technical product strategy and execution for webAI’s distributed inference and on-device LLM platform, partnering closely with engineering and research to deliver enterprise-grade AI solutions.
NVIDIA is hiring a Senior Software Developer to drive low-level, high-performance AI networking and inference infrastructure using C/C++, Rust, GPU kernels, and RDMA.
Build secure, scalable infrastructure and governance systems for enterprise AI agents as a Software Engineer on Rubrik's Agent Cloud team.
d-Matrix is hiring a Senior Staff ML Researcher to develop and implement algorithmic and numerical techniques that optimize LLM inference on next-generation DNN accelerators at its Santa Clara hybrid headquarters.
Coinbase is hiring a Machine Learning Platform Engineer to design and operate low‑latency inference, streaming pipelines, and distributed training infrastructure that powers fraud detection, personalization, and blockchain analysis.
Lead Developer Relations on the West Coast to grow Featherless’s open-model community, create technical demos and content, and represent the platform at events and hackathons.
Lead end-to-end development of large-scale AI and deep learning solutions at Thomson Reuters Labs, driving production-grade LLM, retrieval, and data-pipeline capabilities across legal and news products.
Lead the Dynamo engineering team at NVIDIA to design, build, and operationalize high-performance, fault-tolerant LLM inference and GenAI serving infrastructure.
Lead the design and optimization of large-scale AI inference systems at NVIDIA, developing high-performance kernels, compilers, and orchestration for state-of-the-art models.
Mercor is seeking an early-career Data Scientist to run experiments, build dashboards, and prototype models that improve matching and evaluation at its San Francisco headquarters.
Lead a talented engineering team to design, build, and operate large-scale LLM serving and model deployment infrastructure that powers personalized recommendations at scale.
Anduril is hiring a Software Engineer, AI in Reston to build, optimize, and deploy real-world ML/LLM systems that power mission-critical defense and intelligence capabilities.
Lead the GenAI Platform engineering team at Abridge to design, deliver, and operate LLM workflows, agentic systems, and retrieval/evaluation infrastructure for clinical AI products.
Capital One is hiring a Senior Lead AI Engineer to design and productionize foundational LLM, inference, and agentic AI systems that are scalable, cost-efficient, and responsible.
Help shape GPU-accelerated inference and AI infrastructure as a Spring intern working on CUDA, models, and scalable training/inference systems in San Francisco.
NVIDIA is hiring a Senior Software Development Engineer to build and optimize TensorRT-LLM inference software that powers large-scale generative AI on GPUs.
Work on cutting-edge production AI systems at Unify, building agents, retrieval, and inference infrastructure to power the next generation of go-to-market products.
Work as a hands-on engineering intern building GPU-optimized AI infrastructure and inference systems with a San Francisco-based team.
Relace is hiring a hands-on Machine Learning Engineer to optimize GPU kernels, performance-tune large-scale ML systems, and productionize cutting-edge models from its SF FiDi office.
Palo Alto Networks is hiring a Principal Machine Learning Platform Engineer to architect and scale a high-performance ML inference platform for the Prisma AIRS AI security product.
Salary ranges: Below 50k: 0 | 50k–100k: 0 | Over 100k: 2