Inference Engineer Jobs

Browse 28 Inference Engineer jobs hiring now. Companies hiring include NVIDIA, Rackspace, and Palo Alto Networks, with roles in New Orleans, Montgomery, and Dallas.

Senior AI Engineer, NeMo Retriever - Model Optimization and MLOps

NVIDIA Hybrid US, CA, Santa Clara

Posted 21 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Work on NeMo Retriever to optimize and containerize LLM/MLLM models and build MLOps pipelines that deliver low-latency, production-grade inference for retrieval-augmented AI systems.

AI Model Serving Specialist

Rackspace Hybrid United States - Remote

Posted yesterday

Work with Rackspace customers to deploy, optimize, and operationalize LLM/ML model-serving platforms in private and hybrid cloud environments to meet latency, throughput, security, and cost SLAs.

Senior Staff AI Engineer

Palo Alto Networks Hybrid Santa Clara, CA

Posted 2 days ago

Palo Alto Networks is hiring a Senior Staff AI Engineer to lead design and delivery of enterprise-grade AI/ML solutions and platform capabilities across the organization.

ML/AI Engineer

Fluency Hybrid San Francisco

Posted 3 days ago

A hands-on ML/AI Engineer role to architect and productionize hybrid ML and LLM-driven systems that extract structured workflow understanding from noisy enterprise data at scale.

Senior Machine Learning Engineer

ClarityPay Hybrid New York City

Posted 3 days ago

ClarityPay is looking for a Senior Machine Learning Engineer to build and deploy reinforcement learning, bandit, and Bayesian optimization solutions that drive operational improvements in collections and offer optimization.

Machine Learning Engineer - Deployments Team

Roboflow Hybrid No location specified

Posted 5 days ago

Experienced ML engineer needed to lead deployment, optimization, and scaling of computer vision models across cloud and edge environments for a fast-growing computer-vision platform.

Member of Technical Staff Backend

Fuku Hybrid No location specified

Posted 6 days ago

High-impact backend engineer role building production ML/agent infrastructure and distributed systems to power AI-driven compliance for banks and fintechs.

Principal Data Engineer – ML Platforms

Altarum Hybrid No location specified

Posted 7 days ago

Lead the design and delivery of secure, scalable, cloud-agnostic ML platform infrastructure and pipelines to enable reliable, explainable AI and analytics for public health at Altarum.

Software Engineer, ML

NVIDIA Hybrid US, CA, Santa Clara

Posted 7 days ago

NVIDIA is hiring a Software Engineer, ML to optimize state-of-the-art ML training and inference across GPU hardware and software stacks.

Machine Learning Engineer - Search, Ranking & Personalization

Fuku Hybrid No location specified

Posted 8 days ago

High-impact ML Engineer role building search, ranking, and personalization systems for a fast-growing consumer shopping platform with strong user retention and competitive equity.

Staff Software Engineer (Backend)

Awesome Motive Hybrid New York City

Posted 9 days ago

Lead the architecture and engineering of Amigo's backend platform to scale real-time inference, multi-LLM orchestration, and secure EHR integrations across millions of conversations.

Founding Machine Learning Engineer

Bjak Hybrid New York

Posted 14 days ago

Lead the ML stack as a founding Machine Learning Engineer at a stealth, self-funded AI group, defining models, training pipelines, and scalable inference for a global consumer product.

Founding AI/ML Research Engineer

Bjak Hybrid New York

Posted 14 days ago

A founding AI/ML research engineer role to design and build core model, data, and inference systems for a stealth, high-impact consumer AI product backed by a profitable US$2B group.

Founding AI Engineer

Bjak Hybrid New York

Posted 14 days ago

Lead the technical design and implementation of A1’s foundational LLM systems—training pipelines, inference stacks, and deployment architecture—for a global consumer AI product.

Member of Technical Staff - Research Scientist Intern

Virtue Group Hybrid San Francisco

Posted 14 days ago

Virtue AI seeks a Research Scientist Intern in San Francisco to develop and integrate cutting-edge agent and LLM security techniques, including red-teaming, guardrail models, and efficient inference methods.

Staff Machine Learning Platform Engineer

Faire Hybrid San Francisco, CA

Posted 15 days ago

Lead the architecture and delivery of Faire's machine-learning platform, building scalable feature stores, model serving, and inference infrastructure to power production ML across the marketplace.

Principal Engineer Software (ATP Cloud Service)

Palo Alto Networks Hybrid Santa Clara, CA

Posted 18 days ago

Lead design and implementation of cloud-native, high-performance backend services and AI model-serving infrastructure for Palo Alto Networks' ATP Cloud team.

Member of Technical Staff, Synthetic Data

Cohere Hybrid Toronto

Posted 21 days ago

Startup Mindset

Collaboration over Competition

Growth & Learning

Inclusive & Diverse

Work at the intersection of research and engineering to build scalable synthetic data pipelines that directly improve the quality and efficiency of Cohere's language models.

Pre-Sales Engineer, Cloud and AI

Gcore Hybrid US, United States

Posted 21 days ago

Gcore is hiring a seasoned Pre-Sales Engineer (Cloud & AI) to lead technical engagements, solution design, and customer success for GPU and cloud infrastructure across the Americas.

Senior Machine Learning Platform Engineer: EVENT (Remote - US)

Jobgether Hybrid No location specified

Posted 24 days ago

Senior Machine Learning Platform Engineer to design and optimize feature pipelines, distributed training, and low-latency inference systems for a remote US team building production ML infrastructure.

Cloud Inference Engineer

Modular (CA) Hybrid United States / Canada

Posted 26 days ago

Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.

Backend Engineer, Cloud Inference

Modular (CA) Hybrid United States / Canada

Posted 26 days ago

Help design and operate scalable, multi-cloud LLM inference infrastructure at Modular as a Backend Engineer focused on distributed systems and ML inference.

Associate Director, Creative Operations Lead

Understood Hybrid Remote

Posted 26 days ago

Understood is hiring an Associate Director, Creative Operations Lead to drive data-informed creative and growth analytics, using modeling and experimentation to increase engagement and retention.

Senior Machine Learning Engineer

Samsara Hybrid Remote - US

Posted 28 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Customer-Centric

Social Impact Driven

Rapid Growth

Maternity Leave

Paternity Leave

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Paid Holidays

Paid Time-Off

Samsara is hiring a Senior Machine Learning Engineer to build scalable ML infrastructure and end-to-end ML applications that power real-world IoT products and improve operational safety and efficiency.

ML Operations, Full Stack (New Grad)

Abridge Hybrid San Francisco

Posted 28 days ago

Early-career ML Operations / Full Stack engineer to help design, deploy, and optimize scalable model serving and training infrastructure for Abridge’s AI-driven healthcare platform.

Software Engineer - Agent Cloud

Rubrik Hybrid Seattle, WA

Posted 29 days ago

Transparent & Candid

Collaboration over Competition

Inclusive & Diverse

Growth & Learning

Build secure, scalable infrastructure and governance systems for enterprise AI agents as a Software Engineer on Rubrik's Agent Cloud team.

Software Engineer, Machine Learning Platform: EVENT

Awesome Motive Hybrid Remote - USA

Posted 30 days ago

Coinbase is hiring a Machine Learning Platform Engineer to design and operate low-latency inference, streaming pipelines, and distributed training infrastructure that powers fraud detection, personalization, and blockchain analysis.

ML Engineer

Phare Health Hybrid No location specified

Posted 30 days ago

Phare (part of R1) is hiring ML Engineers to build the internal training, benchmarking, and deployment infrastructure that turns research models into production-ready systems for healthcare revenue operations.
