Browse 19 exciting GPU Inference jobs hiring now. Check out hiring companies such as Ataraxis AI, Awesome Motive, and USAA in Buffalo, Madison, and Irving.
Contribute to cutting-edge AI research in oncology by building, optimizing, and deploying scalable machine learning models and evaluation frameworks at Ataraxis AI.
Tamarind Bio is hiring an AI/LLM Engineer in San Francisco to build scalable, production-grade workflows and enhance an ML copilot for computational biology.
Help architect and scale a production ML inference platform at Tamarind Bio to serve hundreds of biological models and support rapid customer growth.
NVIDIA is seeking a seasoned Technical Program Manager to lead Deep Learning Inference programs, coordinating cross-functional engineering teams to deliver scalable AI software and hardware integrations.
Lead technical marketing for NVIDIA GPU and rack-scale systems, communicating architecture, performance, and deployment value to hyperscalers, OEMs, and data center operators.
NVIDIA seeks a Senior AI Software Engineer to extend Megatron Core and NeMo frameworks through distributed training innovations, performance tuning, and scalable tooling for large-scale LLM and multimodal model workflows.
Lead product strategy and go-to-market for NVIDIA's AI Infrastructure, focusing on inference software, Kubernetes integrations, and customer-driven AI Factory solutions.
Lead the design and optimization of high-performance deep learning inference software on NVIDIA GPUs as a Senior Software Engineer on the TensorRT team.
Gimlet Labs is hiring a Software Engineer (AI Performance) to drive model and GPU-level performance improvements for production-scale inference in San Francisco.
BentoML seeks an Inference Optimization Engineer to accelerate LLM inference across GPUs and distributed serving stacks, reducing latency and GPU costs while contributing to open-source tooling.
Bjak seeks an MLOps Engineer to deploy and scale open-source LLMs in production, optimizing for cost, latency, and reliability while working in a flexible hybrid model.
NVIDIA is seeking an experienced Embedded Field Applications Engineer to support customers building AI-enabled embedded systems on the Jetson platform across the NALA region.
Contribute to Adobe Firefly’s GenAI Services by building optimized inference pipelines, integrating generative models into flagship products, and developing scalable ML systems for production.
Boson AI seeks an experienced research engineer to optimize training and inference pipelines on GPU clusters using CUDA/Triton, PyTorch, and distributed optimization techniques.
Lead and scale the NIM Factory engineering organization to deliver reliable, performant, and secure AI inference services from day‑0 launches through enterprise hardening.
Lead research and engineering-driven development of GenAI conversational assistants, guiding a cross-functional team to fine-tune, optimize, and deploy LLM-powered features that improve customer digital experiences.
Lead enterprise sales for a public AI cloud provider by driving adoption of AI Studio’s GPU-accelerated infrastructure and GenAI services across large customers.
Work with a top-tier research team in Seattle to optimize inference pipelines for large foundation models, improving latency, throughput, and efficiency at scale.
An experienced systems and ML-inference engineer is needed to lead development of low-latency, high-throughput inference pipelines spanning on-device and cluster deployments.