GPU Serving Jobs

Browse 9 exciting jobs hiring in GPU Serving now. Check out companies hiring, such as NVIDIA, Attentive, and Coupang, in Huntsville, St. Louis, and Pittsburgh.

Senior Deep Learning Software Engineer, Inference

NVIDIA Hybrid US, CA, Santa Clara

Posted 4 hours ago

Customer-Centric, Mission Driven, Inclusive & Diverse, Rise from Within, Diversity of Opinions, Work/Life Harmony, Growth & Learning, Transparent & Candid

Medical Insurance, Paid Time-Off, Maternity Leave, Mental Health Resources, Equity, Child Care Stipend, Paternity Leave, WFH Reimbursements, Flex-Friendly, Dental Insurance, Vision Insurance, Life Insurance, Health Savings Account (HSA), Flexible Spending Account (FSA), 401K Matching, Military Leave

Senior engineer role to optimize and extend NVIDIA's GPU-accelerated inference stacks (vLLM, SGLang, FlashInfer) for LLMs and generative AI across datacenter and edge accelerators.

Senior Software Engineer, ML Platform

Attentive Hybrid San Francisco, CA

Posted 19 hours ago

Passion for Exploration, Dare to be Different, Customer-Centric, Diversity of Opinions, Inclusive & Diverse

Lead the development and operation of Attentive’s ML platform to enable high-velocity, reliable training and low-latency serving for production ML applications.

Senior Staff Machine Learning Engineer

Coupang Hybrid Mountain View, USA

Posted 2 days ago

Lead end-to-end, production-scale ML and LLM initiatives at Coupang to improve search, recommendations and generative product experiences.

Truck Driver - Class A Floater

Penske Truck Leasing Hybrid Langhorne, Pennsylvania

Customer Accounts Advisor

Aarons Corporate Retail Store Everett, Washington

Truck Driver - Local Class B - Penske Logistics

Penske Truck Leasing Hybrid Phoenix, Arizona

Software Engineer, Infra & ML Ops

Monarch Money Hybrid No location specified

Posted 2 days ago

Monarch is hiring a hands-on Infrastructure & MLOps Engineer to build and operate the scalable cloud and AI infrastructure that powers its personal finance platform.

Sr Principal Engineer Software (ATP Cloud Service)

Palo Alto Networks Hybrid Santa Clara, CA

Posted 7 days ago

Palo Alto Networks is hiring a Sr Principal Software Engineer to lead backend and model-serving infrastructure development for ATP Cloud services in Santa Clara, focusing on scalable, high-performance cloud-native systems.

Machine Learning Engineer-Model Serving Infrastructure

ByteDance Hybrid Seattle, WA, USA

Posted 16 days ago

Work on the core model-serving infrastructure at ByteDance to design and scale distributed inference systems that power ranking and recommendation across products.

Software Engineer, Machine Learning Infrastructure

David AI Hybrid San Francisco

Posted 20 days ago

Build and scale core ML infrastructure (data pipelines, training frameworks, and production model serving) to power David AI’s audio research and production products.