Browse 30 exciting jobs hiring in Model Serving now. Check out companies hiring such as Reddit, Nirvana Insurance, GenBio AI in Irving, Houston, Cleveland.
Lead Reddit’s Embeddings Platform team to architect and deliver scalable, production-ready foundational embeddings and ML infrastructure that power recommendations, discovery, and ML features across the company.
Build and ship production AI Agents and workflows as a founding engineer on a new AI business unit at a high-growth commercial insurance startup.
Lead and shape the infrastructure powering GenBio AI's large biological models, focusing on Kubernetes GPU orchestration, MLOps pipelines, security, and cross-team operational excellence.
Lead the design and deployment of production-scale personalization ML systems for a remote-first US company, shaping strategy, architecture, and technical direction.
Lead product strategy and execution for NVIDIA's AI infrastructure platform, driving roadmap, cross-functional alignment, and delivery of large-scale AI/ML and HPC solutions.
Build and operate scalable backend and ML-serving infrastructure for emotionally intelligent AI systems at an early-stage, high-growth company (remote, US).
Senior Software Engineer to join LinkedIn's AI Platform team to design and optimize large-scale training, feature-engineering, and serving infrastructure for LLMs and recommendation systems.
Lead the zero-to-one design and implementation of a high-throughput, low-latency LLM inference stack as an early engineering hire at an SF-based AI startup.
Dave is seeking a Machine Learning Platform Engineer II to develop and maintain scalable ML infrastructure that enables model development, serving, and monitoring for a large consumer finance platform.
Titan is hiring an ML Data Engineer to build and operate scalable data and model-serving infrastructure for a production AI platform focused on banking and financial services.
Senior Machine Learning Engineer on Plaid’s Data Foundation & AI team to design and scale production ML/AI systems that power fintech product experiences.
HackerOne seeks a Senior Software Applied AI Engineer to develop production AI agents and platform services that power next-generation security tooling for customers like Amazon, GitHub, and the U.S. Department of Defense.
Visa is hiring a Machine Learning Engineer Intern to build and deploy production ML pipelines and monitoring tools as part of its AI/ML Products & Platforms team for Summer 2026.
Lead development of scalable, high-performance GCP-based backend and model-serving infrastructure for the ATP Cloud team at Palo Alto Networks.
Senior Machine Learning Engineer to design, deploy, and scale production ML pipelines and infrastructure for recommendation and content-understanding systems at Bumble's Austin office.
Lead the design and production deployment of scalable ML pipelines and model-serving systems to power recommendations and content understanding for millions of Bumble users.
Lead the architecture and scaling of Bumble’s recommendation and content-understanding ML systems as a Staff Machine Learning Engineer based in Austin.
Lead architecture and build the ML platform that enables fast, reliable model training, serving, and agentic infrastructure for Attentive's AI product suite.
HackerOne is hiring a Staff Software Applied AI Engineer to architect and build production AI platform services and agentic security capabilities that power next-generation offensive security products.
Lawrence Livermore National Laboratory is hiring a Predictive AI Software Engineer to design, build, and maintain LLM-driven agent workflows, data orchestration, and MCP servers for the Bernie AI infrastructure-management program.
Lead Affirm’s centralized Machine Learning organization to define strategy, build talent and platforms, and deliver high-impact models that drive business outcomes across underwriting, fraud, servicing and personalization.
AirOps seeks a hands-on Lead Data Scientist to architect and deliver production NLP, search, and LLM systems that improve content discovery and business outcomes for leading brands.
Help architect and scale a production ML inference platform at Tamarind Bio to serve hundreds of biological models and support rapid customer growth.
Lead the architecture and build scalable, fault‑tolerant systems for Crusoe’s managed AI inference platform to serve LLMs at massive scale.
Specter is hiring an ML Infrastructure Engineer to design and scale training pipelines, optimized model serving, and continuous production workflows for real-time edge perception systems.
August is hiring a Founding AI Engineer to design and deploy production agent systems that automate legal workflows for mid-market law firms.
Lead the architecture and delivery of production-grade AI systems for cyber operations, building resilient agent orchestration, MCP serving infrastructure, and advanced prompt engineering patterns.
Sable is hiring an AI Engineer in San Francisco to develop and productionize multimodal deep-learning systems that power digital-human enterprise workers.
Produce and scale safe, cost-efficient LLM inference for global AI products as an ML Ops Engineer on a hybrid, high-impact team at Bjak.
Lead the design and delivery of production-scale LLM, knowledge-graph, and ML systems for Workday’s Evisort AI team to transform contract intelligence and enterprise document workflows.
Below 50k*
0
|
50k-100k*
1
|
Over 100k*
27
|