Browse 31 Serving jobs hiring now. Companies hiring include Vercept, Jobgether, and Middesk, with roles in San Jose, Tulsa, and Cleveland.
Work with a top-tier research team in Seattle to optimize inference pipelines for large foundation models, improving latency, throughput, and efficiency at scale.
Lead design and delivery of scalable, secure AI/ML and data platforms that enable analytics and model-driven decision-making across the organization.
Lead Middesk’s data platform and machine learning infrastructure team to scale data acquisition, operationalize ML, and build AI-driven entity resolution that powers our identity verification products.
Join a fast-growing, venture-backed GovCon SaaS startup as a Senior ML/AI Engineer to own production classification and recommendation systems and integrate LLM-driven capabilities into core product workflows.
Senior engineering role to design and lead scalable, secure AI and data platforms that enable analytics, ML/LLM workflows, and data-driven decisions across a distributed, remote-first US team.
Senior engineer role to optimize and extend NVIDIA's GPU-accelerated inference stacks (vLLM, SGLang, FlashInfer) for LLMs and generative AI across datacenter and edge accelerators.
Lead the architecture and delivery of scalable, secure AI and data platforms at Omada Health, enabling analytics, ML, and data-driven decision-making across the company in a remote US Staff Software Engineer role.
Lead the development and operation of Attentive’s ML platform to enable high-velocity, reliable training and low-latency serving for production ML applications.
Lead Airbnb’s Media Ingestion & Serving engineering team to build and operate scalable, highly available media infrastructure that powers immersive experiences across the platform.
Lead account-driven growth for Eyeota data partnerships, improving data quality and commercial outcomes while coordinating onboarding, operations, and sales expansion.
Lead end-to-end, production-scale ML and LLM initiatives at Coupang to improve search, recommendations and generative product experiences.
Monarch is hiring a hands-on Infrastructure & MLOps Engineer to build and operate scalable cloud and AI infrastructure that powers its personal finance platform.
Informa TechTarget is hiring an Ad Operations Coordinator to execute, optimize, and report on digital advertising campaigns for leading technology brands.
Lead the design and delivery of scalable, production ML systems at Scribd to power personalization, recommendations, and generative AI across millions of users.
Lead Fluent’s AdFlow platform operations to drive onboarding, QA, campaign optimization, and scalable processes that maximize campaign performance and revenue.
Palo Alto Networks is hiring a Sr Principal Software Engineer to lead backend and model-serving infrastructure development for ATP Cloud services in Santa Clara, focusing on scalable, high-performance cloud-native systems.
Tinder is hiring a Senior Software Engineer to lead design and implementation of scalable machine learning infrastructure and LLM systems supporting experiments across massive datasets.
Lead the design and production deployment of ML models that transform accounts receivable workflows at a fast-growing fintech startup backed by top-tier investors.
PointClickCare seeks an experienced Principal AI Engineer to lead architecture and delivery of agentic AI systems that drive safe, scalable AI adoption across its healthcare platform.
Socure seeks a Senior Software Engineer to build and maintain AWS-native, low-latency ML platform services that enable safe, reliable model deployment and serving at scale.
Serve legal documents across Jonesboro as an independent contractor for ABC Legal Services using an industry-leading mobile app to claim and manage jobs.
TwelveLabs is hiring a Software Engineer, Machine Learning to build fault-tolerant, low-latency ML services and production APIs for cutting-edge multimodal video understanding systems.
Gartner is hiring a senior AI infrastructure leader to produce influential research and advise C-level and infrastructure leaders on building scalable, secure AI platforms across cloud, on-prem and edge environments.
Lead research on LLM training and inference at Lila Sciences to advance scientific applications of large language models.
Work on the core model-serving infrastructure at ByteDance to design and scale distributed inference systems that power ranking and recommendation across products.
Machine Learning Engineer II to build and optimize large-scale ML pipelines, platform services, and model integrations that power personalization and generative features across Scribd's products.
Senior technical leader needed to architect and drive Quizlet’s AI & data platform, delivering scalable MLOps/DataOps infrastructure that powers personalization and generative features.
Serve legal documents around Flagstaff for ABC Legal Services on a flexible schedule with competitive pay and a $3,000 sign-on bonus.
Build and scale core ML infrastructure (data pipelines, training frameworks, and production model serving) to power David AI’s audio research and production products.
PayPal Ads is hiring a Distinguished Software Engineer to lead ad-technology strategy and build scalable ML-driven personalization and real-time systems across PayPal’s commerce ecosystem.
Build and optimize high-performance inference infrastructure for large foundation models at a fast-moving, well-funded AI startup in Menlo Park.