Browse 26 exciting jobs hiring in Model Inference now. Check out companies hiring such as Jobgether, webAI, Nexxa.AI in Fort Lauderdale, Madison, North Las Vegas.
Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.
Lead technical product strategy and execution for webAI’s distributed inference and on-device LLM platform, partnering closely with engineering and research to deliver enterprise-grade AI solutions.
Lead the design and production deployment of generative and multimodal computer vision systems at Nexxa.AI, translating ambiguous customer needs into robust, scalable AI solutions.
Samsara is hiring a Senior Machine Learning Engineer to build scalable ML infrastructure and end-to-end ML applications that power real-world IoT products and improve operational safety and efficiency.
Early-career ML Operations / Full Stack engineer to help design, deploy, and optimize scalable model serving and training infrastructure for Abridge’s AI-driven healthcare platform.
Lead the next generation of AI-driven ranking and recommendation systems for LinkedIn's Feed to improve relevance, personalization, and member engagement at massive scale.
Lead the development and deployment of high-performance, real-time computer vision and multi-sensor AI for smart home devices at TP-Link Systems Inc.
d-Matrix is hiring a Senior Staff ML Researcher to develop and implement algorithmic and numerical techniques that optimize LLM inference on next-generation DNN accelerators at its Santa Clara hybrid headquarters.
Phare (part of R1) is hiring ML Engineers to build the internal training, benchmarking, and deployment infrastructure that turns research models into production-ready systems for healthcare revenue operations.
Senior-level data scientist/ML engineer role focused on designing, deploying, and optimizing machine learning models for Samsung Ads to drive targeting, performance, and revenue improvements.
Contribute to in-vehicle intelligence by building and deploying high-performance ML/DL models and MLOps pipelines for a leading automotive software platform.
Lead Developer Relations on the West Coast to grow Featherless’s open-model community, create technical demos and content, and represent the platform at events and hackathons.
Kilo seeks a technically fluent Senior Partnerships Manager to build and scale strategic relationships with model providers, infra partners, and devtool platforms for its open-source AI coding agent.
DICK'S Sporting Goods is hiring a Machine Learning Engineer II to productionalize causal inference and Bayesian models and build scalable ML pipelines and APIs that deliver real-time business value.
TWG Global is hiring a Senior MAQR Data Scientist to build and productionize advanced ML, forecasting, and quantitative models that drive measurable business impact across its financial and enterprise portfolio.
Lead a talented engineering team to design, build, and operate large-scale LLM serving and model deployment infrastructure that powers personalized recommendations at scale.
Anduril is hiring a Software Engineer, AI in Reston to build, optimize, and deploy real-world ML/LLM systems that power mission-critical defense and intelligence capabilities.
Work on pricing at Opendoor to build predictive models, run experiments, and translate analytic insights into product and executive decisions that shape the housing marketplace.
An engineer-focused, customer-facing role to architect, implement, and deploy production AI inference solutions on Baseten’s platform with hands-on coding and cross-functional ownership.
Yobi is hiring an ML Engineer (Inference/Serving) to build and operate low-latency, production-grade model serving systems that turn behavioral models into reliable, observable services.
Capital One is hiring a Senior Lead AI Engineer to design and productionize foundational LLM, inference, and agentic AI systems that are scalable, cost-efficient, and responsible.
WGU is hiring a Decision Scientist to develop and deploy decision intelligence models that drive personalized student recommendations and improve completion outcomes.
NVIDIA is hiring a Senior Software Engineer to build and optimize GPU-accelerated AI systems and deploy fast, low-power deep learning solutions on embedded and cloud platforms for Maxine and related products.
NVIDIA is looking for a Senior Software Engineer (Deep Learning) to build and optimize real-time video AI models and inference stacks for the Maxine and Broadcast platforms.
Faire is hiring a Senior Data Scientist / Machine Learning Engineer to lead ML and LLM-driven initiatives that improve listing quality and product discovery for retailers on its marketplace.
Ensono seeks a hands-on Machine Learning Engineer to productionize models, build APIs/services, and scale AI-driven operations across enterprise systems like ServiceNow and Snowflake.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|