Browse 13 Model Optimization jobs open now. Companies hiring include Fluency, Skydio, and NVIDIA, with roles in Greensboro, Baltimore, and Boston.
A hands-on ML/AI Engineer role to architect and productionize hybrid ML and LLM-driven systems that extract structured workflow understanding from noisy enterprise data at scale.
Skydio is hiring an Autonomy Engineer to develop high-performance deep learning infrastructure and edge inference systems that power real-time computer vision and autonomy.
NVIDIA seeks an entry-level Deep Learning Software Engineer to help optimize and ship GPU-accelerated inference software for LLMs and generative AI.
Lead the applied LLM systems effort at Plaud to design reasoning pipelines, productionize RAG and memory features, and optimize model inference for reliable, user-centered AI experiences.
Lead the design and implementation of compiler, runtime, and debugger integrations for PyTorch, TensorFlow, JAX, and MXNet on custom hardware to maximize AI model performance at Flux.
Samsara is hiring a Staff Machine Learning Engineer to build and optimize production-scale ML systems and edge AI solutions using petabyte-scale IoT and video data.
NVIDIA seeks a Principal Product Manager to lead AI training and post-training frameworks, building SDKs and tools that maximize large-scale model performance on NVIDIA GPUs.
Lead the strategy and delivery of distributed inference, LLM integrations, and on-device ML features at webAI to enable privacy-first, enterprise-grade AI on the edge.
Lead the architecture and execution of a high-throughput, low-latency ML and simulations platform that enables large-scale model training, inference, and simulation-driven product development.
Lead and grow an engineering team building large-scale LLM and conversational AI solutions that power intelligent enterprise virtual agents at ServiceNow.
MLabs is hiring a senior Machine Learning Engineer to architect and ship AI decisioning systems that power personalization, experimentation, and budget optimization across a large first-party customer dataset.
d-Matrix is hiring a Senior Staff ML Researcher to develop and implement algorithmic and numerical techniques that optimize LLM inference on next-generation DNN accelerators, in a hybrid role based at its Santa Clara headquarters.
Contribute to in-vehicle intelligence by building and deploying high-performance ML/DL models and MLOps pipelines for a leading automotive software platform.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 8