Browse 15 Inference Optimization jobs open now. Check out companies hiring, such as Cadence, Red Hat, and Genies, Inc., in Stockton, Boise City, and Fort Lauderdale.
Lead R&D efforts to develop and deploy advanced applied-math and statistical algorithms for circuit simulation, yield analysis, and optimization at Cadence.
Red Hat is hiring a Machine Learning Engineer to optimize LLM inference and GPU kernels using high-performance Python/C++ and GPU toolchains for scalable model serving.
Genies is hiring an ML Infra and Model Optimization Engineer to build and optimize scalable inference systems and production ML infrastructure for image and 3D generative models in a hybrid LA/SF role.
Conduct cutting-edge post-training LLM research and productionize novel architectures and alignment techniques to deliver high-quality, efficient custom models for customers.
Marvell is hiring an early-career AI Engineer to develop and optimize applied AI systems, RAG workflows, and hardware-accelerated inference for data center and AI infrastructure.
Argonne National Laboratory is seeking a Software Engineer to design and optimize scalable AI inference solutions on HPC resources and accelerators for scientific workflows.
Lead and grow Imprint's Data Science team to deliver rigorous experimentation, forecasting, and ML-driven insights that shape product and marketing strategy.
Work on LM Studio's runtime as an AI/ML Systems Engineer, building and optimizing on-device inference engines and integrations for local LLMs and related AI technologies.
Pax Historia seeks a founding ML systems engineer in San Francisco to build production-grade infrastructure, evaluations, and model tuning that make their AI-driven game both higher-quality and more affordable.
Lead production-grade forecasting, attribution, and experimentation for Eight Sleep's Growth team to directly influence inventory, marketing spend, and business outcomes.
Lead WHOOP’s Sensor Intelligence engineering efforts to build and ship optimized embedded ML algorithms that run reliably on wearable devices.
Build production-grade analytics systems at the intersection of data, finance, and AI while embedding with enterprise clients to turn fragmented spreadsheets into repeatable decision workflows.
Red Hat is hiring a PhD-level Machine Learning Systems Research Intern to advance model optimization and efficient inference techniques for open-source LLMs and vLLM.
A hands-on ML/AI Engineer role to architect and productionize hybrid ML and LLM-driven systems that extract structured workflow understanding from noisy enterprise data at scale.
ClarityPay is looking for a Senior Machine Learning Engineer to build and deploy reinforcement learning, bandit, and Bayesian optimization solutions that drive operational improvements in collections and offer optimization.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 2