Browse 1 exciting jobs hiring in Speculative Decoding now. Check out companies hiring such as Red Hat in San Francisco, Greensboro, San Diego.
Red Hat is hiring a Machine Learning Engineer to optimize LLM inference and GPU kernels using high-performance Python/C++ and GPU toolchains for scalable model serving.