Browse 1 exciting jobs hiring in Model Parallel now. Check out companies hiring such as BentoML, USAA, MobilityWorks in Salt Lake City, Chicago, Honolulu.
BentoML seeks an Inference Optimization Engineer to accelerate LLM inference across GPUs and distributed serving stacks, reducing latency and GPU costs while contributing to open-source tooling.