Browse 6 exciting TensorRT-LLM jobs hiring now. Check out companies hiring, such as Jobgether, Modular (CA), and Thomson Reuters, in Worcester, Huntsville, and Omaha.
Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.
Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.
Lead end-to-end development of large-scale AI and deep learning solutions at Thomson Reuters Labs, driving production-grade LLM, retrieval, and data-pipeline capabilities across legal and news products.
Lead partner enablement for Generative AI at NVIDIA by architecting and building production-grade agentic AI solutions and reference architectures using NVIDIA's full AI stack.
NVIDIA is hiring a Senior Software Development Engineer to build and optimize TensorRT-LLM inference software that powers large-scale generative AI on GPUs.
Palo Alto Networks is hiring a Principal Machine Learning Platform Engineer to architect and scale a high-performance ML inference platform for the Prisma AIRS AI security product.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 3