Browse 14 exciting jobs hiring in Tensorrt now. Check out companies hiring such as Zoox, NVIDIA, Bjak in Madison, Scottsdale, Richmond.
Senior Software Engineer (C++/GPU Performance) to optimize GPU/CPU compute, build instrumentation and benchmarking frameworks, and collaborate with component teams to meet latency and power targets for a next-generation autonomous vehicle platform.
NVIDIA is hiring a Senior Solutions Architect to architect and deliver AI-accelerated CDN and telco solutions that integrate GPUs, edge inference, Kubernetes, and CDN platforms for low-latency, scalable deployments.
Lead the technical design and implementation of A1’s foundational LLM systems—training pipelines, inference stacks, and deployment architecture—for a global consumer AI product.
Lead the design, training, and production deployment of ASR, TTS, and Speech LLM systems at OutcomesAI to power HIPAA-compliant voice agents in clinical settings.
Work at the intersection of research and engineering to build scalable synthetic data pipelines that directly improve the quality and efficiency of Cohere's language models.
Gcore is hiring a seasoned Pre-Sales Engineer (Cloud & AI) to lead technical engagements, solution design, and customer success for GPU and cloud infrastructure across the Americas.
Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.
Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.
LlamaIndex is seeking a Multimodal AI Engineer to develop and productionize vision-language and document-understanding models that power large-scale document parsing and RAG applications.
Sciforium seeks a backend software engineer experienced in C++ and Python to build high-performance GPU-level kernels and scalable backend services for its AI model serving platform.
Lead the design and deployment of real-time perception and sensor-fusion models that give autonomous tractors robust, production-ready situational awareness in rugged agricultural environments.
Lead Zoox's Perception organization to build state-of-the-art multi-modal 3D environment models for autonomous urban vehicles.
Lead end-to-end development of large-scale AI and deep learning solutions at Thomson Reuters Labs, driving production-grade LLM, retrieval, and data-pipeline capabilities across legal and news products.
Lead partner enablement for Generative AI at NVIDIA by architecting and building production-grade agentic AI solutions and reference architectures using NVIDIA's full AI stack.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
12
|