Browse 10 exciting jobs hiring in Triton now. Check out companies hiring such as Red Hat, Genies, Inc., Virtue Group in Atlanta, Lubbock, Stockton.
Red Hat is hiring a Machine Learning Engineer to optimize LLM inference and GPU kernels using high-performance Python/C++ and GPU toolchains for scalable model serving.
Genies is hiring an ML Infra and Model Optimization Engineer to build and optimize scalable inference systems and production ML infrastructure for image and 3D generative models in a hybrid LA/SF role.
Virtue AI seeks an Inference Engineer to design and operate high-performance, production-ready inference systems for LLMs and embeddings in San Francisco.
Alignerr is hiring a Senior C++ Full-Stack Engineer to develop and optimize high-performance C++ systems and end-to-end tooling for AI data pipelines and evaluation workflows on a remote, part-time contract.
Virtue AI is hiring a Cloud/Platform Engineer to build one-click deployments and production-grade cloud infrastructure for secure, GPU-backed AI systems across AWS and GCP.
Work with Rackspace customers to deploy, optimize, and operationalize LLM/ML model-serving platforms in private and hybrid cloud environments to meet latency, throughput, security, and cost SLAs.
NVIDIA is hiring a Senior Deep Learning Software Engineer to develop compiler, optimization, and deployment tooling (e.g., Torch-TRT) so advanced models run efficiently on NVIDIA hardware in autonomous driving systems.
NVIDIA is hiring a Senior Deep Learning Software Engineer to optimize PyTorch inference performance using TensorRT across NVIDIA GPUs.
Lead architecture and research for a high-scale ML inference and security platform at Palo Alto Networks' Prisma AIRS, driving MLOps standards and LLM-focused product innovations.
Senior Software Engineer to design and implement high-performance compiler and runtime optimization frameworks across the CUDA/AI stack at NVIDIA.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
10
|