Browse 10 exciting jobs hiring in Cloud Inference now. Check out companies hiring such as Gcore, Jobgether, Modular (CA) in San Antonio, Anaheim, Newport News.
Gcore is hiring a seasoned Pre-Sales Engineer (Cloud & AI) to lead technical engagements, solution design, and customer success for GPU and cloud infrastructure across the Americas.
Lead the product direction for large-scale ML inference infrastructure, driving roadmap, customer-facing technical decisions, and delivery of reliable, high-throughput model serving solutions for a U.S.-remote team.
Lead development of high-performance, distributed LLM inference systems at Modular to enable fast, scalable, production-grade AI deployments.
Help design and operate scalable, multi-cloud LLM inference infrastructure at Modular as a Backend Engineer focused on distributed systems and ML inference.
Build secure, scalable infrastructure and governance systems for enterprise AI agents as a Software Engineer on Rubrik's Agent Cloud team.
Kilo seeks a technically fluent Senior Partnerships Manager to build and scale strategic relationships with model providers, infra partners, and devtool platforms for its open-source AI coding agent.
Lead the Dynamo engineering team at NVIDIA to design, build, and operationalize high-performance, fault-tolerant LLM inference and GenAI serving infrastructure.
Work on the Platform Engineering team to design, build, and operate the multi-cloud platform and core systems that run Modular's AI inference services at scale.
Lead a player‑coach team at Baseten to ensure reliable, high‑performance AI inference infrastructure for enterprise ML workloads.
Adobe is seeking 2026 Software Engineer Interns to design, develop, test, and deploy scalable services and features for Creative Cloud, Document Cloud, Experience Cloud, and Firefly in a co-located hybrid internship.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|