Browse 117 exciting jobs hiring in Cloud Observability now. Check out companies hiring such as Peraton, Crusoe, OpenAI in Honolulu, Phoenix, Providence.
Senior Product Manager responsible for the vision, roadmap, and execution of Datadog-focused Cloud Observability on AWS, partnering with engineering and stakeholders to deliver scalable, mission-critical solutions.
Lead the reliability, scaling, and observability of Crusoe's managed AI platform as a Staff SRE focused on serving and optimizing LLM workloads.
Build and scale backend systems and APIs at OpenAI to power high-throughput AI products like ChatGPT and the OpenAI API.
Lead the technical strategy and engineering delivery for Zocdoc’s booking platform, building scalable APIs and systems that improve the healthcare booking experience for patients and providers.
Experienced DevOps Engineer sought to own cloud-native infrastructure, CI/CD, and reliability for Jasper's AI platform in a remote-first, fast-moving environment.
Lead the design and hands-on implementation of ServiceNow's next-generation multi-cloud orchestration and automation platform as a Principal Software Engineer on the Platform Infrastructure team.
Target's Search Platform team is hiring an Engineer to design, build, and operate scalable search and indexing services that improve guest experiences across platforms.
Lead the architecture and security strategy for Oakland County’s enterprise infrastructure, guiding cloud modernization, IaC, and resilient system design to support mission-driven public services.
Weaviate seeks a Senior Software Engineer to architect and build a greenfield vector storage engine and optimize distributed, low-latency database systems.
ServiceNow is hiring a Senior Software Engineer to own and deliver scalable core platform capabilities, improve developer-facing APIs, and shape system architecture for high-scale distributed services.
Lead a remote engineering team across the US and Canada, shaping product and technical strategy while mentoring engineers to grow as product-minded builders.
Be the technical partner for enterprise customers, driving POCs, production deployments, and operational excellence for Feldera's self-hosted data platform.
Zocdoc is hiring an Engineering Manager to own and scale the Interop Platform — building enterprise-grade APIs, data pipelines, and observability to enable product growth and AI-enabled automation.
Experienced SRE sought to lead platform reliability, automation, and observability for cloud and on‑prem systems while mentoring engineering teams across the U.S.
A systems-minded DevOps/Infrastructure engineer is needed to build and own production deployment, observability, and security tooling for a fast-moving early-stage startup in New York City.
Work on next-generation engineer productivity platforms at NVIDIA, using LLMs and systems engineering to build scalable, AI-accelerated tools that boost chip-design productivity.
Lead Legion’s cloud platform engineering as a hands-on Director of Engineering, Platform—building a remote team, driving cloud architecture and delivery, and coding Java 50% of the time.
Senior engineering leader to build and scale Atlassian's data engineering practice and deliver data platforms and products that enable analytics, ML, and product decision-making.
Dash0 is hiring a Sales Enablement Specialist to build enablement programs and content that empower GTM teams to articulate product value, shorten ramp time, and drive revenue.
Lead the strategy and execution of observability and monitoring products to improve system reliability and operational efficiency across cloud and distributed systems.
Lead full-stack, cloud-native engineering initiatives at ServiceNow to build scalable, reliable, and AI-aware products while mentoring teams and improving engineering practices.
IonQ is hiring a Staff Site Reliability Engineer to strengthen platform reliability, build automation and observability for Kubernetes-based services, and scale infrastructure across cloud and on-prem environments.
Lead and grow a remote DevOps & Security engineering team to build secure, scalable cloud infrastructure and CI/CD practices for ChowNow's restaurant-focused SaaS platform.
Lead engineering for NVIDIA Mission Control to build and operate resilient, AI-enabled cluster automation for large-scale GPU and CPU infrastructure.
Lead the architecture and operation of scalable cloud and edge infrastructure for Voxel's AI-driven site intelligence platform as a Senior Software Engineer - DevOps.
Guidewire is hiring a Technical Manager, Threat Detection Engineering to lead and mentor a team focused on developing CI/CD-integrated detections, conducting threat hunting, and improving detection coverage across cloud and SaaS platforms.
Lead and scale the NIM Factory engineering organization to deliver reliable, performant, and secure AI inference services from day‑0 launches through enterprise hardening.
A US-remote Staff Software Engineer (SRE/DevEx) role to lead reliability, observability, and CI/CD automation across mission-critical platforms.
Datadog is hiring a Staff Software Engineer to lead development of Dashboards platform services and AI-powered features, driving cross-team technical strategy and product delivery.
Hayden AI is hiring a Backend Engineer to build and operate scalable cloud services and CI/CD infrastructure that power AI-enabled, production deployments.
Lead cross-functional engineering Pods at Tailscale to deliver networking features across L4–L7, DNS, and browser connectivity while mentoring engineers and shaping technical strategy in a fully remote role.
Lead a small, high-performing engineering team at Machinify to build scalable backend services and AI-enabled workflows that transform healthcare claims and payment automation.
Datadog is hiring a Boston-based Sales Engineer (Customer Success) to deliver technical demos, support POCs, and collaborate across sales, product, and customer teams to drive adoption and solve customer challenges.
Twilio seeks a senior strategic architect to define platform direction, design highly scalable distributed systems, and elevate technical excellence across the R&D organization.
Datadog seeks a Staff AI Engineer to architect and ship AI-driven notebook experiences and agentic workflows that elevate observability for customers.
Multi Media, LLC seeks a remote Site Reliability Engineer to increase platform reliability, automate operations, and optimize performance for a high-traffic live-streaming product.
Experienced engineering leader sought to set technical direction and scale Sydecar’s platform and engineering org through the next stage of growth.
Doppel is hiring a hands-on Infrastructure Software Engineer in San Francisco to build and operate scalable cloud systems and data pipelines powering its AI-driven social engineering defense platform.
Lead and scale Rerun's engineering organization to deliver reliable, customer-impacting data infrastructure for Physical AI while building managers, processes, and technical execution capacity.
Lead and grow a core AI cloud platform team at Lambda to deliver cluster lifecycle automation, governance capabilities, and enterprise-grade platform features with a focus on reliability and product-driven execution.
Build and productionize backend systems that translate foundation-model capabilities into robust, user-facing AI product features at a research-led Seattle HQ.
Lead the strategy and execution of cloud-based digital products for Allstate Identity Protection, driving roadmap decisions, cross-functional delivery, and measurable product improvements.
NetBox Labs is hiring a Senior Backend Software Engineer to expand and maintain the NetBox Cloud platform, focusing on REST APIs, platform microservices, and cloud infrastructure.
Mid-level DevOps Engineer supporting hybrid (on-prem + Azure) infrastructure, CI/CD pipelines, containers, and automation for Berkshire Hathaway Homestate Companies' Workers Compensation Division.
Lead the design and delivery of scalable, secure full-stack applications at Acquisition.com, driving technical excellence across React/Next.js, TypeScript, Node.js, cloud, and AI integrations.
Versana seeks an experienced Azure Cloud Architect to lead cloud architecture, identity management, observability, and DevOps practices for its real-time syndicated loan data platform.
Senior DevOps Engineer to lead GitLab-centered CI/CD, IaC, container, and cloud automation initiatives for client engagements at NextLink Labs.
Lead the modernization of Mambu’s data platform and architect scalable, secure data services that enable reporting, extraction, and AI-driven insights for a fast-growing fintech.
Lead the product vision and execution for Node Operator infrastructure at Chainlink Labs, building scalable tooling and architecture to support a global operator ecosystem.
Lead architecture and hands-on development of an AI-native enterprise platform as Principal Software Engineer at a stealth-mode startup based in Midtown Manhattan.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|