Browse 50 exciting jobs hiring in Site Reliability Engineer now. Check out companies hiring such as Jobgether, ThoughtSpot, Veeva Systems in Boise City, Jacksonville, Port St. Lucie.
Experienced DevOps engineers are sought to architect and operate scalable cloud infrastructure, automate delivery pipelines, and elevate security and observability across a distributed platform.
Experienced SRE needed to be a primary technical partner for customers, drive reliability and observability across cloud infrastructure, and lead incident management and automation efforts.
Senior Software Engineer - Infrastructure to design and operate scalable, multi-region AWS platform tooling and immutable infrastructure for Veeva's Vault CRM.
Provable is hiring a Senior Infrastructure Engineer to design, automate, and operate GCP and GKE infrastructure for a privacy-first Web3 platform.
Experienced principal-level software engineer to design, build, and operate high-quality developer tooling and resilient infrastructure for Palo Alto Networks' Cortex platform in Santa Clara.
Lead the automation and operational lifecycle of hyper-scale production networks at ServiceNow, driving reliability through code, IaC, and robust incident response for federal and public sector environments.
Peraton is hiring an Azure Engineer to lead incident response, monitoring, and optimization for a multi-tenant Azure GovCloud environment while ensuring security and FedRAMP compliance.
ConductorOne is hiring a Site Reliability Engineer to build and run scalable, automated, and observable infrastructure that keeps their identity governance platform resilient and performant.
Anysource seeks a Staff Site Reliability Engineer to lead end-to-end enterprise deployments and run scalable, secure production infrastructure across Kubernetes and AWS.
Zapier is hiring an SRE to strengthen observability, incident response, and platform reliability across its cloud-native automation platform.
d-Matrix is hiring a contract Manufacturing Infrastructure Engineer in Santa Clara to build and maintain resilient Linux, PostgreSQL, and hybrid/cloud infrastructure for production manufacturing systems.
Experienced Site Reliability Engineer needed to drive reliability, automation, and cloud infrastructure improvements for Patreon's creator platform in a remote-capable role with optional NY or SF office attendance.
Experienced Software Engineer II needed to develop Python microservices, automate infrastructure with Terraform, and maintain CI/CD pipelines in a cloud-native Kubernetes environment.
Experienced SRE needed to architect and run scalable, secure AWS infrastructure while driving observability, automation, and platform reliability across engineering teams.
Developer Infrastructure Engineer role focused on building scalable cloud-native developer tooling and infrastructure to improve engineering productivity and reliability.
Experienced Cloud Infrastructure Engineer wanted to maintain AWS EKS clusters, build observability tooling, coordinate incident response, and improve SRE processes for Tinder's Resiliency team.
Lead SRE efforts for an enterprise SaaS platform, focusing on automation, observability, and scalable Azure infrastructure to maintain high availability and operational excellence.
Senior Site Reliability Engineer to design, automate, and scale WRITER’s cloud infrastructure to deliver reliable, secure, and high-performance services to enterprise customers.
Lead engineering excellence for LinkedIn's PSM and AI/product platforms, driving system health, observability, and cross-functional initiatives that improve reliability and scalability.
Help OnePay scale and secure its cloud platform by building observability, automation, and resilient infrastructure as a Site Reliability Engineer.
Lead reliability and automation for enterprise identity and access at NVIDIA, delivering zero trust solutions across cloud and hybrid infrastructures.
ServiceRocket seeks an experienced Site Reliability Engineer to remotely manage, optimize, and scale enterprise Jira and Confluence Data Center deployments for US-based clients on a contractor basis.
Lead the design and operationalization of zero trust identity and access systems at NVIDIA, scaling secure, automated solutions across cloud and on-prem environments.
Console is seeking a Platform Engineer in San Francisco to architect and operate enterprise-grade cloud infrastructure, lead devops initiatives, and build scalable self-hostable and zero-downtime deployment systems.
Senior Corporate SRE role responsible for enterprise identity, monitoring, networking, automation, and cross-functional infrastructure initiatives at Lucid Software.
Senior infrastructure engineer role focused on designing and operating Scribd's AWS-centric platform and core services to improve reliability, scalability, and observability.
Build and harden the infrastructure that powers widely used AI systems as a Software Engineer on OpenAI's Applied Infrastructure team focused on reliability, scalability, and performance.
Support and improve large-scale ad-serving systems at FreeWheel by developing, debugging, and automating operational processes for high-profile live events.
Senior Site Reliability Engineer to design, automate, and maintain a highly available cloud platform while improving observability, performance, and developer experience.
Lead the design, operation, and automation of Crusoe's Kubernetes-on-bare-metal platform to deliver performant, reliable infrastructure for large-scale AI workloads.
Experienced SRE with distributed systems and LLM experience needed to design and operate scalable, reliable managed AI services for a mission-driven, sustainability-focused AI infrastructure company.
NBCUniversal is hiring a Site Reliability Engineer to maintain and evolve live channel distribution and cloud streaming systems for its broadcast and OTT services.
Help ensure Okta's Workforce Identity Cloud is secure, highly available, and automated by designing, running, and improving production infrastructure as an Associate Site Reliability Engineer.
Become a core platform engineer at CollegeVine, helping build the scalable cloud, CI/CD, and observability systems that power our AI platform for higher education.
Experienced SRE sought to lead platform reliability, automation, and observability for cloud and on‑prem systems while mentoring engineering teams across the U.S.
IonQ is hiring a Staff Site Reliability Engineer to strengthen platform reliability, build automation and observability for Kubernetes-based services, and scale infrastructure across cloud and on-prem environments.
Lead Blinq's reliability strategy and architecture as a Staff Site Reliability Engineer, shaping infrastructure, observability, and on-call culture at a high-growth company.
A US-remote Staff Software Engineer (SRE/DevEx) role to lead reliability, observability, and CI/CD automation across mission-critical platforms.
Viant is hiring a Staff Cloud Reliability Engineer to lead architecture and reliability initiatives across AWS and GCP, driving automation, scalability, and operational excellence.
Multi Media, LLC seeks a remote Site Reliability Engineer to increase platform reliability, automate operations, and optimize performance for a high-traffic live-streaming product.
MNTN is hiring a US-remote DevOps Engineer to manage GCP-based infrastructure, container orchestration, and CI/CD automation to boost system availability and developer productivity.
Imubit is hiring a remote Site Reliability Engineer in Texas to own and improve cloud infrastructure that powers industrial AI applications.
Senior SRE role at Arista to operate and scale the global CloudVision platform, focusing on automation, observability, and reliability across cloud-native infrastructure.
bem is hiring a Platform Engineer to architect and operate multi-cloud data and compute infrastructure for a high-growth AI platform used by enterprise customers.
Alphatec Spine seeks a Senior Site Reliability Engineer to improve uptime, automation, and observability for its Informatix cloud platform.
Veeam is hiring a Staff Site Reliability Engineer to lead SRE practices and architecture for the Veeam Data Cloud, driving reliability, observability, and operational excellence across global teams.
Runloop is hiring a Site Reliability Engineer to ensure the reliability, scalability, and security of our sandbox platform powering AI development.
Experienced Staff AWS SRE to lead scalability, automation, and reliability across a rapidly growing cloud platform serving enterprise search workloads.
Commify is looking for a Site Reliability Engineer to drive operational excellence and reliability for its Azure-based, high-throughput messaging platforms through automation, monitoring and infrastructure-as-code.
Experienced Site Reliability Engineer to lead reliability, observability, automation, and DoD-aligned cybersecurity for mission-critical systems in a remote role.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
59
|