Browse 33 exciting jobs hiring in Site Reliability Engineer now. Check out companies hiring such as Palo Alto Networks, Ascend Learning, Jobgether in Overland Park, Toledo, Minneapolis.
Lead the design and operation of FedRAMP-compliant cloud infrastructure as a Principal SRE at Palo Alto Networks, driving automation, reliability, and security across AWS and GCP environments.
Ascend Learning is hiring a Summer 2026 Software Engineer Intern (SRE) to help build and maintain containerized apps and automated CI/CD pipelines from their Leawood, KS hybrid office.
Experienced cloud-focused Senior Software Engineer wanted to build and operate scalable infrastructure and developer tools across AWS, Kubernetes, and Cloudflare for enterprise platforms.
Lead the architecture and operation of NVIDIA's global observability platform to ensure reliable, high-performance telemetry for large-scale AI and data systems.
Lead DevOps Engineer needed to architect and modernize CI/CD and cloud infrastructure for large-scale enterprise applications in Dallas, TX.
Kalshi is hiring a Site Reliability Engineer to strengthen observability, automate operations, and scale reliable production services for its fast-growing prediction markets platform.
WEX seeks an experienced Senior Staff SRE to define and execute enterprise reliability strategy, build resilient systems, and lead cross-functional initiatives that improve scale, observability, and operational excellence.
SpaceX Starshield is hiring a Senior Site Reliability Engineer to build and operate secure, highly available infrastructure supporting national-security satellite and communications systems.
Senior infrastructure engineer needed to drive resiliency, observability, and scalable real-time systems for Orb's billing platform in a hybrid San Francisco office environment.
NBCUniversal is hiring a Site Reliability Engineer to build, operate, and enhance monitoring and control systems for its IP video distribution and on-air broadcast environments.
Experienced SRE leader needed to architect, automate, and operate cloud-native infrastructure to deliver reliable, scalable services across regulated environments.
Build and scale the compute and infrastructure that powers Chai Discovery's next-generation AI drug design platform as a Software Engineer, Infrastructure.
Senior SRE leader needed to shape reliability practices, mentor engineers, and deliver resilient, scalable cloud infrastructure for a high‑throughput fintech platform.
Lead Peacock's SRE and DevSecOps efforts as Manager, guiding cloud architecture and engineering teams to deliver secure, scalable streaming services for millions of users.
Senior Software Engineer - Reliability (remote, CA) to help build foundational SRE practices, observability, and infrastructure automation for secure, compliant cloud production systems.
Help operate and scale a high-performance GPU cluster used by cutting-edge ML research and production teams as a Senior Site Reliability Engineer.
Trunk is hiring a Forward Deployed Engineer to lead end-to-end private and on-premises deployments, collaborate with enterprise IT, and ensure secure, reliable operation of its CI Reliability Platform.
Lead reliability engineering for LinkedIn's massive streaming platform—designing, coding, and operating pub/sub infrastructure to ensure scalable, highly available data flow across the company.
Lead efforts to improve reliability and performance of Alpaca's streaming infrastructure (RabbitMQ/Redpanda) as a Staff Site Reliability Engineer on a remote, North-America-based team.
Lead the development of AI-driven, self-healing SaaS infrastructure as a Senior Site Reliability Engineer at a remote-friendly US company focused on operational excellence and scalable reliability.
Senior Site Reliability Engineer (remote, US) needed to drive automation and reliability at scale, collaborating with cross-functional teams and leading operational excellence initiatives.
Ciroos is hiring a Senior Forward Deployed Engineer to lead enterprise deployments of its AI SRE Teammate, ensuring reliable production outcomes and translating operational pain into product improvements.
Lead reliability engineering at Quizlet as a Senior Staff SRE—architect resilient, self-healing systems, modernize infrastructure, and mentor senior engineers for a global learning platform.
Netic seeks a founding Product Infrastructure Engineer to build and scale the cloud backbone that runs its autonomous AI agents and drives the next wave of agentic products in the physical services economy.
ServiceNow seeks a Site Reliability Engineer (Federal) for the 3rd shift to maintain and improve government cloud infrastructure reliability through automation, monitoring, and deep systems engineering.
Quizlet seeks a Staff Site Reliability Engineer to own platform-wide reliability, automation, and scaling for their San Francisco-based infrastructure team.
Quizlet is hiring a Senior Site Reliability Engineer to build automation, observability, and self-healing infrastructure that ensures reliable, scalable delivery of AI-driven learning services.
Help drive reliability and scalability for Palo Alto Networks' Advanced URL Filtering platform by building secure, automated cloud infrastructure and operational tooling.
Help Ashby scale reliably as a Staff Platform Engineer by building pragmatic, developer-friendly infrastructure, improving observability and reliability, and owning platform initiatives end-to-end.
Waabi is hiring a Senior/Staff Infrastructure Engineer to design and operate high-performance physical and cloud infrastructure that powers its self-driving vehicle research and deployments.
Senior Software Engineer (Site Reliability) to architect resilient, scalable services and mentor engineering teams in support of WGU’s mission to expand access to higher education.
Axiom is hiring a Site Reliability Engineer to build and operate scalable, highly available cloud infrastructure for its serverless data analytics platform.
Palo Alto Networks is hiring a Principal Engineer to build and operate scalable backend services powering Prisma AIRS, the company’s AI security platform.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
22
|