Browse 222 exciting jobs hiring in Sre now. Check out companies hiring such as Jobgether, InStride, SharkNinja in Omaha, Lincoln, Henderson.
Provide expert technical support and build scalable automation for cloud security solutions while supporting customers on US‑East hours.
Act as the technical liaison for customers and engineering to ensure reliable enterprise API integrations and excellent developer adoption.
Principal Site Reliability Engineer to lead AWS architecture, automation, and reliability practices for a remote-first engineering team focused on scalable, secure learning platforms.
Lead the strategy and delivery of an AI-native, compliance-aware network observability platform for hyperscale and sovereign cloud environments at ServiceNow.
Lead the reliability, scalability, and observability of research compute clusters to enable large‑scale ML and HPC workloads for an innovative research-focused engineering team in California.
Senior Software Engineer - Infrastructure needed to build and operate resilient, multi-tenant AWS cloud platforms using IaC, observability tooling, and container orchestration in a hybrid work model.
Experienced SRE needed to be a primary technical partner for customers, drive reliability and observability across cloud infrastructure, and lead incident management and automation efforts.
Lead the architecture and development of high-scale backend systems for innovative digital products, providing technical leadership and mentorship within a fast-growing company.
Senior DevOps Engineer needed to lead multi-cloud (Azure & GCP) infrastructure, CI/CD, and reliability efforts for a company modernizing government workflows.
Kikoff is looking for a Platform Engineer to design and build secure, automated self-service infrastructure and CI/CD workflows that accelerate engineering velocity in a regulated fintech environment.
Lead Docker’s platform infrastructure efforts—driving reliability, cloud cost optimization, developer tooling, and service-mesh architecture for a global remote engineering organization.
Lead product strategy and a team of product managers to accelerate developer productivity and ensure scalable, reliable infrastructure across a global SaaS platform.
Experienced DevOps/SRE to own cloud infrastructure, CI/CD, and platform reliability for a fast-growing solar software scale-up.
Senior Software Engineer (DevOps Platform) to develop automation and internal tooling that powers Wealthfront’s infrastructure across physical data centers and cloud environments.
Help drive Clipboard’s platform by building infrastructure-as-code, cloud automation, and internal tooling to improve reliability and developer velocity for a high-growth, remote-first marketplace.
Lead the automation and operational lifecycle of hyper-scale production networks at ServiceNow, driving reliability through code, IaC, and robust incident response for federal and public sector environments.
Lead and build Compa’s first Core Infrastructure team to drive multi-cloud, encryption, and platform reliability initiatives as a hands-on engineering manager.
Lead the strategy and execution of Astronomer’s observability platform as Director of Engineering, scaling systems and people to deliver enterprise-grade data observability.
Develop and lead platform initiatives that measurably boost developer productivity and platform adoption for a remote-focused healthcare tech company.
ConductorOne is hiring a Site Reliability Engineer to build and run scalable, automated, and observable infrastructure that keeps their identity governance platform resilient and performant.
Support and harden Hadrian’s factory IT infrastructure in Torrance by administering servers, networks, virtualization, and endpoints to enable secure, high-availability manufacturing operations.
Work remotely with engineering teams to design and implement observability solutions, contribute to customer codebases, and create actionable documentation for SigNoz's open-source platform.
Wiz is hiring a Solutions Support Engineer (US‑East, remote) to resolve complex customer issues, build automation for support workflows, and help secure customers' cloud environments.
Lead and grow a Solutions Support Engineering team to deliver world-class technical support for Wiz's cloud security platform while driving automation, knowledge, and escalation improvements.
Lead and grow a high-performing DevOps organization at Sphera, driving cloud migrations, CI/CD, containerization, and security for enterprise and government solutions.
Voleon is seeking a Senior Cluster Site Reliability Engineer to ensure high-availability, observability, and scalable operations for our research compute clusters across on-prem and cloud environments.
DXC Technology is hiring a Senior Manager DevOps in Charleston to define DevOps strategy, drive cloud automation (AWS/Azure), and lead a global team to deliver CI/CD and IaC at scale.
NVIDIA is hiring a seasoned Software Product Manager to lead Base Command Manager product strategy, releases, and cross-functional execution for AI infrastructure.
Veeam is seeking an Incident Manager to lead communications and coordination for SaaS incidents, improving reliability and customer trust through clear processes and tooling.
Work remotely as a Security Engineer, Observability to build and operate scalable security telemetry and observability systems that power threat detection and incident response across cloud and on-prem environments.
Anysource seeks a Staff Site Reliability Engineer to lead end-to-end enterprise deployments and run scalable, secure production infrastructure across Kubernetes and AWS.
Lead reliability and automation initiatives to keep critical enterprise SaaS systems performant and highly available in a remote-friendly, fast-paced environment.
Lead the design and implementation of scalable developer infrastructure and productivity tooling for a fast-growing, remote-first technology organization.
Lead and scale Vanta’s Core Platform engineering organization to accelerate product delivery, improve developer productivity, and maintain engineering-driven compliance.
Lead the design and operation of hybrid cloud and bare-metal GPU infrastructure to power high-performance simulation, ML, and factory automation at Atomic Industries.
Zapier is hiring an SRE to strengthen observability, incident response, and platform reliability across its cloud-native automation platform.
Lead a privacy engineering team to design and operate privacy-first systems and automation that safeguard customer data and ensure global compliance.
Catio is hiring a Senior SRE to design and build AWS infrastructure, IaC, and observability pipelines that power a fast-growing AI platform for technical leaders.
Experienced Site Reliability Engineer needed to drive reliability, automation, and cloud infrastructure improvements for Patreon's creator platform in a remote-capable role with optional NY or SF office attendance.
Lead technical escalations and mentor the support organization while delivering deep troubleshooting and advocacy for Drata’s enterprise customers on a remote-first compliance automation platform.
Lead platform engineering initiatives to design, automate, and operate secure, production-ready cloud infrastructure for large-scale services at Palo Alto Networks.
Lead Lambda’s Core Services engineering team to design, scale, and operate CI/CD, cloud automation, and workflow systems supporting gigawatt-scale AI infrastructure.
MSX International is hiring a DMS Product Owner in Dearborn to lead digital customer experience initiatives, translate customer needs into prioritized backlogs, and coordinate resolution across product, engineering, and field teams.
Experienced software engineer sought to develop Python microservices, manage Kubernetes deployments, and evolve CI/CD and Terraform infrastructure for Experian's Model Risk Management platform.
Lead the architecture, performance tuning, security, and automation of enterprise database platforms for a global investment firm operating in New York or Stamford.
Lead a remote-first engineering team to design and scale continuous integration and build infrastructure that accelerates developer delivery and reliability.
Experienced Software Engineer II needed to develop Python microservices, automate infrastructure with Terraform, and maintain CI/CD pipelines in a cloud-native Kubernetes environment.
Experienced SRE needed to architect and run scalable, secure AWS infrastructure while driving observability, automation, and platform reliability across engineering teams.
A crypto-first CTO Advisor role to harden reliability, introduce pragmatic SRE/SDLC/MLOps cadence, and hand back operational runbooks and dashboards to the engineering team.
Senior DevOps Engineer to design and operate Kubernetes-based ephemeral infrastructure and developer tooling for fast, reliable pre-production environments in a remote US role.
Below 50k*
1
|
50k-100k*
8
|
Over 100k*
307
|