Job details

Site Reliability Engineer

About Zapier

We're humans who simply think computers should do more work.

At Zapier, we’re not just making software—we’re building a platform to help millions of businesses globally scale with automation and AI. Our mission is to make automation work for everyone by delivering products that delight our customers. You’ll collaborate with brilliant people, use the latest tools, and leverage the flexibility of remote work. Your work will directly fuel our customers’ success, and as they grow, so will you.

Site Reliability Engineer

Job Posted: October 1st, 2025

Location: Remote, NAMER (West Coast)

Zapier’s Internal Platform provides engineers with a reliable, frictionless foundation for building, shipping, and operating software. Our Reliability Platform team owns observability, incident response, and service ownership, and we’re hiring a Site Reliability Engineer to help strengthen Zapier’s reliability posture.

Want to learn more about working at Zapier?

We know we have a lot of competition for your skills. If you’re wondering what things would be like at Zapier, read on about:

Why This Role Matters

This isn’t just an infrastructure or tooling role. We’re looking for an engineer who’s excited to get hands-on with Zapier’s reliability challenges. You’ll help improve how we observe our systems, detect and respond to incidents, and build the systems that make Zapier more resilient at scale.

About You

  • You’re an experienced engineer with 4+ years in systems, infrastructure, or backend software roles (SaaS, cloud-native environments preferred).

  • You thrive on writing production-grade code in Go, Python, or an equivalent language.

  • You’ve worked with infrastructure-as-code (Terraform, or equivalent), cloud (AWS), and container orchestration (Kubernetes).

  • You have hands-on experience with observability (metrics, logging, dashboards, alerts) and can reason about instrumentation and alert design.

  • You enjoy solving complex systems challenges and finding ways to improve performance and reliability.

  • You’re comfortable jumping into incidents, diagnosing across telemetry, coordinating with teams, and contributing to postmortems.

  • You think proactively about reducing toil and automating repetitive work.

  • You’re comfortable influencing peers by suggesting better practices, reviewing designs, and driving small cross-team improvements.

  • You communicate clearly—whether in async docs, real-time discussions, or knowledge sharing with the team.

  • You align with Zapier’s values and thrive in a collaborative, remote-first environment.

  • You approach new tools and ideas with curiosity and openness—especially around AI in reliability workflows. You’ve experimented with AI tools (or are eager to learn) and see them as part of your everyday toolkit.

Things You’ll Do

  • Build and improve platform tooling that helps Zapier engineers observe and operate their services.

  • Partner with product teams to raise the bar on observability and incident response.

  • Operate and evolve core observability systems, including logging, metrics, alerting, and dashboards.

  • Participate in the team’s on-call rotation for owned services and contribute to Zapier’s broader incident response program by improving the processes, tooling, and practices we use to detect, respond to, and learn from incidents.

  • Write code to automate operations, improve developer experience, and reduce manual toil.

  • Contribute to infrastructure reliability by working with AWS, Kubernetes, Terraform, and other core technologies.

  • Help shape observability and reliability best practices: review instrumentation designs, suggest improvements, and advocate for effective alerting.

  • Share knowledge through documentation, pairing, and mentoring.

  • Explore and pilot AI-augmented tools (e.g. debugging agents, alert correlation, query recommendations) to improve reliability workflows.

Our Stack & Tools

  • Cloud & Infra: AWS, Kubernetes, Redis, Kafka, Terraform

  • Observability: Grafana, Datadog, OpenSearch, Prometheus, Sentry

  • Languages: Go, Python, TypeScript

  • CI/CD & Source Control: GitLab, ArgoCD

What Success Looks Like

  • You deliver reliable, maintainable improvements to Zapier’s reliability systems and tooling.

  • You improve how teams detect and resolve incidents by enhancing observability, standardizing tooling and processes, and contributing to effective response workflows.

  • You help product teams gain confidence in their services by guiding them toward better instrumentation and visibility.

  • You influence observability and reliability practices across teams—promoting a thoughtful, customer-focused approach to monitoring, alerting, and design decisions.

  • You connect reliability work to customer impact, helping your team focus on the improvements that matter most.

  • You grow through feedback and reflection, while contributing to a healthy, inclusive team culture—supporting peers, mentoring, and creating space for diverse perspectives.

  • You explore AI tools with curiosity and introduce practical uses such as reducing noise, speeding up debugging, or guiding better operational decisions.


Zapier Glassdoor Company Review: 4.0
Zapier DE&I Review: 4.4
CEO of Zapier: Wade Foster

Average salary estimate

$180,000 / YEARLY (est.)
min: $140,000
max: $220,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.


Zapier exists to Make Automation Work for Everyone.

Employment type: Full-time, remote
Date posted: October 2, 2025