Role Overview:
The Mail Service Reliability Engineering (SRE) Manager is responsible for ensuring 7x24 incident management and the reliability of mail services. The manager leads a diverse, distributed team across multiple time zones and countries, partnering closely to respond to and resolve mail service incidents and implement changes in production environments. This role is critical to the organization’s commitment to high-availability mail services, ensuring users experience minimal disruptions and rapid recovery from incidents.
Responsibilities:
24/7 Incident Management
Lead, organize, and oversee the team’s 7x24 incident response for all mail applications, ensuring rapid detection and resolution of incidents.
Strive to shorten Mean Time to Detection (MTTD) and Mean Time to Resolution (MTTR) while consistently improving Service Level Objectives (SLO) and Service Level Agreements (SLA).
System & Service Health Monitoring
Implement comprehensive system/service health monitoring.
Design, deploy, and maintain dashboards for real-time visibility of critical metrics (Availability, MTTD, MTTR).
Set up alerts and escalation processes for early issue detection and response.
Runbooks & Operational Excellence
Develop and maintain detailed runbooks for SRE and Operations teams, specifying permissions, documented service impact, and clear step-by-step procedures for incident response and service changes.
Incident Analysis and Remediation
Facilitate root cause analysis and post-mortems for all major incidents, ensuring action items are tracked and implemented for continuous improvement.
Drive remediation, preventive measures, and process enhancements across teams.
Change Management
Oversee safe deployment procedures; ensure readiness for rollback operations during outage.
Record and track impacts to systems and users throughout incidents and change events.
Collaboration
Coordinate with team members and partners across different regions and time zones to ensure seamless handoffs and communication.
Foster a culture of reliability, accountability, and proactive problem-solving.
Qualifications:
Minimum 7 years of proven experience in Incident Management, preferably in a large-scale, distributed mail or messaging system environment, for both on-perm and cloud environments.
Hands-on experience with monitoring tools, dashboard setup, and alerting systems.
Deep understanding of SRE principles: system reliability, operational runbooks, and root cause analysis.
Strong organizational, leadership, and communication skills across diverse, global teams.
Demonstrable record of improving service reliability metrics (MTTD, MTTR, Availability).
The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.
At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!
Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.
We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.
The compensation for this position ranges from $136,125.00 - $283,750.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.Currently work for Yahoo? Please apply on our internal career site.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Serco seeks experienced soil and geotechnical site inspectors to perform field inspections, technical reviews, scope development, and cost estimates in support of FEMA Public Assistance across FEMA Zone 2.
Veolia is hiring a Project Engineer to support engineering, QA/QC, CAD drafting, and testing activities across water, waste and energy projects, including on-site work and global travel.
Experienced civil engineer with marine infrastructure and dredging expertise sought to lead technical design and client efforts for waterfront projects at Foth's Memphis/Nashville team.
Contribute to next-generation palm-sized LiDAR development as a hands-on Test Engineering Intern, designing fixtures, automating tests, and analyzing sensor data alongside senior engineers in NYC.
Lead substation design and protection/control efforts as a licensed Professional Engineer, managing a team and client relationships while overseeing relay settings, testing, and commissioning in Raleigh, NC.
Serco seeks a deployable Health Scientist Site Inspector with FEMA Public Assistance experience to perform site inspections, prepare scopes of work and cost estimates, and advise FEMA on technical claims across Zone 2.
Experienced transportation drainage engineer needed to design and deliver stormwater and drainage solutions for major highway and interchange projects at AECOM's Denver office.
Experienced licensed engineer or architect sought to lead Jefferson County Development Services operations—overseeing permitting, inspections, planning, code enforcement, GIS, and stormwater programs.
BlueAlly is hiring an experienced Network Automation Engineer in Baltimore to design, automate, and support enterprise network solutions using tools like Ansible, Terraform, and NetBox.
Be the on-site process engineering lead for AMP’s Commerce City single-stream facility, owning commissioning, operator training, and reliability to meet throughput, recovery, and uptime targets.
Serco is seeking experienced Structural Engineering Site Inspectors to conduct field inspections, validate scopes and estimates, and support FEMA Public Assistance recovery operations across Zone 2.
AECOM is hiring a Senior Traction Power Engineer to design and analyze traction power distribution systems for rail and transit projects while supporting field commissioning and technical documentation.
AECOM is hiring a Transportation Drainage Lead in Murray, UT to lead drainage design, hydraulic modeling, and delivery of complex highway and roadway stormwater solutions.