The Opportunity:
Are you obsessed with what’s under the hood? Do you dream in traces, logs, and metrics? Ready to build a high-performing observability practice that powers reliability, speed, and insight at scale?
We are looking for a lead Observability Engineer to advance and integrate observability capabilities across our enterprise platforms. This role is both strategic and hands-on, ideal for someone passionate about improving visibility, preventing incidents before they occur, automating response, and driving service reliability through data.
You will guide engineering efforts, manage team priorities, mentor teammates, and help unify observability practices across tools such as Dynatrace, Splunk, Cribl, and ServiceNow Event Management. As the lead for the team, you’ll provide technical direction, oversee delivery, and ensure observability services are aligned to enterprise goals. You’ll ensure observability practices support critical business services and product platforms, protecting revenue, client delivery, and overall business continuity. This role reflects Booz Allen’s commitment to trusted partnerships and continuous improvement. Due to the nature of work performed within this facility, U.S. citizenship is required.
What You'll Work On:
Lead and mentor the observability engineering team, managing priorities and delivery schedules and fostering a culture of collaboration and continuous improvement.
Provide coaching and feedback to support career development and team growth.
Define and evolve observability strategy, ensuring alignment with business KPIs, SLAs, and enterprise objectives.
Deliver observability projects end-to-end, coordinating with stakeholders to ensure high-quality, on-time outcomes.
Build strong relationships with partner teams to drive adoption, improve coverage, and continuously enhance observability services.
Integrate and modernize observability platforms, including Dynatrace, Splunk, Cribl, and ServiceNow Event Management, while leading migrations from legacy tools.
Expand monitoring capabilities with dashboards, visualizations, Real User Monitoring, synthetic monitoring, and network telemetry.
Advance end-to-end observability across infrastructure, applications, and user experience.
Enable business observability by linking telemetry and analytics to revenue protection, client delivery, and SLA performance.
Apply AI-driven observability and AIOps to detect anomalies, reduce false positives, and proactively prevent incidents.
Own observability KPIs, such as MTTD, MTTR, incident prevention, and migration progress, and report measurable improvements in service reliability.
Establish and enforce observability standards, runbooks, and change management processes while identifying risks and driving continuous improvement.
Join us. The world can't wait.
You Have:
5+ years of experience with IT operations or engineering
5+ years of experience leading or mentoring technical teams, including coaching and developing talent across levels
Experience with architecting and managing observability platforms such as Dynatrace, Splunk, Cribl, or ServiceNow ITOM
Experience with incident reviews, root cause analysis, and implementing platform improvements
Experience managing delivery schedules, aligning priorities, and coordinating cross-functional dependencies
Experience with automation and scripting tools such as Python, Ansible, or Terraform
Experience with Linux systems, distributed systems, and networking concepts
Experience with cloud platforms such as AWS or Azure and containerized technologies such as Kubernetes, Docker, or serverless
Knowledge of ITIL practices, including monitoring and event management and incident, problem, and change management
Bachelor’s degree in Computer Science or Engineering
Nice If You Have:
Experience building dashboards and visualizations for executive and technical audiences, communicating technical context to influence stakeholder decisions
Experience with Real User Monitoring, synthetic monitoring, and end-user experience analytics
Experience with network monitoring and telemetry collection, including SNMP or APIs, integrated into observability workflows
Experience with enterprise-scale migrations and tool rationalization
Experience with ServiceNow ITOM transformations, including discovery, event management, and service mapping
Experience with AIOps, anomaly detection, or machine learning-driven monitoring
Experience with service maturity frameworks or observability centers of excellence
Experience participating in professional networks, technical communities, or thought leadership
Experience working in Agile delivery environments and collaborating across functional teams
Dynatrace, Splunk, ServiceNow, ITIL, or AWS Certification
Compensation
At Booz Allen, we celebrate your contributions, provide you with opportunities and choices, and support your total well-being. Our offerings include health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care. Our recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values. Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible to participate in Booz Allen’s benefit programs. Individuals that do not meet the threshold are only eligible for select offerings, not inclusive of health benefits. We encourage you to learn more about our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page.
Salary at Booz Allen is determined by various factors, including but not limited to location, the individual’s particular combination of education, knowledge, skills, competencies, and experience, as well as contract-specific affordability and organizational requirements. The projected compensation range for this position is $99,000.00 to $225,000.00 (annualized USD). The estimate displayed represents the typical salary range for this position and is just one component of Booz Allen’s total compensation package for employees. This posting will close within 90 days from the Posting Date.
Identity Statement
As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.
Work Model
Our people-first culture prioritizes the benefits of flexibility and collaboration, whether that happens in person or remotely.
Commitment to Non-Discrimination
All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.