Tech Lead - Platform Monitoring Engineer

At Databricks, we are passionate about empowering data teams to tackle the world’s most challenging problems — from bringing the next mode of transportation to reality to accelerating the development of medical breakthroughs. We achieve this by building and operating the world’s best data and AI infrastructure platform, enabling our customers to leverage deep data insights and enhance their business. Founded by engineers — and customer-obsessed — we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started. 

 

We are seeking an exceptional Senior Platform Monitoring Engineer to join our Platform Monitoring Team. This is a high-impact technical role for someone who thrives at the intersection of platform reliability, incident response, and customer obsession. You will serve as a critical first responder for the Databricks Platform, leading complex investigations, designing observability solutions, and driving systemic improvements that enhance customer experience and platform stability.

 

The Impact You Will Have

  • Lead platform incident investigation, coordinating cross-functional teams through rapid detection, mitigation, and resolution to minimize customer impact.
  • Conduct thorough post-incident root cause analysis across infrastructure, services, and cloud providers to identify systemic patterns and prevent future occurrences.
  • Design and implement customer-focused alerting pipelines and end-to-end observability workflows to enhance detection coverage and reduce mean time to detection.
  • Build automation tools, establish reusable monitoring patterns, and resolve reliability gaps that directly impact customer experience.

 

What We Are Looking For:

  • Minimum of 5 years of experience as an SRE, DevOps Engineer, Production Engineer, or similar role.
  • Production-level experience with at least one major cloud provider (AWS, Azure, GCP) and proficiency in container and orchestration technologies (Docker, Kubernetes).
  • Hands-on experience with monitoring, logging, and alerting tools such as ELK, Prometheus, Grafana, and PagerDuty, and the ability to architect monitoring solutions that correlate metrics, logs, and traces.
  • Strong proficiency in Python or similar languages with the ability to build production-quality automation tools.
  • Experience owning critical phases of the incident lifecycle from detection through resolution and post-mortem analysis in demanding production environments.
  • BS, Master's, or PhD in Computer Science, Computer Engineering, or a related engineering field.

 

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for an annual performance bonus, equity, and the benefits listed below. For more information regarding which range your location is in, visit our page here.

 

Zone 3 Pay Range
$121,700 - $170,450 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500, rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics, and AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake, and MLflow. To learn more, follow Databricks on Twitter, LinkedIn, and Facebook.

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Employment Type
Full-time, hybrid

Date Posted
November 11, 2025