While our organization excelled at maintaining server SLOs for Google Maps, we discovered that many user-impacting incidents, particularly those stemming from client-side issues like mobile app rollouts, remained undetected by server-centric monitoring. This realization prompted a strategic shift towards product reliability, prioritizing the end-user experience. This talk will discuss how we navigated this transition, sharing our progress in addressing challenges, the valuable lessons learned, and our evolving vision for a holistic, user-focused reliability strategy.
- WATCH NOW
- 2025 EVENTS
- PAST EVENTS
- 2024
- 2023
- 2022
- February
- RTC @Scale 2022
- March
- Systems @Scale Spring 2022
- April
- Product @Scale Spring 2022
- May
- Data @Scale Spring 2022
- June
- Systems @Scale Summer 2022
- Networking @Scale Summer 2022
- August
- Reliability @Scale Summer 2022
- September
- AI @Scale 2022
- November
- Networking @Scale Fall 2022
- Video @Scale Fall 2022
- December
- Systems @Scale Winter 2022
- 2021
- 2020
- 2019
- 2018
- 2017
- 2016
- 2015
- Blog & Video Archive
- Speaker Submissions