How Snowflake Migrated All Alerts and Dashboards to a Prometheus-Based Metrics System in 3 Months

Wednesday, 30 October, 2024 - 16:50–17:30 GMT

Carlos Mendizabal, Snowflake

Abstract: 

This talk covers how Snowflake migrated its alerts and dashboards to a Prometheus-based metrics system in 3 months, a migration that included rewriting all alerts and dashboards used for system monitoring. We'll go over the tooling that enabled us to complete this migration successfully, which included configuration-as-code through Jsonnet and a unit testing framework, and share some important takeaways from this effort.

Carlos Mendizabal is a software engineer at Snowflake. He is part of the Observability team and loves to build things (and to ensure they're well monitored!). He was previously at Meta and is passionate about meeting folks across the industry and keeping up with the latest and greatest in tech. Carlos lives in Seattle, Washington, and is a pilot in his free time.

BibTeX
@conference {302255,
author = {Carlos Mendizabal},
title = {How Snowflake Migrated All Alerts and Dashboards to a {Prometheus-Based} Metrics System in 3 Months},
year = {2024},
address = {Dublin},
publisher = {USENIX Association},
month = oct
}
