Incident Groundhog Day

Wednesday, 30 October, 2024 - 11:50-12:30 GMT

Hamed Silatani, Uptime Labs

Abstract: 

Learning how to respond effectively to incidents is hard. One reason is that we never see the same incident twice. While we can learn vital lessons during and after an incident, we can’t hop into a time machine and apply those lessons to the same incident to discover their impact. What if we could experience the same incident over and over again? What might we learn? This talk describes a ‘staged world’ experiment in which 20 incident managers separately experienced the same simulated incident affecting a fictitious e-commerce company. We discuss what we noticed that differentiated some incident responders from others, and some surprising things that we expected to see, but didn’t.

Hamed Silatani, Uptime Labs

Hamed is co-founder and CEO of Uptime Labs, an incident learning & simulation platform. He has 20 years of experience in engineering leadership, reliability engineering, and IT operations. Having spent the majority of his career at the sharp end of incident response in financial services, he's looking to help all companies master the unexpected.

BibTeX
@conference {302155,
author = {Hamed Silatani},
title = {Incident Groundhog Day},
year = {2024},
address = {Dublin},
publisher = {USENIX Association},
month = oct
}
