Workshop 2: Incident Analysis: Maximizing Learning from Your Darkest Hours
Jefferson
Sue Lueder and Nile Geisinger, Google, Inc.
Outages and incidents are inevitable in large-scale, complex systems. Fixing the underlying technical problems is a challenge, but even more challenging is identifying and fixing any underlying systemic technical or organizational issues that are making incidents more frequent, more severe, or more costly to resolve.
This hands-on workshop will explore techniques for analyzing a set of incident postmortems and show what types of insights are waiting to be discovered. Postmortems and incident reports for analysis will be provided, but attendees are encouraged to bring some of their own.
Sue Lueder, Google, Inc.
Sue Lueder joined Google as a Site Reliability Program Manager in 2014 and is on the team responsible for disaster testing and readiness, incident management processes and tools, and incident analysis. Prior to Google, Sue was a technical program manager and a systems, software, and quality engineer in the wireless and smart energy industries (OnRamp Wireless, Texas Instruments, Qualcomm). She has an M.S. in Organization Development from Pepperdine University and a B.S. in Physics from UCSD.
Nile Geisinger, Google, Inc.
Nile Geisinger joined Google as a Site Reliability Engineer in 2015 and is also on the team responsible for disaster testing and readiness, incident management processes and tools, and incident analysis. Prior to Google, Nile worked for Amazon, in both AWS and supply chain, and founded a startup in Silicon Valley. Nile has a B.S. in Computer Science and a B.A. in Philosophy from U.C. Davis.
author = {Sue Lueder and Nile Geisinger},
title = {Workshop 2: Incident Analysis: Maximizing Learning from Your Darkest Hours},
year = {2015},
address = {Washington, D.C.},
publisher = {USENIX Association},
month = nov
}