You are here
Making "Push On Green" a Reality: Issues & Actions Involved in Maintaining a Production Service
Daniel V. Klein, Google, Inc.
Maintaining a production system is complicated. Systems may consist of many components with separate teams responsible for each one (with multiple development, testing, quality assurance, site reliability, and other engineering teams for each component, each with their own hierarchies, rules, and procedures). Having a system to enforce procedures is a good start, but it is far better to have an automated system to actually perform the updates in a safe and controlled manner. We call this process “Push On Green”. We will discuss some of the many factors that have been (or are actively being) addressed in keeping some of our production systems not only up-and-running, but also updated with as little engineer-involvement and user-visible downtime as possible. In the process, we'll show how you can do this in your environment, too.
Daniel V. Klein, Google, Inc.
Daniel Klein is a Site Reliability Engineer at Google Pittsburgh, where his job is to look for trouble before it happens. When he's not doing that, he gives talks, teaches engineering and soft-topics classes, mentors Nooglers, and makes trouble. Occasionally, he sleeps.
Daniel has written a LISA booklet on Monitoring, and has given dozens of invited talks around the world. He promises that you'll enjoy this talk *and* learn some useful stuff, too!
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Daniel V. Klein},
title = {Making "Push On Green" a Reality: Issues \& Actions Involved in Maintaining a Production Service},
year = {2014},
address = {Seattle, WA},
publisher = {USENIX Association},
month = nov
}
connect with us