Site Reliability at Google/My First Year at Google

Abstract: 

Tom will speak about what it's like to be on the team that runs www.google.com's services and explore some of the technologies that enable Google's Web services to maintain their high uptime. Google's "service oriented network" (SON) enables the creation of new products that are scalable and maintainable. Tom will give a sysadmin's view of Google technologies such as GFS, MapReduce, Sawmill, and more. He will also describe how to make a policy that is "Googley."

Tom is the author of O'Reilly's Time Management for System Administrators and co-author of The Practice of System and Network Administration from Addison-Wesley. He joined Google in January 2006. A sysadmin and network wonk since 1987, he has worked at Cibernet, Dean for America, Lumeta, Bell Labs/Lucent, Mentor Graphics, and Drew University. He is a frequent presenter at LISA conferences and joint recipient of USENIX and SAGE's 2005 Outstanding Achievement Award.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@conference {268778,
author = {Tom Limoncelli},
title = {Site Reliability at {Google/My} First Year at Google},
year = {2006},
address = {Washington, D.C.},
publisher = {USENIX Association},
month = dec
}

Presentation Video

Presentation Audio