Sailing the Database Seas: Applying SRE Principles at Scale

Tuesday, 29 October, 2024 - 14:00–14:40 GMT

Ioannis Androulidakis and Martin Alderete, Booking.com

Abstract: 

In this talk, we will demonstrate how we apply core SRE principles in the field of Database Engineering. More specifically, we will talk about the challenges of operating large-scale database systems in multiple cloud environments and how adopting SRE best practices dramatically improved our daily workflows and operations.

We will share insights and concrete use cases around the following topics: Monitoring Distributed Systems, Eliminating Toil and Postmortem Culture.

This talk will equip attendees with ideas and guidelines to better understand and efficiently operate their database systems, such as choosing the right SLIs and SLOs, automating capacity planning, and embracing a postmortem culture after outages.

Ioannis Androulidakis, Booking.com

Ioannis Androulidakis is a Site Reliability Engineer with a strong background and multiple years of experience in Operating Systems, Observability Tools and Cloud Platforms. He is passionate about OSS technologies and has contributed to multiple open-source projects over the years.

Ioannis holds a diploma in Electrical and Computer Engineering from the National Technical University of Athens, Greece. In 2017 he was accepted for a full-time internship at the IT department of CERN in Geneva, Switzerland. He then worked for several companies as a Software Engineer and expanded his knowledge of virtualization, distributed systems, and cloud-native storage.

He recently joined Booking.com's Database Engineering team as a Site Reliability Engineer in Amsterdam, Netherlands, where he primarily focuses on the reliability and performance of large-scale MySQL clusters.

Martin Alderete, Booking.com

Martin Alderete is a Principal Site Reliability Engineer with a long track record in Engineering, Distributed Systems and System Level Programming, both in academia, where he worked as a teaching assistant after getting his degree, and in industry, where he led different teams building complex systems at scale.

He is passionate about Open Source and new technologies, an active contributor to open-source projects and part of different technical groups.

Before joining Booking.com, he worked in multiple industries, including space, where he was a Satellite Reliability Engineer building systems (and bugs!) to operate fleets of satellites.

He is based in Amsterdam but is originally from beautiful Patagonia, Argentina.

BibTeX
@conference {302215,
author = {Ioannis Androulidakis and Martin Alderete},
title = {Sailing the Database Seas: Applying {SRE} Principles at Scale},
year = {2024},
address = {Dublin},
publisher = {USENIX Association},
month = oct
}