Monday, March 18
8:45 am–9:00 am
Opening Remarks
Grand Ballroom
Program Co-Chairs: Sarah Butt, SentinelOne, and Dan Fainstein, The D. E. Shaw Group
9:00 am–10:30 am
Opening Plenary Session
Grand Ballroom
20 Years of SRE: Highs and Lows
Monday, 9:00 am–9:25 am
Scam or Savings? A Cloud vs. On-Prem Economic Slapfight
Monday, 9:25 am–9:50 am
Is It Already Time To Version Observability? (Signs Point To Yes.)
Monday, 9:50 am–10:30 am
10:30 am–11:00 am
Break with Refreshments
Pacific Concourse
11:00 am–12:35 pm
Track 1
Grand Ballroom A
Capacity Constraints Unveiled: Navigating Cloud Scaling Realities
Monday, 11:00 am–11:45 am
Sharding: Growing Systems from Node-scale to Planet-scale
Monday, 11:50 am–12:35 pm
Track 2
Grand Ballroom BC
Product Reliability for Google Maps
Monday, 11:00 am–11:45 am
Using Generative AI Patterns for Better Observability
Monday, 11:50 am–12:35 pm
John Feminella, Nuvalence
12:35 pm–1:50 pm
Luncheon
The Atrium
Sponsored by Cortex
1:50 pm–3:25 pm
Track 1
Grand Ballroom A
Build vs. Buy in the Midst of Armageddon
Monday, 1:50 pm–2:35 pm
Compliance & Regulatory Standards Are NOT Incompatible with Modern Development Best Practices
Monday, 2:40 pm–3:25 pm
Track 2
Grand Ballroom BC
The Ticking Time Bomb of Observability Expectations
Monday, 1:50 pm–2:35 pm
Synthesizing Sanity with, and in Spite of, Synthetic Monitoring
Monday, 2:40 pm–3:25 pm
3:25 pm–3:55 pm
Break with Refreshments
Pacific Concourse
3:55 pm–5:30 pm
Track 1
Grand Ballroom A
Migrating a Large Scale Search Dataset in Production in a Highly Available Manner
Monday, 3:55 pm–4:15 pm
OIDC and CICD: Why Your CI Pipeline Is Your Greatest Security Threat
Monday, 4:20 pm–4:40 pm
When Your Open Source Turns to the Dark Side
Monday, 4:45 pm–5:30 pm
Track 2
Grand Ballroom BC
The Sins of High Cardinality
Monday, 3:55 pm–4:15 pm
Optimizing Resilience and Availability by Migrating from JupyterHub to the Kubeflow Notebook Controller
Monday, 4:20 pm–4:40 pm
99.99% of Your Traces Are (Probably) Trash
Monday, 4:45 pm–5:30 pm
Tuesday, March 19
8:00 am–9:00 am
Continental Breakfast
Pacific Concourse
9:00 am–10:30 am
Tuesday Plenary Session
Grand Ballroom
Meeting the Challenge of Burnout
Tuesday, 9:00 am–9:45 am
What We Want Is 90% the Same: Using Your Relationship with Security for Fun and Profit
Tuesday, 9:45 am–10:30 am
10:30 am–11:00 am
Break with Refreshments
Pacific Concourse
11:00 am–12:35 pm
Track 1
Grand Ballroom A
Thawing the Great Code Slush
Tuesday, 11:00 am–11:45 am
Resilience in Action
Tuesday, 11:50 am–12:35 pm
Track 2
Bayview Room
"Logs Told Us It Was Kernel – It Wasn't"
Tuesday, 11:00 am–11:45 am
Autopsy of a Cascading Outage from a MySQL Crashing Bug
Tuesday, 11:50 am–12:35 pm
12:35 pm–1:50 pm
Luncheon
The Atrium
Sponsored by Incident
1:50 pm–3:25 pm
Track 1
Bayview Room
Navigating the Kubernetes Odyssey: Lessons from Early Adoption and Sustained Modernization
Tuesday, 1:50 pm–2:35 pm
Kube, Where’s My Metrics? The Challenges of Scaling Multi-Cluster Prometheus
Tuesday, 2:40 pm–3:25 pm
Track 2
Grand Ballroom BC
What Is Incident Severity, but a Lie Agreed Upon?
Tuesday, 1:50 pm–2:35 pm
Hard Choices, Tight Timelines: A Closer Look at Skip-level Tradeoff Decisions during Incidents
Tuesday, 2:40 pm–3:25 pm
Dr. Laura Maguire, Trace Cognitive Engineering, and Courtney Nash, The VOID
Track 3
Seacliff Room
Workshop: Cloud-Native Observability with OpenTelemetry
Tuesday, 1:50 pm–5:30 pm
3:25 pm–3:55 pm
Break with Refreshments
Pacific Concourse
3:55 pm–5:30 pm
Track 1
Bayview Room
Kubernetes: The Most Graceful Termination™
Tuesday, 3:55 pm–4:40 pm
How We Went from Being Astronauts to Being Mission Control
Tuesday, 4:45 pm–5:30 pm
Track 2
Grand Ballroom BC
Triage with Mental Models
Tuesday, 3:55 pm–4:40 pm
Defence at the Boundary of Acceptable Performance
Tuesday, 4:45 pm–5:30 pm
Track 3 (continued)
Seacliff Room
Workshop: Cloud-Native Observability with OpenTelemetry
Tuesday, 1:50 pm–5:30 pm
7:00 pm–8:00 pm
Lightning Talks
Grand Ballroom BC
- Under Pressure: How We Make Decisions
  Laura Nolan
- Life as A Firmware Engineer for Server Manageability
  Chitkala Sethuraman, Microsoft
- Lessons Learned Hearing and Publishing SRE Stories
  Prathamesh Sonpatki
- Retrospectives, Blameless, and Traffic Courts: Oh My!
  J. Paul Reed, Spective Coherence, Inc.
- Chrome Now Supports Name Constraints in User-added Certificate Authorities. Now What?
  Ted Hahn, TCB Technologies, Inc.
- Introducing: Service Level Offsets, A New Way To Buy Reliability
  Cail Young, Octopus Deploy
- The Why and How of BGP Integration in On-Premise Kubernetes for Enhanced Reliability
  Tushar Gupta, Google
- You Build It, I Run It: Separate Dev and SRE Teams Are Better
  Adam Mckaig
- Degenerative AI and You: A Cautionary Talk
  Corey Quinn, The Duckbill Group
Wednesday, March 20
8:00 am–9:00 am
Continental Breakfast
Grand Foyer
9:00 am–10:35 am
Track 1
Grand Ballroom A
System Performance and Queuing Theory - Concepts and Application
Wednesday, 9:00 am–9:45 am
It Is OK to Be Metastable
Wednesday, 9:50 am–10:35 am
Track 2
Grand Ballroom BC
The Art of SRE: Building People Networks to Amplify Impact
Wednesday, 9:00 am–9:45 am
Teaching SRE
Wednesday, 9:50 am–10:35 am
10:35 am–11:05 am
Break with Refreshments
Pacific Concourse
11:05 am–12:40 pm
Track 1
Grand Ballroom A
Cross-System Interaction Failures: Don't Fail through the Cracks
Wednesday, 11:05 am–11:50 am
Gray Failure: The Achilles’ Heel of Cloud-Scale Systems
Wednesday, 11:55 am–12:40 pm
Track 2
Grand Ballroom BC
The Invisible Door: Reliability Gaps in the Front End
Wednesday, 11:05 am–11:50 am
Automating Disaster Recovery: The Ultimate Reliability Challenge
Wednesday, 11:55 am–12:40 pm
12:40 pm–1:55 pm
Luncheon
The Atrium
Sponsored by Sentry
1:55 pm–3:30 pm
Track 1
Grand Ballroom A
From Chaos to Clarity: Deciphering Cache Inconsistencies in a Distributed Environment
Wednesday, 1:55 pm–2:15 pm
Patching Your Way to Compliance with a Small Team and a Pile of Technical Debt
Wednesday, 2:20 pm–2:40 pm
Track 2
Grand Ballroom BC
Taming the Linux Distribution Sprawl: A Journey to Standardization and Efficiency
Wednesday, 1:55 pm–2:15 pm
Frontend Design in SRE
Wednesday, 2:20 pm–2:40 pm
Measuring Reliability Culture to Optimize Tradeoffs: Perspectives from an Anthropologist
Wednesday, 2:45 pm–3:05 pm
Storytelling as an Incident Management Skill
Wednesday, 3:10 pm–3:30 pm
3:30 pm–4:00 pm
Break with Refreshments
Grand Foyer
4:00 pm–5:30 pm
Closing Plenary Session
Grand Ballroom
Real Talk: What We Think We Know — That Just Ain’t So
Wednesday, 4:00 pm–4:45 pm
What Can You See from Here?
Wednesday, 4:45 pm–5:30 pm
5:30 pm–5:35 pm
Closing Remarks
Grand Ballroom
Program Co-Chairs: Sarah Butt, SentinelOne, and Dan Fainstein, The D. E. Shaw Group