- LISA '12 Home
- Registration Information
- Registration Discounts
- Organizers
- At a Glance
- Calendar
- Conference Themes
- Training Program
- Technical Sessions
- Workshops
- Data Storage Day
- ION San Diego
- Posters
- Birds-of-a-Feather Sessions
- Exhibition
- Sponsors
- Activities
- Why Attend?
- Hotel and Travel Information
- Services
- Students and Grants
- Questions?
- Help Promote
- Flyer PDF
- Brochure PDF
- For Participants
- Call for Participation
- Past Proceedings
sponsors
usenix conference policies
Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters
Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, and Priya Narasimhan, Carnegie Mellon University
Awarded Best Student Paper!
Diagnosing performance problems in large distributed systems can be daunting as the copious volume of monitoring information available can obscure the root-cause of the problem. Automated diagnosis tools help narrow down the possible root-causes—however, these tools are not perfect thereby motivating the need for visualization tools that allow users to explore their data and gain insight on the root-cause. In this paper we describe Theia, a visualization tool that analyzes application-level logs in a Hadoop cluster, and generates visual signatures of each job's performance. These visual signatures provide compact representations of task durations, task status, and data consumption by jobs. We demonstrate the utility of Theia on real incidents experienced by users on a production Hadoop cluster.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Elmer Garduno and Soila P. Kavulya and Jiaqi Tan and Rajeev Gandhi and Priya Narasimhan},
title = {Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters },
booktitle = {26th Large Installation System Administration Conference (LISA 12)},
year = {2012},
isbn = {978-931971-97-3},
address = {San Diego, CA},
pages = {33--42},
url = {https://www.usenix.org/conference/lisa12/technical-sessions/presentation/garduno},
publisher = {USENIX Association},
month = dec
}
connect with us