sponsors
usenix conference policies
Challenges to Error Diagnosis in Hadoop Ecosystems
Jim (Zhanwen) Li, NICTA; Siyuan He, Citibank; Liming Zhu, NICTA and University of New South Wales; Xiwei Xu, NICTA; Min Fu, University of New South Wales; Len Bass and Anna Liu, NICTA and University of New South Wales; An Binh Tran, University of New South Wales
Deploying a large-scale distributed ecosystem such as HBase/Hadoop in the cloud is complicated and error-prone. Multiple layers of largely independently evolving software are deployed across distributed nodes on third party infrastructures. In addition to software incompatibility and typical misconfiguration within each layer, many subtle and hard to diagnose errors happen due to misconfigurations across layers and nodes. These errors are difficult to diagnose because of scattered log management and lack of ecosystem-awareness in many diagnosis tools and processes.
We report on some failure experiences in a real world deployment of HBase/Hadoop and propose some initial ideas for better trouble-shooting during deployment. We identify the following types of subtle errors and the corresponding challenges in trouble-shooting: 1) dealing with inconsistency among distributed logs, 2) distinguishing useful information from noisy logging, and 3) probabilistic determination of root causes.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Jim Zhanwen Li and Siyuan He and Liming Zhu and Xiwei Xu and Min Fu and Len Bass and Anna Liu and An Binh Tran},
title = {Challenges to Error Diagnosis in Hadoop Ecosystems},
booktitle = {27th Large Installation System Administration Conference (LISA 13)},
year = {2013},
isbn = {978-1-931971-05-8},
address = {Washington, D.C.},
pages = {145--154},
url = {https://www.usenix.org/conference/lisa13/technical-sessions/presentation/li},
publisher = {USENIX Association},
month = nov
}
connect with us