usenix conference policies
You are here
Understanding Reliability Implication of Hardware Error in Virtualization Infrastructure
Xin Xu and H. Howie Huang, George Washington University
Hardware errors are no longer the exceptions in modern cloud data centers. Although virtualization provides software failure isolation across different virtual machines (VM), the virtualization infrastructure including the hypervisor and privileged VMs remains vulnerable to hardware errors. Making matters worse is that such errors are unlikely bounded by virtualization boundary and may lead to loss of work in multiple guest VMs due to unexpected and/or mishandled failures. To understand reliability implication of hardware errors in virtualized systems, in this paper we develop a simulation-based framework that enables a comprehensive fault injection study on the hypervisor with a wide range of configurations. Our analysis shows that, in current systems, many hardware errors can propagate through various paths for an extended time before an observed failure (e.g., whole system crash). We further discuss the challenges of designing error tolerance techniques for the hypervisor.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Xin Xu and H. Howie Huang},
title = {Understanding Reliability Implication of Hardware Error in Virtualization Infrastructure},
booktitle = {10th Workshop on Hot Topics in System Dependability (HotDep 14)},
year = {2014},
address = {Broomfield, CO},
url = {https://www.usenix.org/conference/hotdep14/workshop-program/presentation/xu},
publisher = {USENIX Association},
month = oct
}
connect with us