Feedback Computing in Leadership Compute Systems
Raghul Gunasekaran and Youngjae Kim, Oak Ridge National Lab
Leadership-class systems are heavily shared environments in which users contend for common system resources. As a result, users experience large performance variations, and overall system throughput suffers. To alleviate the problem, system software tools must be built to take both user requirements and resource availability into account, that is, with a feedback-driven approach. Realizing a feedback-based compute environment for peta-scale systems poses two challenges. First, collecting discrete, coarse-grained system statistics from multiple systems, using minimal system resources and without affecting user jobs, is a hard problem. Second, with discrete data collected from disparate sources, the challenge lies in associating the data for meaningful interpretation so that it can drive feedback-based decision systems in real time. In this paper, we elaborate on a feedback-based computing framework for the peta-scale compute and storage system at the Oak Ridge Leadership Computing Facility. We describe our feedback-based approach to dynamic resource allocation, context-aware scheduling, and application checkpointing.
@inproceedings {gunasekaran2014feedback,
author = {Raghul Gunasekaran and Youngjae Kim},
title = {Feedback Computing in Leadership Compute Systems},
booktitle = {9th International Workshop on Feedback Computing (Feedback Computing 14)},
year = {2014},
address = {Philadelphia, PA},
url = {https://www.usenix.org/conference/feedbackcomputing14/workshop-program/presentation/gunasekaran},
publisher = {USENIX Association},
month = jun
}