Using Hints to Improve Inline Block-layer Deduplication
Sonam Mandal, Stony Brook University; Geoff Kuenning, Harvey Mudd College; Dongju Ok and Varun Shastry, Stony Brook University; Philip Shilane, EMC Corporation; Sun Zhen, Stony Brook University and National University of Defense Technology; Vasily Tarasov, IBM Research; Erez Zadok, Stony Brook University
Block-layer data deduplication allows file systems and applications to reap the benefits of deduplication without requiring per-system or per-application modifications. However, important information about data context (e.g., data vs. metadata writes) is lost at the block layer. Passing such context to the block layer can help improve deduplication performance and reliability. We implemented a hinting interface in an open-source block-layer deduplication system, dmdedup, that passes relevant context to the block layer, and evaluated two hints, NODEDUP and PREFETCH. To allow upper storage layers to pass hints based on the available context, we modified the VFS and file system layers to expose a hinting interface to user applications. We show that passing the NODEDUP hint speeds up applications by up to 5.3× on modern machines because the overhead of deduplication is avoided when it is unlikely to be beneficial. We also show that the PREFETCH hint accelerates applications by up to 1.8× by caching hashes for data that is likely to be accessed soon.
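To make the two hints concrete, the following is a minimal Python sketch of a toy in-memory deduplicating block store. It is not dmdedup's code or interface: the class `ToyDedupDevice`, its fields, the `hints` parameter, and the way PREFETCH is handled (hashing the block on read and loading its index entry into a cache) are all assumptions made for illustration. It only shows the general idea that a NODEDUP write can skip hashing and index lookups, and that a PREFETCH read can warm the hash cache so a later write of the same data avoids an index lookup.

```python
import hashlib

# Hypothetical hint flags, named after the hints in the abstract.
NODEDUP = "NODEDUP"    # skip deduplication for this write
PREFETCH = "PREFETCH"  # warm the hash cache while serving this read


class ToyDedupDevice:
    """Toy in-memory model of an inline deduplicating block target (illustrative only)."""

    def __init__(self):
        self.physical = []     # physical blocks, indexed by physical block number (PBN)
        self.hash_index = {}   # content hash -> PBN (stands in for the on-disk index)
        self.hash_cache = {}   # in-memory cache of hash_index entries
        self.lbn_map = {}      # logical block number (LBN) -> PBN
        self.hashes_computed = 0
        self.index_lookups = 0

    def _allocate(self, data):
        # Append a new physical block and return its PBN.
        self.physical.append(data)
        return len(self.physical) - 1

    def write(self, lbn, data, hints=frozenset()):
        if NODEDUP in hints:
            # Bypass hashing and index lookups; always store a fresh block.
            self.lbn_map[lbn] = self._allocate(data)
            return
        digest = hashlib.sha256(data).digest()
        self.hashes_computed += 1
        pbn = self.hash_cache.get(digest)
        if pbn is None:
            # Cache miss: consult the (simulated) slower index.
            self.index_lookups += 1
            pbn = self.hash_index.get(digest)
        if pbn is None:
            # Unique data: store it and record its hash.
            pbn = self._allocate(data)
            self.hash_index[digest] = pbn
        self.hash_cache[digest] = pbn
        self.lbn_map[lbn] = pbn

    def read(self, lbn, hints=frozenset()):
        data = self.physical[self.lbn_map[lbn]]
        if PREFETCH in hints:
            # Assumed behavior: load this block's index entry into the cache,
            # anticipating that the same data will be written again soon.
            digest = hashlib.sha256(data).digest()
            pbn = self.hash_index.get(digest)
            if pbn is not None:
                self.hash_cache[digest] = pbn
        return data
```

In this sketch, a caller would pass `hints={NODEDUP}` for writes it expects to be unique (for example, metadata) and `hints={PREFETCH}` when reading data it is about to write elsewhere (for example, during a file copy). The counters `hashes_computed` and `index_lookups` exist only to make the avoided work observable.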
author = {Sonam Mandal and Geoff Kuenning and Dongju Ok and Varun Shastry and Philip Shilane and Sun Zhen and Vasily Tarasov and Erez Zadok},
title = {Using Hints to Improve Inline Block-layer Deduplication},
booktitle = {14th USENIX Conference on File and Storage Technologies (FAST 16)},
year = {2016},
isbn = {978-1-931971-28-7},
address = {Santa Clara, CA},
pages = {315--322},
url = {https://www.usenix.org/conference/fast16/technical-sessions/presentation/mandal},
publisher = {USENIX Association},
month = feb
}