sponsors
usenix conference policies
You are here
Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters
12 Tuesday | 13 Wednesday | 14 Thursday | 15 Friday |
---|---|---|---|
HotCloud '12 | TaPP '12 | ||
WiAC '12 | USENIX ATC '12 | ||
UCMS '12 | HotStorage '12 | NSDR '12 | |
USENIX Cyberlaw '12 | WebApps '12 |
Matei Zaharia, Tathagata Das, Haoyuan Li, Scott Shenker, and Ion Stoica, University of California, Berkeley
Many important “big data” applications need to process data arriving in real time. However, current programming models for distributed stream processing are relatively low-level, often leaving the user to worry about consistency of state across the system and fault recovery. Furthermore, the models that provide fault recovery do so in an expensive manner, requiring either hot replication or long recovery times. We propose a new programming model, discretized streams (D-Streams), that offers a high-level functional programming API, strong consistency, and efficient fault recovery. D-Streams support a new recovery mechanism that improves efficiency over the traditional replication and upstream backup solutions in streaming databases: parallel recovery of lost state across the cluster. We have prototyped D-Streams in an extension to the Spark cluster computing framework called Spark Streaming, which lets users seamlessly intermix streaming, batch and interactive queries.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Matei Zaharia and Tathagata Das and Haoyuan Li and Scott Shenker and Ion Stoica},
title = {Discretized Streams: An Efficient and {Fault-Tolerant} Model for Stream Processing on Large Clusters},
booktitle = {4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 12)},
year = {2012},
address = {Boston, MA},
url = {https://www.usenix.org/conference/hotcloud12/workshop-program/presentation/zaharia},
publisher = {USENIX Association},
month = jun
}
connect with us