Back to Search Start Over

Minimum Backups for Stream Processing With Recovery Latency Guarantees.

Authors :
Li, Hongliang
Li, Xiang
Wei, Xiaohui
Wu, Jie
Jiang, Zhen
Source :
IEEE Transactions on Reliability; Sep2017, Vol. 66 Issue 3, p783-794, 12p
Publication Year :
2017

Abstract

The stream processing model continuously processes online data in an on-pass fashion that can be more vulnerable to failures than other big-data processing schemes. Existing fault-tolerant (FT) approaches have been presented to enhance the reliability of stream processing systems. However, the fundamental tradeoff between recovery latency and FT overhead is still unclear, so these scheme cannot provide recovery latency guarantees. This paper introduces the FT Configuration (FTC) problem and presents a solution for guaranteed recovery latency with minimum backups. A failure effect model is presented to describe the relationship between recovery latency and FTC (the amount and locations of backups). With this model, we design an algorithm to compute FTCs for different types of stream topologies according to recovery latency requirements. Extensive experiments are conducted to verify the correctness and effectiveness of our approach. We prove that our algorithm guarantees recovery latencies for all directed acyclic graph (DAG) stream topologies. For line(s) and tree topologies, our algorithm solves the FTC problem with a time complexity of O(N) . For a general DAG topology, a heuristic function is used to generate FTCs. This causes fewer than 10% more backups on average compared to the optimal solution with a time complexity of $O(N^2)$. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISSN :
00189529
Volume :
66
Issue :
3
Database :
Complementary Index
Journal :
IEEE Transactions on Reliability
Publication Type :
Academic Journal
Accession number :
124984470
Full Text :
https://doi.org/10.1109/TR.2017.2712563