RDMA-Based Apache Storm for High-Performance Stream Data Processing.

Authors :
Zhang, Ziyu
Liu, Zitan
Jiang, Qingcai
Chen, Junshi
An, Hong
Source :
International Journal of Parallel Programming. Oct 2021, Vol. 49 Issue 5, p671-684. 14p.
Publication Year :
2021

Abstract

Apache Storm is a scalable, fault-tolerant, distributed real-time stream-processing framework widely used in big data applications. For distributed data-sensitive applications, low-latency, high-throughput communication modules have a critical impact on overall system performance. Apache Storm currently uses Netty, an asynchronous server/client framework based on the TCP/IP protocol stack, as its communication component. The TCP/IP protocol stack has inherent performance flaws due to frequent memory copying and context switching. The Netty component not only limits the performance of Storm but also increases the CPU load in the IPoIB (IP over InfiniBand) communication mode. In this paper, we introduce two new implementations of the Apache Storm communication component based on RDMA technology. The performance evaluation on Mellanox QDR cards (40 Gbps) shows that our implementations can achieve speedups of up to 5× compared with IPoIB and 10× compared with Gigabit Ethernet. Our implementations also significantly reduce the CPU load and increase the throughput of the system. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0885-7458
Volume :
49
Issue :
5
Database :
Academic Search Index
Journal :
International Journal of Parallel Programming
Publication Type :
Academic Journal
Accession number :
152058419
Full Text :
https://doi.org/10.1007/s10766-021-00696-0