Back to Search Start Over

A data lake-based security transmission and storage scheme for streaming big data.

Authors :
Zhao, Xiaoyan
Zhang, Conghui
Guan, Shaopeng
Source :
Cluster Computing. Jul2024, Vol. 27 Issue 4, p4741-4755. 15p.
Publication Year :
2024

Abstract

Streaming big data presents unique security challenges due to its real-time generation and distributed transmission methods, making it vulnerable to security threats such as data leakage and tampering. To address these challenges, we propose a secure transmission and storage scheme for streaming big data based on a data lake architecture. Our approach leverages an ECC lightweight encryption algorithm in a streaming data encryption interceptor to filter and encrypt key information at the data source. We also introduce the SSL secure transmission protocol to ensure secure data transmission over a multi-source streaming data transmission channel constructed using Flume and Kafka. Furthermore, we design a data partition scheme based on LZO compression and implement a data lake storage system using Hadoop. Our proposed scheme can efficiently and securely transfer streaming data from multiple data sources to the data lake, while providing high data query efficiency. Experimental results show that our stream data encryption interceptor reduces memory load by 18% and filters and encrypts key information at a faster speed compared to similar schemes. In addition, our data lake storage scheme demonstrates lower data write latency and space occupancy, making it well-suited for large-scale streaming data applications. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13867857
Volume :
27
Issue :
4
Database :
Academic Search Index
Journal :
Cluster Computing
Publication Type :
Academic Journal
Accession number :
178805398
Full Text :
https://doi.org/10.1007/s10586-023-04201-9