Back to Search Start Over

Enhanced query performance for stored streaming data through structured streaming within spark SQL.

Authors :
Jose, Benymol
N., Rajesh
Joseph, Lumy
Source :
Indonesian Journal of Electrical Engineering & Computer Science; Sep2024, Vol. 35 Issue 3, p1744-1750, 7p
Publication Year :
2024

Abstract

Traditional database systems like relational databases can store data which are structured with predefined schema, but in the case of bigdata, the data comes in different formats or are collected from diverse sources. The distributed databases like not only spark querying language (NoSQL) repositories are often used in relation to bigdata analytics, but a continual updating is required in business because of the streaming data that comes from stock trading, online activities of website visitors, and from the mobile applications in real time. It will not have to delay, for some report to show up, to assess and analyse the current situation, to move forward with the next business choice. Apache Spark’s structured streaming offer capabilities for handling streaming data in a batch processing mode with faster responses compared to MongoDB which is a document-based NoSQL database. This study completes similar queries to evaluate Spark SQL and NoSQL database performance, focusing on the upsides of Spark SQL over NoSQL databases in streaming data exploration. The queries are completed with streaming data stored in a batch mode. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
25024752
Volume :
35
Issue :
3
Database :
Complementary Index
Journal :
Indonesian Journal of Electrical Engineering & Computer Science
Publication Type :
Academic Journal
Accession number :
179115019
Full Text :
https://doi.org/10.11591/ijeecs.v35.i3.pp1744-1750