Back to Search
Start Over
QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
- Source :
- Sensors, Volume 21, Issue 12, Sensors, Vol 21, Iss 4160, p 4160 (2021), Sensors (Basel, Switzerland)
- Publication Year :
- 2021
- Publisher :
- MDPI AG, 2021.
-
Abstract
- Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, mobility data are highly skewed and their arrival rates fluctuate. This nature poses an extra challenge on data stream processing systems, which are required in order to achieve pre-specified latency and accuracy goals. In this paper, we propose ApproxSSPS, which is a system for approximate processing of geo-referenced mobility data, at scale with quality of service guarantees. We focus on stateful aggregations (e.g., means, counts) and top-N queries. ApproxSSPS features a controller that interactively learns the latency statistics and calculates proper sampling rates to meet latency or/and accuracy targets. An overarching trait of ApproxSSPS is its ability to strike a plausible balance between latency and accuracy targets. We evaluate ApproxSSPS on Apache Spark Structured Streaming with real mobility data. We also compared ApproxSSPS against a state-of-the-art online adaptive processing system. Our extensive experiments prove that ApproxSSPS can fulfill latency and accuracy targets with varying sets of parameter configurations and load intensities (i.e., transient peaks in data loads versus slow arriving streams). Moreover, our results show that ApproxSSPS outperforms the baseline counterpart by significant magnitudes. In short, ApproxSSPS is a novel spatial data stream processing system that can deliver real accurate results in a timely manner, by dynamically specifying the limits on data samples.
- Subjects :
- sampling
Exploit
Computer science
Internet of Things
Real-time computing
TP1-1185
02 engineering and technology
Biochemistry
Article
Analytical Chemistry
Stream processing
Continuous querie
Stateful firewall
spatial data
Spark (mathematics)
0202 electrical engineering, electronic engineering, information engineering
continuous queries
Cities
Electrical and Electronic Engineering
Latency (engineering)
Instrumentation
Spatial analysis
mobility data
Apache Spark
approximate query processing
Data stream mining
Chemical technology
Quality of service
020206 networking & telecommunications
Citie
Atomic and Molecular Physics, and Optics
Algorithm
020201 artificial intelligence & image processing
Internet of Thing
Algorithms
Subjects
Details
- ISSN :
- 14248220
- Volume :
- 21
- Database :
- OpenAIRE
- Journal :
- Sensors
- Accession number :
- edsair.doi.dedup.....1225fe8192e992ab469cfcfb8831ee3e
- Full Text :
- https://doi.org/10.3390/s21124160