1. Living on the Edge
- Author
-
Matthew Malensek, Shrideep Pallickara, Sangmi Lee Pallickara, and Thilina Buddhika
- Subjects
Edge device ,Computer Networks and Communications ,business.industry ,Data stream mining ,Computer science ,010401 analytical chemistry ,Real-time computing ,020206 networking & telecommunications ,02 engineering and technology ,Construct (python library) ,01 natural sciences ,0104 chemical sciences ,Computer Science Applications ,Temporal database ,Hardware and Architecture ,Analytics ,0202 electrical engineering, electronic engineering, information engineering ,Enhanced Data Rates for GSM Evolution ,business ,Software ,Edge computing ,Information Systems ,Data transmission - Abstract
Voluminous time-series data streams produced in continuous sensing environments impose challenges pertaining to ingestion, storage, and analytics. In this study, we present a holistic approach based on data sketching to address these issues. We propose a hyper-sketching algorithm that combines discretization and frequency-based sketching to produce compact representations of the multi-feature, time-series data streams. We generate an ensemble of data sketches to make effective use of capabilities at the resource-constrained edge devices, the links over which data are transmitted, and the server pool where this data must be stored. The data sketches can be queried to construct datasets that are amenable to processing using popular analytical engines. We include several performance benchmarks using real-world data from different domains to profile the suitability of our design decisions. The proposed methodology can achieve up to ∼ 13 × and ∼ 2, 207 × reduction in data transfer and energy consumption at edge devices. We observe up to a ∼ 50% improvement in analytical job completion times in addition to the significant improvements in disk and network I/O.
- Published
- 2021
- Full Text
- View/download PDF