Back to Search
Start Over
Generalized communication cost efficient multi-way spatial join: revisiting the curse of the last reducer.
- Source :
-
GeoInformatica . Jul2020, Vol. 24 Issue 3, p557-589. 33p. - Publication Year :
- 2020
-
Abstract
- With the huge increase in usage of smart mobiles, social media and sensors, large volumes of location-based data is available. Location based data carries important signals pertaining to user intensive information as well as population characteristics. The key analytical tool for location based analysis is multi-way spatial join. Unlike the conventional join strategies, multi-way join using map-reduce offers a scalable, distributed computational paradigm and efficient implementation through communication cost reduction strategies. Controlled Replicate (C-Rep) is a useful strategy used in the literature to perform the multi-way spatial join efficiently. Though C-Rep performance is superior compared to naive sequential join, careful analysis of its performance reveals that such a strategy is plagued by the curse of the last reducer, wherein the skew inherently present in the datasets and the skew introduced by replication operation, causes some of the reducers to take much longer time compared to others. In this work, we design an algorithm GEMS (G eneralized Communication cost E fficient M ulti-Way S patial Join) to address the skewness inherent in the connectivity of spatial objects while performing a multi-way join. We analysed all the algorithms concerned, in terms of I/O and communication costs. We prove that the communication cost of GEMS approach is better than that of C-Rep by a factor O(α) where α is the number of reducers in a single row/column of a grid of reducers. Our experimental results on different datasets indicate that GEMS approach is three times superior(in terms of turn around time) compared to C-Rep. [ABSTRACT FROM AUTHOR]
- Subjects :
- *COST control
*LOCATION analysis
*DATABASES
*SOCIAL media
Subjects
Details
- Language :
- English
- ISSN :
- 13846175
- Volume :
- 24
- Issue :
- 3
- Database :
- Academic Search Index
- Journal :
- GeoInformatica
- Publication Type :
- Academic Journal
- Accession number :
- 143875702
- Full Text :
- https://doi.org/10.1007/s10707-019-00387-6