Distributed Storage Algorithm for Geospatial Image Data Based on Data Access Patterns.

Authors :
Pan, Shaoming
Li, Yongkai
Xu, Zhengquan
Chong, Yanwen
Source :
PLoS ONE; 7/16/2015, Vol. 10 Issue 7, p1-22, 22p
Publication Year :
2015

Abstract

Declustering techniques are widely used in distributed environments to reduce query response time through parallel I/O by splitting large files into several small blocks and then distributing those blocks among multiple storage nodes. However, many small geospatial image data files cannot be further split for distributed storage. In this paper, we propose a complete theoretical system for the distributed storage of small geospatial image data files based on mining the access patterns of geospatial image data from their historical access logs. First, an algorithm is developed to construct an access correlation matrix from an analysis of the log information, which reveals the patterns of access to the geospatial image data. Then, a practical heuristic algorithm is developed to determine a reasonable storage solution based on the access correlation matrix. Finally, a number of comparative experiments are presented, demonstrating that our algorithm achieves a total parallel access probability approximately 10–15% higher than that of other algorithms and that the performance can be further improved by more than 20% by simultaneously applying a copy storage strategy. These experiments show that the algorithm can be applied in distributed environments to help realize parallel I/O and thereby improve system performance. [ABSTRACT FROM AUTHOR]
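
The two steps named in the abstract (building an access correlation matrix from logs, then placing files heuristically) can be illustrated with a minimal Python sketch. This is not the authors' algorithm, whose details the abstract does not give. It assumes that correlation is a simple count of how often two files appear in the same log session, and that placement is a greedy rule that puts each file on the node where it is least correlated with the files already stored there. The names `build_correlation`, `pair_weight`, and `greedy_place`, and the sample sessions, are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_correlation(sessions):
    """Count how often each pair of files appears in the same log session.

    Assumption: co-occurrence within a session stands in for the paper's
    access correlation; the real construction may differ.
    """
    corr = defaultdict(int)
    for session in sessions:
        for a, b in combinations(sorted(set(session)), 2):
            corr[(a, b)] += 1
    return corr


def pair_weight(corr, a, b):
    """Look up the symmetric correlation between two files (0 if never seen together)."""
    return corr.get((a, b) if a < b else (b, a), 0)


def greedy_place(files, corr, num_nodes):
    """Assign each file to the node where it conflicts least with files already there.

    Assumption: a plain greedy heuristic, with ties broken by node load
    and then by node index.
    """
    nodes = [[] for _ in range(num_nodes)]
    # Place the most heavily correlated files first.
    total = {f: sum(pair_weight(corr, f, g) for g in files if g != f) for f in files}
    for f in sorted(files, key=lambda x: (-total[x], x)):
        def cost(i):
            conflict = sum(pair_weight(corr, f, g) for g in nodes[i])
            return (conflict, len(nodes[i]), i)
        best = min(range(num_nodes), key=cost)
        nodes[best].append(f)
    return nodes


# Hypothetical access log: each inner list is one session's requested files.
sessions = [["a", "b"], ["a", "b"], ["c", "d"], ["a", "c"]]
files = ["a", "b", "c", "d"]
corr = build_correlation(sessions)
placement = greedy_place(files, corr, 2)
print(placement)
```

In this toy log, files that are requested together end up on different nodes, so a session that needs both could in principle read them in parallel.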

Details

Language :
English
ISSN :
19326203
Volume :
10
Issue :
7
Database :
Complementary Index
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
108664989
Full Text :
https://doi.org/10.1371/journal.pone.0133029