Back to Search Start Over

A New Approach for Large-Scale Scene Image Retrieval Based on Improved Parallel k-Means Algorithm in MapReduce Environment

Authors :
Hao Shi
Yun Tian
Min Wang
Guohua Hu
Jianfang Cao
Source :
Mathematical Problems in Engineering, Vol 2016 (2016)
Publication Year :
2016
Publisher :
Hindawi Limited, 2016.

Abstract

The rapid growth of digital images has caused the traditional image retrieval technology to be faced with new challenge. In this paper we introduce a new approach for large-scale scene image retrieval to solve the problems of massive image processing using traditional image retrieval methods. First, we improved traditionalk-Means clustering algorithm, which optimized the selection of the initial cluster centers and iteration procedure. Second, we presented a parallel design and realization method for improvedk-Means algorithm applied it to feature clustering of scene images. Finally, a storage and retrieval scheme for large-scale scene images was put forward using the large storage capacity and powerful parallel computing ability of the Hadoop distributed platform. The experimental results demonstrated that the proposed method achieved good performance. Compared with the traditional algorithms with single node architecture and parallelk-Means algorithm, the proposed method has obvious advantages for use in large-scale scene image data retrieval in terms of retrieval accuracy, retrieval time overhead, and computational performance (speedup and efficiency, sizeup, and scaleup), which is a significant improvement from applying parallel processing to intelligent algorithms with large-scale datasets.

Details

Language :
English
ISSN :
15635147
Volume :
2016
Database :
OpenAIRE
Journal :
Mathematical Problems in Engineering
Accession number :
edsair.doi.dedup.....5895e4ad9e9f8488e3f474476cd82007