Back to Search Start Over

Parallel attribute reduction algorithms using MapReduce

Authors :
Jin Qian
Zehua Zhang
Xiaodong Yue
Duoqian Miao
Source :
Information Sciences. 279:671-690
Publication Year :
2014
Publisher :
Elsevier BV, 2014.

Abstract

Attribute reduction is the key technique for knowledge acquisition in rough set theory. However, it is still a challenging task to perform attribute reduction on massive data. During the process of attribute reduction on massive data, the key to improving the reduction efficiency is the effective computation of equivalence classes and attribute significance. Aiming at this problem, we propose several parallel attribute reduction algorithms in this paper. Specifically, we design a novel structure of 〈 key , value 〉 pair to speed up the computation of equivalence classes and attribute significance and parallelize the traditional attribute reduction process based on MapReduce mechanism. The different parallelization strategies of attribute reduction are also compared and analyzed from the theoretic view. Abundant experimental results demonstrate the proposed parallel attribute reduction algorithms can perform efficiently and scale well on massive data.

Details

ISSN :
00200255
Volume :
279
Database :
OpenAIRE
Journal :
Information Sciences
Accession number :
edsair.doi...........2ec7d1da742fc7e3b716301d113c1ce4
Full Text :
https://doi.org/10.1016/j.ins.2014.04.019