Back to Search Start Over

A Classification Algorithm of CART Decision Tree based on MapReduce Attribute Weights.

Authors :
Fubao Zhu
Mengmeng Tang
Lijie Xie
Haodong Zhu
Source :
International Journal of Performability Engineering; Jan2018, Vol. 14 Issue 1, p17-25, 9p
Publication Year :
2018

Abstract

A CART decision tree algorithm based on attribute weight is proposed in this paper because of the present problems of complex classification, poor accuracy, low efficiency, and severe memory consumption of CART decision. What is more, the algorithm is combined with the parallel computing model of MapReduce. Theory of attribute weights is used in the algorithm. A decision tree is built through the sum of weights, which is decided by the degree that the attributes affect a decision. Thus the accuracy of classification through decision tree is improved. Parallel sorting algorithms of CART decision tree for massive data is implemented through the MapReduce programming technology of cloud computing. All the results of theoretical analysis and experimental comparison show that it is very important to mark attributes by weights through MapReduce. Furthermore, the accuracy of the classification of large sample data sets is improved significantly, classification efficiency of decision tree is improved and the trained time is also significantly reduced. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09731318
Volume :
14
Issue :
1
Database :
Supplemental Index
Journal :
International Journal of Performability Engineering
Publication Type :
Academic Journal
Accession number :
131646343
Full Text :
https://doi.org/10.23940/ijpe.18.01.p3.1725