A Classification Algorithm of CART Decision Tree based on MapReduce Attribute Weights.
- Source :
- International Journal of Performability Engineering; Jan 2018, Vol. 14 Issue 1, p17-25, 9p
- Publication Year :
- 2018
Abstract
- This paper proposes a CART decision tree algorithm based on attribute weights to address the complex classification, poor accuracy, low efficiency, and heavy memory consumption of existing CART decision trees. The algorithm is combined with the MapReduce parallel computing model and applies the theory of attribute weights: the decision tree is built from the sum of weights, where each weight is determined by the degree to which the attribute affects the decision. This improves the accuracy of decision tree classification. Parallel sorting algorithms of the CART decision tree for massive data are implemented through the MapReduce programming technology of cloud computing. Theoretical analysis and experimental comparison both show that weighting attributes through MapReduce is important. For large sample data sets, classification accuracy and efficiency improve significantly, and training time is also significantly reduced. [ABSTRACT FROM AUTHOR]
- Subjects :
- ALGORITHMS
DECISION trees
CLOUD computing
BIG data
DATA analysis
Details
- Language :
- English
- ISSN :
- 0973-1318
- Volume :
- 14
- Issue :
- 1
- Database :
- Supplemental Index
- Journal :
- International Journal of Performability Engineering
- Publication Type :
- Academic Journal
- Accession number :
- 131646343
- Full Text :
- https://doi.org/10.23940/ijpe.18.01.p3.1725
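
The abstract describes choosing CART splits with attribute weights and computing them in parallel with MapReduce, but it gives no formulas. The following is only a minimal sketch of that general idea, not the authors' algorithm. It assumes a hypothetical per-attribute weight that scales the ordinary Gini gain, and it imitates the map and reduce phases in a single process with Python's built-in `map` and `functools.reduce`. All function names and the toy data are illustrative assumptions.

```python
from collections import Counter
from functools import reduce


def map_partition(rows):
    """Map step: count (attribute index, value, label) triples in one partition.

    Each row is a tuple of categorical features followed by the class label.
    """
    counts = Counter()
    for *features, label in rows:
        for i, value in enumerate(features):
            counts[(i, value, label)] += 1
    return counts


def reduce_counts(a, b):
    """Reduce step: merge the partial counts of two partitions."""
    return a + b


def gini(label_counts):
    """Gini impurity of a class-count mapping (0.0 for an empty node)."""
    total = sum(label_counts.values())
    if total == 0:
        return 0.0
    return 1.0 - sum((n / total) ** 2 for n in label_counts.values())


def best_weighted_split(partitions, weights):
    """Return (score, attribute index, value) of the best binary split.

    score = weights[i] * Gini gain of the split "attribute i == value".
    The multiplicative weighting is an assumption made for illustration;
    the abstract does not state how the weights enter the criterion.
    """
    counts = reduce(reduce_counts, map(map_partition, partitions), Counter())

    # Every row contributes exactly one value for attribute 0,
    # so its counts give the overall class totals.
    totals = Counter()
    for (i, _value, label), n in counts.items():
        if i == 0:
            totals[label] += n
    n_rows = sum(totals.values())
    parent = gini(totals)

    best = None
    for i, value in sorted({(i, v) for (i, v, _label) in counts}):
        left = {lab: counts[(i, value, lab)] for lab in totals}
        right = {lab: totals[lab] - left[lab] for lab in totals}
        n_left = sum(left.values())
        child = (n_left * gini(left) + (n_rows - n_left) * gini(right)) / n_rows
        score = weights[i] * (parent - child)
        if best is None or score > best[0]:
            best = (score, i, value)
    return best


if __name__ == "__main__":
    # Toy data split into two partitions: (outlook, windy, play).
    partitions = [
        [("sunny", "yes", "no"), ("sunny", "no", "no")],
        [("rain", "yes", "yes"), ("rain", "no", "yes")],
    ]
    print(best_weighted_split(partitions, weights=[1.0, 1.0]))
```

On this toy data the first attribute separates the classes perfectly, so it is selected when both weights are equal. In a real MapReduce job the map step would run on separate workers and only the small count tables would be shuffled to the reducer, which is the property that makes split evaluation on massive data parallelizable.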