Back to Search Start Over

Statistical analysis of various splitting criteria for decision trees.

Authors :
Aaboub, Fadwa
Chamlal, Hasna
Ouaderhman, Tayeb
Source :
Journal of Algorithms & Computational Technology; Jan-Dec2023, Vol. 17, p1-13, 13p
Publication Year :
2023

Abstract

Decision trees are frequently used to overcome classification problems in the fields of data mining and machine learning, owing to their many perks, including their clear and simple architecture, excellent quality, and resilience. Various decision tree algorithms are developed using a variety of attribute selection criteria, following the top-down partitioning strategy. However, their effectiveness is influenced by the choice of the splitting method. Therefore, in this work, six decision tree algorithms that are based on six different attribute evaluation metrics are gathered in order to compare their performances. The choice of the decision trees that will be compared is done based on four different categories of the splitting criteria that are criteria based on information theory, criteria based on distance, statistical-based criteria, and other splitting criteria. These approaches include iterative dichotomizer 3 (first category), C4.5 (first category), classification and regression trees (second category), Pearson's correlation coefficient based decision tree (third category), dispersion ratio (third category), and feature weight based decision tree algorithm (last category). On eleven data sets, the six procedures are assessed in terms of classification accuracy, tree depth, leaf nodes, and tree construction time. Furthermore, the Friedman and post hoc Nemenyi tests are used to examine the results that were obtained. The results of these two tests indicate that the iterative dichotomizer 3 and classification and regression trees decision tree methods perform better than the other decision tree methodologies. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
17483018
Volume :
17
Database :
Complementary Index
Journal :
Journal of Algorithms & Computational Technology
Publication Type :
Academic Journal
Accession number :
175325117
Full Text :
https://doi.org/10.1177/17483026231198181