Back to Search Start Over

Comparative Analysis of Data Mining Models for Classification for Small Data Set

Authors :
Nakul Gupta
N P Singh
Source :
2018 IEEE 12th International Conference on Application of Information and Communication Technologies (AICT).
Publication Year :
2018
Publisher :
IEEE, 2018.

Abstract

The aim of this research paper is to identify best model to classify a data set with target variable as mode of delivery of pregnant women and a set of predictors (Mother’s weight, height, age, baby’s weight, and baby’s gender). The analysis is carried out using different algorithms of classification taking one at time which are part of Weka and R-Programming library. In addition, ensemble technique having different base classifiers is also applied to the same data set. The base classifiers used for ensemble techniques are Simple Cart, C4.5, Multi-Layer Perceptron (MLP), Radial Bias Function (RBF), and Reduced Error Pruning Tree (Reptree). The results with respect to performance of the new models are compared with the performance of logistic regression, discriminant function, and standalone decision tree algorithms based on values of accuracy, precision and recall [1]. It is concluded that data small or big should be subjected to many algorithms and their combinations using either hybrid or ensemble and other existing approaches, to get a more reliable output.

Details

Database :
OpenAIRE
Journal :
2018 IEEE 12th International Conference on Application of Information and Communication Technologies (AICT)
Accession number :
edsair.doi...........586a93017739aec0b4d1ab632f6cd5cc
Full Text :
https://doi.org/10.1109/icaict.2018.8747146