Back to Search Start Over

Feature subset selection using heuristic and metaheuristic approaches for diabetes prediction on a binary encoded dataset.

Authors :
Kumar, Puneet
Bhati, Bhoopesh Singh
Dhanaraj, Rajesh Kumar
Iwendi, Celestine
Balusamy, Balamurugan
Bhati, Nitesh Singh
Rai, Prerana
Source :
International Journal of Modeling, Simulation & Scientific Computing; Jun2024, Vol. 15 Issue 3, p1-24, 24p
Publication Year :
2024

Abstract

The Machine Learning (ML) models are prone to a curse of dimensionality. The dataset with a greater number of features involves more computational cost and it may lead to low performance in the context of prediction accuracy. Therefore, in this research work we have predicted diabetes with more accuracy by using a smaller number of features. The heuristic methods Sequential Forward Selection (SFS), Sequential Backward Selection (SBS) and metaheuristic evolutionary methods — Whale Optimization Algorithm (WOA) and Genetic Algorithm (GA) are used for performing feature subset selection. The Gini index is also used as a filter evaluator. The performance of the feature subsets is analyzed by applying three different types of ML models, Random Forest (RF), Multi-Layer Perceptron (MLP) and K-Nearest Neighbor (KNN). We have predicted type-2 diabetes with an accuracy of 96.82%. Also, we have reduced the number of features up to 67.44% i.e., identified 32.56% most relevant features. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
17939623
Volume :
15
Issue :
3
Database :
Complementary Index
Journal :
International Journal of Modeling, Simulation & Scientific Computing
Publication Type :
Academic Journal
Accession number :
178738592
Full Text :
https://doi.org/10.1142/S1793962324500314