Back to Search
Start Over
Feature subset selection using heuristic and metaheuristic approaches for diabetes prediction on a binary encoded dataset.
- Source :
- International Journal of Modeling, Simulation & Scientific Computing; Jun2024, Vol. 15 Issue 3, p1-24, 24p
- Publication Year :
- 2024
-
Abstract
- The Machine Learning (ML) models are prone to a curse of dimensionality. The dataset with a greater number of features involves more computational cost and it may lead to low performance in the context of prediction accuracy. Therefore, in this research work we have predicted diabetes with more accuracy by using a smaller number of features. The heuristic methods Sequential Forward Selection (SFS), Sequential Backward Selection (SBS) and metaheuristic evolutionary methods — Whale Optimization Algorithm (WOA) and Genetic Algorithm (GA) are used for performing feature subset selection. The Gini index is also used as a filter evaluator. The performance of the feature subsets is analyzed by applying three different types of ML models, Random Forest (RF), Multi-Layer Perceptron (MLP) and K-Nearest Neighbor (KNN). We have predicted type-2 diabetes with an accuracy of 96.82%. Also, we have reduced the number of features up to 67.44% i.e., identified 32.56% most relevant features. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 17939623
- Volume :
- 15
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- International Journal of Modeling, Simulation & Scientific Computing
- Publication Type :
- Academic Journal
- Accession number :
- 178738592
- Full Text :
- https://doi.org/10.1142/S1793962324500314