Feature selection using symmetric uncertainty and hybrid optimization for high-dimensional data.

Authors :
Sun, Lin
Sun, Shujing
Ding, Weiping
Huang, Xinyue
Fan, Peiyi
Li, Kunyu
Chen, Leqi
Source :
International Journal of Machine Learning & Cybernetics; Dec2023, Vol. 14 Issue 12, p4339-4360, 22p
Publication Year :
2023

Abstract

Recently, when handling high-dimensional data, it has become extremely difficult to search for the optimal subset of selected features because of the exponential growth of the search procedure, and most feature selection models neglect the interactions between features or between features and the decision class. This paper develops a novel feature selection approach using symmetric uncertainty and hybrid optimization (FSUHO) for high-dimensional data. First, to fully reflect the interactions between features or between features and the decision class, the F-relevance between features and the C-correlation between a feature and the decision class are constructed based on symmetric uncertainty to remove redundant features. Then, a strong correlation threshold is improved based on the C-correlation and a random coefficient to prevent the removal of effective features in this first stage. Second, to reduce the expensive computational cost, one criterion for judging a weakly correlated feature is designed to sort all features, and another criterion is developed to select the class centers. The similarity between features and class centers is calculated, and similar features are clustered into one class. Thus, the symmetric uncertainty correlation-based feature clustering model is constructed in this second stage. In the third stage, a hybrid optimization approach combining the particle swarm optimizer (PSO) and the wild horse optimizer (WHO) for feature selection is proposed, where the association-guided group initialization probability with a multiobjective optimized particle selection scheme is defined as a criterion for the PSO in selecting stallion particles for the WHO, and the improved WHO is developed by integrating the nonlinear inertial weight factor and the Brownian motion operator to obtain the optimal subset of selected features. Finally, a novel three-stage feature selection algorithm is developed. Experimental results on 16 datasets demonstrate the efficiency of FSUHO in tackling high-dimensional feature selection problems in terms of classification accuracy and running time. [ABSTRACT FROM AUTHOR]
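
The abstract does not spell out how the F-relevance and C-correlation are computed. As a rough illustration only, the sketch below uses the standard definition of symmetric uncertainty for discrete variables, SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)), and applies it to a feature and the class label (a C-correlation-style score) and to two features (an F-relevance-style score). The function names, the toy data, and the mapping of these scores onto the paper's terms are assumptions, not the authors' implementation.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def symmetric_uncertainty(x, y):
    """Standard symmetric uncertainty of two discrete sequences, in [0, 1].

    SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), where the mutual information
    is computed as I(X; Y) = H(X) + H(Y) - H(X, Y).
    """
    h_x = entropy(x)
    h_y = entropy(y)
    if h_x + h_y == 0:
        # Both variables are constant; returning 0 is a convention chosen here.
        return 0.0
    h_xy = entropy(list(zip(x, y)))
    mutual_info = h_x + h_y - h_xy
    return 2.0 * mutual_info / (h_x + h_y)


# Hypothetical toy data: two discrete features and a binary class label.
f1 = [0, 0, 1, 1, 0, 1]
f2 = [1, 0, 1, 0, 1, 0]
label = [0, 0, 1, 1, 0, 1]

# Feature-to-class score (C-correlation-style, assumed mapping).
print("SU(f1, label):", symmetric_uncertainty(f1, label))
# Feature-to-feature score (F-relevance-style, assumed mapping).
print("SU(f1, f2):", symmetric_uncertainty(f1, f2))
```

A score near 1 means one variable almost determines the other, and a score near 0 means they are close to independent. Continuous features would need to be discretized first, a step this sketch leaves out.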

Details

Language :
English
ISSN :
1868-8071
Volume :
14
Issue :
12
Database :
Complementary Index
Journal :
International Journal of Machine Learning & Cybernetics
Publication Type :
Academic Journal
Accession number :
172971724
Full Text :
https://doi.org/10.1007/s13042-023-01897-4