Back to Search
Start Over
Feature selection using symmetric uncertainty and hybrid optimization for high-dimensional data.
- Source :
- International Journal of Machine Learning & Cybernetics; Dec2023, Vol. 14 Issue 12, p4339-4360, 22p
- Publication Year :
- 2023
-
Abstract
- Recently, when handling high-dimensional data, it has become extremely difficult to search this optimal subset of selected features due to the restriction of reducing the exponential increase of the search procedure, and most of those feature selection models neglect the interactions of features or feature and decision class. This paper develops a novel feature selection approach using symmetric uncertainty and hybrid optimization for high-dimensional data (FSUHO) for high-dimensional data. First, to fully reflect the interaction relationship of features or feature and decision class, the F-relevance between features and the C-correlation between feature and decision class based on the symmetric uncertainty are constructed to remove those redundant features. Then, a strong correlation threshold is improved based on the C-correlation and random coefficient to prevent the removal of the effective features in this first stage. Second, to decrease this expensive computational consumption, one criterion for judging a weakly correlated feature is designed to sort all features, and another criterion is developed to select the class center. The similarity between features and class centers is calculated, and similar features are clustered into one class. Then, the symmetric uncertainty correlation-based feature clustering model can be constructed in this second stage. In the third stage, a hybrid optimization approach of particle swarm optimizer (PSO) and wild horse optimizer (WHO) for feature selection is proposed, where the association-guided group initialization probability with a multiobjective optimized particle selection scheme is defined as a criterion for the PSO in selecting stallion particles for the WHO, and the improved WHO is developed by integrating the nonlinear inertial weight factor and the Brownian motion operator to obtain the optimal subset of selected features. Finally, a novel three-stage feature selection algorithm is developed. Experimental results apply to 16 datasets prove the efficiency of FSUHO in tackling high-dimensional feature selection problems in metrics of classification accuracy and running time. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 18688071
- Volume :
- 14
- Issue :
- 12
- Database :
- Complementary Index
- Journal :
- International Journal of Machine Learning & Cybernetics
- Publication Type :
- Academic Journal
- Accession number :
- 172971724
- Full Text :
- https://doi.org/10.1007/s13042-023-01897-4