Back to Search
Start Over
KNCFS: Feature selection for high-dimensional datasets based on improved random multi-subspace learning.
- Source :
- PLoS ONE, Vol 19, Iss 2, p e0296108 (2024)
- Publication Year :
- 2024
- Publisher :
- Public Library of Science (PLoS), 2024.
-
Abstract
- Feature selection has long been a focal point of research in various fields.Recent studies have focused on the application of random multi-subspaces methods to extract more information from raw samples.However,this approach inadequately addresses the adverse effects that may arise due to feature collinearity in high-dimensional datasets.To further address the limited ability of traditional algorithms to extract useful information from raw samples while considering the challenge of feature collinearity during the random subspaces learning process, we employ a clustering approach based on correlation measures to group features.Subsequently, we construct subspaces with lower inter-feature correlations.When integrating feature weights obtained from all feature spaces,we introduce a weighting factor to better handle the contributions from different feature spaces.We comprehensively evaluate our proposed algorithm on ten real datasets and four synthetic datasets,comparing it with six other feature selection algorithms.Experimental results demonstrate that our algorithm,denoted as KNCFS,effectively identifies relevant features,exhibiting robust feature selection performance,particularly suited for addressing feature selection challenges in practice.
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 19
- Issue :
- 2
- Database :
- Directory of Open Access Journals
- Journal :
- PLoS ONE
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.7e5be7cce57d4129b3ef7d1b3310d728
- Document Type :
- article
- Full Text :
- https://doi.org/10.1371/journal.pone.0296108&type=printable