Back to Search Start Over

GOLFS: feature selection via combining both global and local information for high dimensional clustering.

Authors :
Xing, Zhaoyu
Wan, Yang
Wen, Juan
Zhong, Wei
Source :
Computational Statistics. Jul2024, Vol. 39 Issue 5, p2651-2675. 25p.
Publication Year :
2024

Abstract

It is important to identify the discriminative features for high dimensional clustering. However, due to the lack of cluster labels, the regularization methods developed for supervised feature selection can not be directly applied. To learn the pseudo labels and select the discriminative features simultaneously, we propose a new unsupervised feature selection method, named GlObal and Local information combined Feature Selection (GOLFS), for high dimensional clustering problems. The GOLFS algorithm combines both local geometric structure via manifold learning and global correlation structure of samples via regularized self-representation to select the discriminative features. The combination improves the accuracy of both feature selection and clustering by exploiting more comprehensive information. In addition, an iterative algorithm is proposed to solve the optimization problem and the convergency is proved. Simulations and two real data applications demonstrate the excellent finite-sample performance of GOLFS on both feature selection and clustering. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09434062
Volume :
39
Issue :
5
Database :
Academic Search Index
Journal :
Computational Statistics
Publication Type :
Academic Journal
Accession number :
177897136
Full Text :
https://doi.org/10.1007/s00180-023-01393-x