Back to Search
Start Over
GOLFS: feature selection via combining both global and local information for high dimensional clustering.
- Source :
-
Computational Statistics . Jul2024, Vol. 39 Issue 5, p2651-2675. 25p. - Publication Year :
- 2024
-
Abstract
- It is important to identify the discriminative features for high dimensional clustering. However, due to the lack of cluster labels, the regularization methods developed for supervised feature selection can not be directly applied. To learn the pseudo labels and select the discriminative features simultaneously, we propose a new unsupervised feature selection method, named GlObal and Local information combined Feature Selection (GOLFS), for high dimensional clustering problems. The GOLFS algorithm combines both local geometric structure via manifold learning and global correlation structure of samples via regularized self-representation to select the discriminative features. The combination improves the accuracy of both feature selection and clustering by exploiting more comprehensive information. In addition, an iterative algorithm is proposed to solve the optimization problem and the convergency is proved. Simulations and two real data applications demonstrate the excellent finite-sample performance of GOLFS on both feature selection and clustering. [ABSTRACT FROM AUTHOR]
- Subjects :
- *FEATURE selection
*GOLF
*GLOBAL method of teaching
Subjects
Details
- Language :
- English
- ISSN :
- 09434062
- Volume :
- 39
- Issue :
- 5
- Database :
- Academic Search Index
- Journal :
- Computational Statistics
- Publication Type :
- Academic Journal
- Accession number :
- 177897136
- Full Text :
- https://doi.org/10.1007/s00180-023-01393-x