1. Robust Principal Component Analysis Based On Hypergraph Regularization for Sample Clustering and Co-Characteristic Gene Selection
- Author
-
Ying-Lian Gao, Juan Wang, Ming-Juan Wu, Chun-Hou Zheng, and Jin-Xing Liu
- Subjects
Principal Component Analysis ,Hypergraph ,Biological data ,Computer science ,business.industry ,Applied Mathematics ,Computational Biology ,Feature selection ,Pattern recognition ,ComputingMethodologies_PATTERNRECOGNITION ,Robustness (computer science) ,Neoplasms ,Principal component analysis ,Outlier ,Genetics ,Cluster Analysis ,Humans ,Artificial intelligence ,Cluster analysis ,business ,Robust principal component analysis ,Algorithms ,Biotechnology - Abstract
Extracting genes involved in cancer lesions from gene expression data is critical for cancer research and drug development. the method of feature selection has attracted much attention in the field of bioinformatics. Principal Component Analysis (PCA) is a widely used method for learning low-dimensional representation. Some variants of PCA have been proposed to improve the robustness and sparsity of the algorithm. However, the existing methods ignore the high-order relationships between data. In this paper, a new model named Robust Principal Component Analysis via Hypergraph Regularization (HRPCA) is proposed. In detail, HRPCA utilizes L2,1-norm to reduce the effect of outliers and make data sufficiently row-sparse. And the Hypergraph Regularization is introduced to consider the complex relationship between data. Important information hidden in the data are mined, and this method ensures the accuracy of the resulting data relationship information. Extensive experiments on multi-view biological data demonstrate that the feasible and effective of the proposed approach.
- Published
- 2022