Back to Search Start Over

Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.

Authors :
Liu W
Yuan K
Ye D
Source :
Journal of biomedical informatics [J Biomed Inform] 2008 Aug; Vol. 41 (4), pp. 602-6. Date of Electronic Publication: 2007 Dec 23.
Publication Year :
2008

Abstract

In microarray data analysis, each gene expression sample has thousands of genes and reducing such high dimensionality is useful for both visualization and further clustering of samples. Traditional principal component analysis (PCA) is a commonly used method which has problems. Nonnegative Matrix Factorization (NMF) is a new dimension reduction method. In this paper we compare NMF and PCA for dimension reduction. The reduced data is used for visualization, and clustering analysis via k-means on 11 real gene expression datasets. Before the clustering analysis, we apply NMF and PCA for reduction in visualization. The results on one leukemia dataset show that NMF can discover natural clusters and clearly detect one mislabeled sample while PCA cannot. For clustering analysis via k-means, NMF most typically outperforms PCA. Our results demonstrate the superiority of NMF over PCA in reducing microarray data.

Details

Language :
English
ISSN :
1532-0480
Volume :
41
Issue :
4
Database :
MEDLINE
Journal :
Journal of biomedical informatics
Publication Type :
Academic Journal
Accession number :
18234564
Full Text :
https://doi.org/10.1016/j.jbi.2007.12.003