Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.
- Source :
- Journal of biomedical informatics [J Biomed Inform] 2008 Aug; Vol. 41 (4), pp. 602-6. Date of Electronic Publication: 2007 Dec 23.
- Publication Year :
- 2008
Abstract
- In microarray data analysis, each gene expression sample has thousands of genes and reducing such high dimensionality is useful for both visualization and further clustering of samples. Traditional principal component analysis (PCA) is a commonly used method which has problems. Nonnegative Matrix Factorization (NMF) is a new dimension reduction method. In this paper we compare NMF and PCA for dimension reduction. The reduced data is used for visualization, and clustering analysis via k-means on 11 real gene expression datasets. Before the clustering analysis, we apply NMF and PCA for reduction in visualization. The results on one leukemia dataset show that NMF can discover natural clusters and clearly detect one mislabeled sample while PCA cannot. For clustering analysis via k-means, NMF most typically outperforms PCA. Our results demonstrate the superiority of NMF over PCA in reducing microarray data.
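The comparison described in the abstract (reduce with NMF or PCA, then cluster the reduced samples with k-means) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the record does not give their NMF algorithm, rank, preprocessing, or datasets, so the multiplicative-update NMF, the synthetic nonnegative matrix, the choice of two components, and the helper names `nmf_reduce`, `pca_reduce`, and `kmeans` are all assumptions made here for illustration.

```python
import numpy as np

def nmf_reduce(X, k, n_iter=500, seed=0, eps=1e-9):
    """Approximate nonnegative X (samples x genes) as W @ H using
    Lee-Seung multiplicative updates (an assumed choice of algorithm).
    Each row of W is a k-dimensional representation of one sample."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        # Updates keep W and H nonnegative as long as X is nonnegative.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

def pca_reduce(X, k):
    """Project mean-centered samples onto the top k principal components."""
    Xc = X - X.mean(axis=0)
    U, S, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :k] * S[:k]

def kmeans(Z, k, n_iter=100, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per row of Z."""
    rng = np.random.default_rng(seed)
    centers = Z[rng.choice(len(Z), size=k, replace=False)]
    for _ in range(n_iter):
        dists = ((Z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            Z[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

# Synthetic stand-in for an expression matrix: 20 samples x 200 genes,
# with two sample groups that over-express different gene blocks.
rng = np.random.default_rng(1)
X = rng.random((20, 200))
X[:10, :50] += 3.0
X[10:, 50:100] += 3.0

W, H = nmf_reduce(X, k=2)        # NMF-reduced samples are the rows of W
Z_pca = pca_reduce(X, k=2)       # PCA-reduced samples
labels_nmf = kmeans(W, k=2)
labels_pca = kmeans(Z_pca, k=2)
```

Either reduced matrix (`W` or `Z_pca`) can also be plotted in two dimensions for the kind of visual inspection the abstract mentions. On real data, cluster labels would be compared against known sample classes to judge which reduction works better.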
Details
- Language :
- English
- ISSN :
- 1532-0480
- Volume :
- 41
- Issue :
- 4
- Database :
- MEDLINE
- Journal :
- Journal of biomedical informatics
- Publication Type :
- Academic Journal
- Accession number :
- 18234564
- Full Text :
- https://doi.org/10.1016/j.jbi.2007.12.003