Back to Search Start Over

Sequential projection pursuit principal component analysis--dealing with missing data associated with new -omics technologies.

Authors :
Webb-Robertson BJ
Matzke MM
Metz TO
McDermott JE
Walker H
Rodland KD
Pounds JG
Waters KM
Source :
BioTechniques [Biotechniques] 2013 Mar; Vol. 54 (3), pp. 165-8.
Publication Year :
2013

Abstract

Principal Component Analysis (PCA) is a common exploratory tool used to evaluate large complex data sets. The resulting lower-dimensional representations are often valuable for pattern visualization, clustering, or classification of the data. However, PCA cannot be applied directly to many -omics data sets generated by newer technologies such as label-free mass spectrometry due to large numbers of non-random missing values. Here we present a sequential projection pursuit PCA (sppPCA) method for defining principal components in the presence of missing data. Our results demonstrate that this approach generates robust and informative low-dimensional data representations compared to commonly used imputation approaches.

Details

Language :
English
ISSN :
1940-9818
Volume :
54
Issue :
3
Database :
MEDLINE
Journal :
BioTechniques
Publication Type :
Academic Journal
Accession number :
23477384
Full Text :
https://doi.org/10.2144/000113978