Back to Search
Start Over
Exploring the variability of DNA molecules via principal geodesic analysis on the shape space
- Source :
- Journal of Applied Statistics. 39:2199-2207
- Publication Year :
- 2012
- Publisher :
- Informa UK Limited, 2012.
-
Abstract
- Most of the linear statistics deal with data lying in a Euclidean space. However, there are many examples, such as DNA molecule topological structures, in which the initial or the transformed data lie in a non-Euclidean space. To get a measure of variability in these situations, the principal component analysis (PCA) is usually performed on a Euclidean tangent space as it cannot be directly implemented on a non-Euclidean space. Instead, principal geodesic analysis (PGA) is a new tool that provides a measure of variability for nonlinear statistics. In this paper, the performance of this new tool is compared with that of the PCA using a real data set representing a DNA molecular structure. It is shown that due to the nonlinearity of space, the PGA explains more variability of the data than the PCA.
- Subjects :
- Statistics and Probability
Euclidean space
Statistical shape analysis
Sparse PCA
Space (mathematics)
Topology
Measure (mathematics)
Principal component analysis
Tangent space
Mathematics::Metric Geometry
Statistics, Probability and Uncertainty
Principal geodesic analysis
Algorithm
Mathematics
Subjects
Details
- ISSN :
- 13600532 and 02664763
- Volume :
- 39
- Database :
- OpenAIRE
- Journal :
- Journal of Applied Statistics
- Accession number :
- edsair.doi...........a805719759797705fa609fec30f0d081