Back to Search Start Over

Exploring the variability of DNA molecules via principal geodesic analysis on the shape space

Authors :
H. Fotouhi
Mousa Golalizadeh
Source :
Journal of Applied Statistics. 39:2199-2207
Publication Year :
2012
Publisher :
Informa UK Limited, 2012.

Abstract

Most of the linear statistics deal with data lying in a Euclidean space. However, there are many examples, such as DNA molecule topological structures, in which the initial or the transformed data lie in a non-Euclidean space. To get a measure of variability in these situations, the principal component analysis (PCA) is usually performed on a Euclidean tangent space as it cannot be directly implemented on a non-Euclidean space. Instead, principal geodesic analysis (PGA) is a new tool that provides a measure of variability for nonlinear statistics. In this paper, the performance of this new tool is compared with that of the PCA using a real data set representing a DNA molecular structure. It is shown that due to the nonlinearity of space, the PGA explains more variability of the data than the PCA.

Details

ISSN :
13600532 and 02664763
Volume :
39
Database :
OpenAIRE
Journal :
Journal of Applied Statistics
Accession number :
edsair.doi...........a805719759797705fa609fec30f0d081