Principal component analysis in protein tertiary structure prediction.

Authors :
Álvarez, Óscar
Fernández-Martínez, Juan Luis
Fernández-Brillet, Celia
Cernea, Ana
Fernández-Muñiz, Zulima
Kloczkowski, Andrzej
Source :
Journal of Bioinformatics & Computational Biology. Apr2018, Vol. 16 Issue 2, p-1. 34p.
Publication Year :
2018

Abstract

We discuss the applicability of principal component analysis (PCA) to protein tertiary structure prediction from amino acid sequence. The algorithm presented in this paper belongs to the category of protein refinement models and involves establishing a low-dimensional space in which the sampling (and optimization) is carried out via a particle swarm optimizer (PSO). The reduced space is found via PCA performed on a set of low-energy protein models previously found using different optimization techniques. A high-frequency term is added to this expansion by projecting the best decoy onto the PCA basis set and calculating the residual model. This term is aimed at providing high-frequency details in the energy optimization. The goal of this research is to analyze how the dimensionality reduction affects the prediction capability of the PSO procedure. For that purpose, different proteins from the Critical Assessment of Techniques for Protein Structure Prediction experiments were modeled. In all cases, both the energy of the best decoy and the distance to the native structure decreased. Our analysis also shows how the predicted backbone structure of the native conformation and of alternative low-energy states varies with the PCA dimensionality. Generally speaking, the reconstruction can be successfully achieved with 10 principal components and the high-frequency term. We also provide a computational analysis of the protein energy landscape for the inverse problem of reconstructing structure from a reduced number of principal components, showing that the dimensionality reduction alleviates the ill-posed character of this high-dimensional energy optimization problem. The procedure explained in this paper is very fast and allows testing different PCA expansions. Our results show that PSO improves the energy of the best decoy used in the PCA when an adequate number of PCA terms is considered. [ABSTRACT FROM AUTHOR]
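
The workflow outlined in the abstract (PCA over a set of decoys, a residual term from the best decoy, then PSO in the reduced coefficient space) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the PSO constants, the search bounds, the choice to give the residual term its own coefficient, and above all the toy quadratic "energy" (squared distance to a synthetic target vector) are all assumptions made for illustration; a real application would substitute a protein energy function and real decoy coordinates.

```python
import numpy as np

def pca_basis(decoys, k):
    """Return the mean model and the first k principal directions (rows)."""
    mean = decoys.mean(axis=0)
    _, _, vt = np.linalg.svd(decoys - mean, full_matrices=False)
    return mean, vt[:k]

def reconstruct(a, mean, basis, residual):
    """Expand reduced coordinates: mean + PCA terms + scaled residual term.
    Scaling the residual by its own coefficient a[-1] is an assumption."""
    return mean + a[:-1] @ basis + a[-1] * residual

def pso(energy, start, lower, upper, n_particles=30, n_iter=100, seed=0):
    """Plain global-best PSO (illustrative constants, not from the paper).
    One particle is seeded at `start`, so the result is never worse than it."""
    rng = np.random.default_rng(seed)
    pos = rng.uniform(lower, upper, size=(n_particles, start.size))
    pos[0] = start
    vel = np.zeros_like(pos)
    pbest = pos.copy()
    pbest_val = np.array([energy(p) for p in pos])
    g = pbest[np.argmin(pbest_val)].copy()
    g_val = float(pbest_val.min())
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia and acceleration weights (assumed)
    for _ in range(n_iter):
        r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (g - pos)
        pos = np.clip(pos + vel, lower, upper)
        vals = np.array([energy(p) for p in pos])
        better = vals < pbest_val
        pbest[better], pbest_val[better] = pos[better], vals[better]
        if pbest_val.min() < g_val:
            g = pbest[np.argmin(pbest_val)].copy()
            g_val = float(pbest_val.min())
    return g, g_val

# Synthetic stand-in data: 20 "decoys" scattered around a hypothetical target.
rng = np.random.default_rng(1)
native = rng.normal(size=60)
decoys = native + 0.5 * rng.normal(size=(20, 60))

def energy_full(x):
    # Toy energy, NOT a protein force field: squared distance to the target.
    return float(np.sum((x - native) ** 2))

best = decoys[np.argmin([energy_full(d) for d in decoys])]

k = 10  # number of principal components retained
mean, basis = pca_basis(decoys, k)
a_best = basis @ (best - mean)              # projection of the best decoy
residual = best - (mean + a_best @ basis)   # part of it the basis misses
start = np.append(a_best, 1.0)              # reproduces the best decoy exactly

# Search box taken from the spread of decoy coefficients (an assumption).
coefs = (decoys - mean) @ basis.T
lower = np.append(coefs.min(axis=0), -1.0)
upper = np.append(coefs.max(axis=0), 2.0)

a_opt, e_opt = pso(lambda a: energy_full(reconstruct(a, mean, basis, residual)),
                   start, lower, upper)
e_best = energy_full(best)
print(f"best decoy energy: {e_best:.3f}, after PSO: {e_opt:.3f}")
```

Because the coefficient vector `start` reconstructs the best decoy exactly, seeding one particle with it guarantees that the swarm's final energy is no higher than that of the best decoy. Changing `k` is a cheap way to probe how the number of retained components affects the search in this toy setting.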

Details

Language :
English
ISSN :
02197200
Volume :
16
Issue :
2
Database :
Academic Search Index
Journal :
Journal of Bioinformatics & Computational Biology
Publication Type :
Academic Journal
Accession number :
129503611
Full Text :
https://doi.org/10.1142/S0219720018500051