Back to Search Start Over

FALCON@home: a high-throughput protein structure prediction server based on remote homologue recognition

Authors :
Shiwei Sun
Wei-Mou Zheng
Kang Ning
Dongbo Bu
Chao Wang
Jianwei Zhu
Haicang Zhang
Dong Xu
Shuai Cheng Li
Bing Wang
Source :
Bioinformatics. 32:462-464
Publication Year :
2015
Publisher :
Oxford University Press (OUP), 2015.

Abstract

Summary: The protein structure prediction approaches can be categorized into template-based modeling (including homology modeling and threading) and free modeling. However, the existing threading tools perform poorly on remote homologous proteins. Thus, improving fold recognition for remote homologous proteins remains a challenge. Besides, the proteome-wide structure prediction poses another challenge of increasing prediction throughput. In this study, we presented FALCON@home as a protein structure prediction server focusing on remote homologue identification. The design of FALCON@home is based on the observation that a structural template, especially for remote homologous proteins, consists of conserved regions interweaved with highly variable regions. The highly variable regions lead to vague alignments in threading approaches. Thus, FALCON@home first extracts conserved regions from each template and then aligns a query protein with conserved regions only rather than the full-length template directly. This helps avoid the vague alignments rooted in highly variable regions, improving remote homologue identification. We implemented FALCON@home using the Berkeley Open Infrastructure of Network Computing (BOINC) volunteer computing protocol. With computation power donated from over 20 000 volunteer CPUs, FALCON@home shows a throughput as high as processing of over 1000 proteins per day. In the Critical Assessment of protein Structure Prediction (CASP11), the FALCON@home-based prediction was ranked the 12th in the template-based modeling category. As an application, the structures of 880 mouse mitochondria proteins were predicted, which revealed the significant correlation between protein half-lives and protein structural factors. Availability and implementation: FALCON@home is freely available at http://protein.ict.ac.cn/FALCON/. Contact: shuaicli@cityu.edu.hk, dbu@ict.ac.cn Supplementary information: Supplementary data are available at Bioinformatics online.

Details

ISSN :
13674811 and 13674803
Volume :
32
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....a34c3518d269f8bb6cdfd185a6ef4d72
Full Text :
https://doi.org/10.1093/bioinformatics/btv581