101. CCFold: rapid and accurate prediction of coiled-coil structures and application to modelling intermediate filaments
- Author
-
Sergei V. Strelkov and Dmytro Guzenko
- Subjects
0301 basic medicine ,Statistics and Probability ,Source code ,Computer science ,media_common.quotation_subject ,Dimer ,Protein dimer ,Bioinformatics ,Biochemistry ,Protein filament ,03 medical and health sciences ,chemistry.chemical_compound ,Protein sequencing ,Molecule ,Intermediate filament ,Molecular Biology ,media_common ,Physics ,Coiled coil ,Computer Science Applications ,Computational Mathematics ,030104 developmental biology ,Computational Theory and Mathematics ,chemistry ,Threading (protein sequence) ,Biological system ,Algorithm - Abstract
Motivation Accurate molecular structure of the protein dimer representing the elementary building block of intermediate filaments (IFs) is essential towards the understanding of the filament assembly, rationalizing their mechanical properties and explaining the effect of disease-related IF mutations. The dimer contains a ∼300-residue long α-helical coiled coil which cannot be assessed by either direct experimental structure determination or modelling using standard approaches. At the same time, coiled coils are well-represented in structural databases. Results Here we present CCFold, a generally applicable threading-based algorithm which produces coiled-coil models from protein sequence only. The algorithm is based on a statistical analysis of experimentally determined structures and can handle any hydrophobic repeat patterns in addition to the most common heptads. We demonstrate that CCFold outperforms general-purpose computational folding in terms of accuracy, while being faster by orders of magnitude. By combining the CCFold algorithm and Rosetta folding we generate representative dimer models for all IF protein classes. Availability and implementation The source code is freely available at https://github.com/biocryst/IF; a web server to run the program is at http://pharm.kuleuven.be/Biocrystallography/cc. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2017