Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity
- Source :
- Briefings in bioinformatics. 18(6)
- Publication Year :
- 2017
Abstract
- X-ray crystallography is the main tool for determining protein structures. Yet the underlying crystallization process is costly, has a high attrition rate and involves a series of trial-and-error attempts to obtain diffraction-quality crystals. The Structural Genomics Consortium aims to systematically solve representative structures of major protein-fold classes, primarily using high-throughput X-ray crystallography. The attrition rate of these efforts can be reduced by selecting proteins that are likely to be easier to crystallize. In this context, bioinformatics approaches have been developed to predict crystallization propensity from protein sequences. These approaches are used to prioritize the most promising target proteins, to search for alternative structural orthologues of the target proteins and to suggest construct designs that may increase the likelihood of successful crystallization. We reviewed and compared nine predictors of protein crystallization propensity. Moreover, we demonstrated that using selected outputs from multiple predictors as candidate input features for a predictive model yields significantly higher predictive performance than using these predictors individually. Furthermore, we introduced a new and accurate predictor of protein crystallization propensity, Crysf, which uses functional features extracted from UniProt as inputs. This comprehensive review will assist structural biologists in selecting the most appropriate predictor and will help bioinformaticians develop a new generation of predictive algorithms.
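The idea of feeding the outputs of several predictors into a single model can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which model, features or predictors were combined, so the logistic-regression meta-model, the function names (`train_meta_model`, `predict_propensity`) and the toy scores below are all assumptions made purely for illustration.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real value to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_meta_model(score_rows, labels, lr=0.5, epochs=2000):
    """Fit a logistic-regression meta-model by batch gradient descent.

    score_rows: one row per protein; each row holds the scores that
                several (hypothetical) base predictors gave that protein.
    labels:     1 if the protein crystallized, 0 otherwise.
    """
    n_features = len(score_rows[0])
    n = len(score_rows)
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for row, y in zip(score_rows, labels):
            p = sigmoid(bias + sum(w * x for w, x in zip(weights, row)))
            err = p - y
            for j, x in enumerate(row):
                grad_w[j] += err * x
            grad_b += err
        # Average the gradients over the batch and take one step.
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def predict_propensity(weights, bias, row):
    """Combined crystallization propensity for one protein's score row."""
    return sigmoid(bias + sum(w * x for w, x in zip(weights, row)))


# Made-up toy data (assumption): scores from three imaginary predictors.
TOY_SCORES = [
    [0.9, 0.8, 0.7],
    [0.8, 0.6, 0.9],
    [0.7, 0.9, 0.6],
    [0.2, 0.3, 0.1],
    [0.3, 0.1, 0.4],
    [0.1, 0.2, 0.3],
]
TOY_LABELS = [1, 1, 1, 0, 0, 0]

if __name__ == "__main__":
    w, b = train_meta_model(TOY_SCORES, TOY_LABELS)
    print(predict_propensity(w, b, [0.85, 0.85, 0.85]))
    print(predict_propensity(w, b, [0.15, 0.15, 0.15]))
```

In practice the meta-model would be evaluated on proteins held out from training, since scoring it on the same data it was fitted to overstates the gain from combining predictors.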
- Subjects :
- Prioritization
0301 basic medicine
Paper
Computer science
Functional features
Context (language use)
Computational biology
Biology
Bioinformatics
Crystallography, X-Ray
030226 pharmacology & pharmacy
Structural genomics
03 medical and health sciences
0302 clinical medicine
Humans
Amino Acid Sequence
Databases, Protein
Molecular Biology
Selection (genetic algorithm)
Computational Biology
Proteins
Predictive analytics
030104 developmental biology
UniProt
Corrigendum
Crystallization
Protein crystallization
Algorithms
Software
Information Systems
Details
- ISSN :
- 1477-4054
- Volume :
- 18
- Issue :
- 6
- Database :
- OpenAIRE
- Journal :
- Briefings in bioinformatics
- Accession number :
- edsair.doi.dedup.....0d91c2065f5bb49b48c48d93ca9fdf47