Back to Search
Start Over
BEST: improved prediction of B-cell epitopes from antigen sequences
- Source :
- PLoS ONE, Vol 7, Iss 6, p e40104 (2012), PLoS ONE
- Publication Year :
- 2012
- Publisher :
- Public Library of Science (PLoS), 2012.
-
Abstract
- Accurate identification of immunogenic regions in a given antigen chain is a difficult and actively pursued problem. Although accurate predictors for T-cell epitopes are already in place, the prediction of the B-cell epitopes requires further research. We overview the available approaches for the prediction of B-cell epitopes and propose a novel and accurate sequence-based solution. Our BEST (B-cell Epitope prediction using Support vector machine Tool) method predicts epitopes from antigen sequences, in contrast to some method that predict only from short sequence fragments, using a new architecture based on averaging selected scores generated from sliding 20-mers by a Support Vector Machine (SVM). The SVM predictor utilizes a comprehensive and custom designed set of inputs generated by combining information derived from the chain, sequence conservation, similarity to known (training) epitopes, and predicted secondary structure and relative solvent accessibility. Empirical evaluation on benchmark datasets demonstrates that BEST outperforms several modern sequence-based B-cell epitope predictors including ABCPred, method by Chen et al. (2007), BCPred, COBEpro, BayesB, and CBTOPE, when considering the predictions from antigen chains and from the chain fragments. Our method obtains a cross-validated area under the receiver operating characteristic curve (AUC) for the fragment-based prediction at 0.81 and 0.85, depending on the dataset. The AUCs of BEST on the benchmark sets of full antigen chains equal 0.57 and 0.6, which is significantly and slightly better than the next best method we tested. We also present case studies to contrast the propensity profiles generated by BEST and several other methods.
- Subjects :
- B Cells
Support Vector Machine
Computer science
Immune Cells
Immunology
Biophysics
Antigen-Presenting Cells
lcsh:Medicine
Antigen Processing and Recognition
Bioinformatics
Epitope
Set (abstract data type)
Epitopes
03 medical and health sciences
Similarity (network science)
Antigen
Molecular Cell Biology
Humans
Biomacromolecule-Ligand Interactions
Antigens
B-Cell Epitopes
lcsh:Science
Biology
Protein secondary structure
030304 developmental biology
0303 health sciences
Sequence
Multidisciplinary
T Cells
business.industry
Physics
030302 biochemistry & molecular biology
lcsh:R
Computational Biology
Pattern recognition
Protein structure prediction
Support vector machine
Area Under Curve
Computer Science
Benchmark (computing)
lcsh:Q
Artificial intelligence
Cellular Types
business
Sequence Analysis
Research Article
Computer Modeling
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 7
- Issue :
- 6
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....592544288d68bcb69f569c710b10d6a2