Back to Search
Start Over
Enhancing the Biological Relevance of Machine Learning Classifiers for Reverse Vaccinology
- Source :
- International Journal of Molecular Sciences, International Journal of Molecular Sciences; Volume 18; Issue 2; Pages: 312, International Journal of Molecular Sciences, Vol 18, Iss 2, p 312 (2017)
- Publication Year :
- 2017
- Publisher :
- MDPI, 2017.
-
Abstract
- Reverse vaccinology (RV) is a bioinformatics approach that can predict antigens with protective potential from the protein coding genomes of bacterial pathogens for subunit vaccine design. RV has become firmly established following the development of the BEXSERO® vaccine against Neisseria meningitidis serogroup B. RV studies have begun to incorporate machine learning (ML) techniques to distinguish bacterial protective antigens (BPAs) from non-BPAs. This research contributes significantly to the RV field by using permutation analysis to demonstrate that a signal for protective antigens can be curated from published data. Furthermore, the effects of the following on an ML approach to RV were also assessed: nested cross-validation, balancing selection of non-BPAs for subcellular localization, increasing the training data, and incorporating greater numbers of protein annotation tools for feature generation. These enhancements yielded a support vector machine (SVM) classifier that could discriminate BPAs (n = 200) from non-BPAs (n = 200) with an area under the curve (AUC) of 0.787. In addition, hierarchical clustering of BPAs revealed that intracellular BPAs clustered separately from extracellular BPAs. However, no immediate benefit was derived when training SVM classifiers on data sets exclusively containing intra- or extracellular BPAs. In conclusion, this work demonstrates that ML classifiers have great utility in RV approaches and will lead to new subunit vaccines in the future.
- Subjects :
- 0301 basic medicine
Support Vector Machine
computer.software_genre
lcsh:Chemistry
Machine Learning
Epitopes
0302 clinical medicine
reverse vaccinology
Protein Annotation
030212 general & internal medicine
Feature generation
lcsh:QH301-705.5
Spectroscopy
Protein coding
Neisseria meningitidis serogroup
bacterial protective antigen
machine learning
support vector machine
bacterial pathogen
General Medicine
Computer Science Applications
Area Under Curve
Bacterial Vaccines
Vaccines, Subunit
Biology
Machine learning
Catalysis
Article
Inorganic Chemistry
03 medical and health sciences
Bacterial Proteins
Humans
Subunit vaccines
Physical and Theoretical Chemistry
Molecular Biology
Antigens, Bacterial
business.industry
Organic Chemistry
Reverse vaccinology
Computational Biology
Hierarchical clustering
Support vector machine
030104 developmental biology
lcsh:Biology (General)
lcsh:QD1-999
ROC Curve
Mutagenesis
Artificial intelligence
business
computer
Epitope Mapping
Subjects
Details
- Language :
- English
- ISSN :
- 14220067
- Database :
- OpenAIRE
- Journal :
- International Journal of Molecular Sciences, International Journal of Molecular Sciences; Volume 18; Issue 2; Pages: 312, International Journal of Molecular Sciences, Vol 18, Iss 2, p 312 (2017)
- Accession number :
- edsair.doi.dedup.....76f709d4b73a53eb25492ad15b00952f