Back to Search
Start Over
Prediction of heterotrimeric protein complexes by two-phase learning using neighboring kernels
- Source :
- BMC Bioinformatics
- Publication Year :
- 2014
-
Abstract
- [Background]Protein complexes play important roles in biological systems such as gene regulatory networks and metabolic pathways. Most methods for predicting protein complexes try to find protein complexes with size more than three. It, however, is known that protein complexes with smaller sizes occupy a large part of whole complexes for several species. In our previous work, we developed a method with several feature space mappings and the domain composition kernel for prediction of heterodimeric protein complexes, which outperforms existing methods. [Results]We propose methods for prediction of heterotrimeric protein complexes by extending techniques in the previous work on the basis of the idea that most heterotrimeric protein complexes are not likely to share the same protein with each other. We make use of the discriminant function in support vector machines (SVMs), and design novel feature space mappings for the second phase. As the second classifier, we examine SVMs and relevance vector machines (RVMs). We perform 10-fold cross-validation computational experiments. The results suggest that our proposed two-phase methods and SVM with the extended features outperform the existing method NWE, which was reported to outperform other existing methods such as MCL, MCODE, DPClus, CMC, COACH, RRW, and PPSampler for prediction of heterotrimeric protein complexes. [Conclusions]We propose two-phase prediction methods with the extended features, the domain composition kernel, SVMs and RVMs. The two-phase method with the extended features and the domain composition kernel using SVM as the second classifier is particularly useful for prediction of heterotrimeric protein complexes.
- Subjects :
- Support Vector Machine
business.industry
Applied Mathematics
Feature vector
Gene regulatory network
Discriminant Analysis
Pattern recognition
Biology
Machine learning
computer.software_genre
Biochemistry
Computer Science Applications
Support vector machine
Relevance vector machine
Proceedings
Structural Biology
Prediction methods
Heterotrimeric G protein
Multiprotein Complexes
Artificial intelligence
Protein Multimerization
business
Molecular Biology
computer
Classifier (UML)
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 15
- Database :
- OpenAIRE
- Journal :
- BMC bioinformatics
- Accession number :
- edsair.doi.dedup.....b4c3593c31282874ab32eddae34a8c59