Back to Search
Start Over
Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms
- Source :
- Briefings in bioinformatics.
- Publication Year :
- 2018
-
Abstract
- Quorum-sensing peptides (QSPs) are the signal molecules that are closely associated with diverse cellular processes, such as cell-cell communication, and gene expression regulation in Gram-positive bacteria. It is therefore of great importance to identify QSPs for better understanding and in-depth revealing of their functional mechanisms in physiological processes. Machine learning algorithms have been developed for this purpose, showing the great potential for the reliable prediction of QSPs. In this study, several sequence-based feature descriptors for peptide representation and machine learning algorithms are comprehensively reviewed, evaluated and compared. To effectively use existing feature descriptors, we used a feature representation learning strategy that automatically learns the most discriminative features from existing feature descriptors in a supervised way. Our results demonstrate that this strategy is capable of effectively capturing the sequence determinants to represent the characteristics of QSPs, thereby contributing to the improved predictive performance. Furthermore, wrapping this feature representation learning strategy, we developed a powerful predictor named QSPred-FL for the detection of QSPs in large-scale proteomic data. Benchmarking results with 10-fold cross validation showed that QSPred-FL is able to achieve better performance as compared to the state-of-the-art predictors. In addition, we have established a user-friendly webserver that implements QSPred-FL, which is currently available at http://server.malab.cn/QSPred-FL. We expect that this tool will be useful for the high-throughput prediction of QSPs and the discovery of important functional mechanisms of QSPs.
- Subjects :
- 0301 basic medicine
Web server
Computer science
business.industry
Feature selection
Machine learning
computer.software_genre
Cross-validation
03 medical and health sciences
Identification (information)
030104 developmental biology
0302 clinical medicine
Discriminative model
030220 oncology & carcinogenesis
Feature (machine learning)
Artificial intelligence
Representation (mathematics)
business
Molecular Biology
computer
Feature learning
Algorithm
Information Systems
Subjects
Details
- ISSN :
- 14774054
- Database :
- OpenAIRE
- Journal :
- Briefings in bioinformatics
- Accession number :
- edsair.doi.dedup.....ab619a0e87140e5acb0a0ecc9a30578a