1. Multifaceted protein–protein interaction prediction based on Siamese residual RCNN
- Author
-
Xuelu Chen, Wei Wang, Carlo Zaniolo, Tianran Zhang, Muhao Chen, Chelsea J.-T. Ju, Guangyu Zhou, and Kai-Wei Chang
- Subjects
Statistics and Probability ,Computer science ,Residual ,Machine learning ,computer.software_genre ,Biochemistry ,Convolutional neural network ,03 medical and health sciences ,0302 clinical medicine ,Ismb/Eccb 2019 Conference Proceedings ,Amino Acid Sequence ,Molecular Biology ,Peptide sequence ,Mutual influence ,030304 developmental biology ,0303 health sciences ,Sequence ,Artificial neural network ,business.industry ,Computational Biology ,Proteins ,Macromolecular Sequence, Structure, and Function ,Ligand (biochemistry) ,Computer Science Applications ,Computational Mathematics ,ComputingMethodologies_PATTERNRECOGNITION ,Computational Theory and Mathematics ,030220 oncology & carcinogenesis ,Protein–protein interaction prediction ,Neural Networks, Computer ,Artificial intelligence ,business ,computer ,Algorithms ,Protein Binding - Abstract
MotivationSequence-based protein–protein interaction (PPI) prediction represents a fundamental computational biology problem. To address this problem, extensive research efforts have been made to extract predefined features from the sequences. Based on these features, statistical algorithms are learned to classify the PPIs. However, such explicit features are usually costly to extract, and typically have limited coverage on the PPI information.ResultsWe present an end-to-end framework, PIPR (Protein–Protein Interaction Prediction Based on Siamese Residual RCNN), for PPI predictions using only the protein sequences. PIPR incorporates a deep residual recurrent convolutional neural network in the Siamese architecture, which leverages both robust local features and contextualized information, which are significant for capturing the mutual influence of proteins sequences. PIPR relieves the data pre-processing efforts that are required by other systems, and generalizes well to different application scenarios. Experimental evaluations show that PIPR outperforms various state-of-the-art systems on the binary PPI prediction problem. Moreover, it shows a promising performance on more challenging problems of interaction type prediction and binding affinity estimation, where existing approaches fall short.Availability and implementationThe implementation is available at https://github.com/muhaochen/seq_ppi.git.Supplementary informationSupplementary data are available at Bioinformatics online.
- Published
- 2019