Back to Search Start Over

PSOPIA: Toward more reliable protein-protein interaction prediction from sequence information

Authors :
Kenji Mizuguchi
Yoichi Murakami
Source :
2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS).
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

A better understanding of biological processes, pathways and functions requires reliable information about protein-protein interactions (PPIs). However, it is still a difficult task to identify complete PPI-networks experimentally in a cell or organism. To supplement the limitations of current experimental techniques, we have proposed PSOPIA, a computational method to predict whether two proteins interact or not (http://mizuguchilab.org/PSOPIA/) [1]. The selection of datasets is a big issue for the PPI prediction [2, 3]. It is generally believed that increasing the size and diversity of examples makes the dataset more representative and reduces the noise effects; however, for many algorithms, it is impractical to use a large-scale dataset at the proteome level because of the memory and CPU time requirements. In this study, PSOPIA was retrained on a highly imbalanced large-scale dataset having a diverse set of examples at the proteome level. The dataset consisted of 43,060 high confidence direct physical PPIs obtained from TargetMine [4] (as positives being only 0.13% of the total) and 33,098,951 negative PPIs. As a result, the new prediction model achieved the higher AUC of 0.89 (pAUCfpr

Details

Database :
OpenAIRE
Journal :
2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)
Accession number :
edsair.doi...........fab494ca613507cc9bdf2aa15b153205
Full Text :
https://doi.org/10.1109/iciibms.2017.8279749