Back to Search
Start Over
PSOPIA: Toward more reliable protein-protein interaction prediction from sequence information
- Source :
- 2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS).
- Publication Year :
- 2017
- Publisher :
- IEEE, 2017.
-
Abstract
- A better understanding of biological processes, pathways and functions requires reliable information about protein-protein interactions (PPIs). However, it is still a difficult task to identify complete PPI-networks experimentally in a cell or organism. To supplement the limitations of current experimental techniques, we have proposed PSOPIA, a computational method to predict whether two proteins interact or not (http://mizuguchilab.org/PSOPIA/) [1]. The selection of datasets is a big issue for the PPI prediction [2, 3]. It is generally believed that increasing the size and diversity of examples makes the dataset more representative and reduces the noise effects; however, for many algorithms, it is impractical to use a large-scale dataset at the proteome level because of the memory and CPU time requirements. In this study, PSOPIA was retrained on a highly imbalanced large-scale dataset having a diverse set of examples at the proteome level. The dataset consisted of 43,060 high confidence direct physical PPIs obtained from TargetMine [4] (as positives being only 0.13% of the total) and 33,098,951 negative PPIs. As a result, the new prediction model achieved the higher AUC of 0.89 (pAUCfpr
Details
- Database :
- OpenAIRE
- Journal :
- 2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)
- Accession number :
- edsair.doi...........fab494ca613507cc9bdf2aa15b153205
- Full Text :
- https://doi.org/10.1109/iciibms.2017.8279749