Back to Search Start Over

Protein-DNA Binding Residue Prediction via Bagging Strategy and Sequence-based Cube-Format Feature

Authors :
Gui-Jun Zhang
Ning-Xin Jia
Dong-Jun Yu
Zheng Linlin
Bai Yansong
Jun Hu
Source :
IEEE/ACM transactions on computational biology and bioinformatics.
Publication Year :
2021

Abstract

Protein-DNA interactions play an important role in biological processes. Accurately identifying DNA-binding residues is a critical but challenging task for protein function annotations and drug design. Although wet-lab experimental methods are the most accurate way to identify DNA-binding residues, they are time consuming and labor intensive. There is an urgent need to develop computational methods to rapidly and accurately predict DNA-binding residues. In this study, we propose a novel sequence-based method, named PredDBR, for predicting DNA-binding residues. In PredDBR, for each protein, its position-specific frequency matrix (PSFM), predicted secondary structure (PSS), and predicted probabilities of ligand-binding residues (PPLBR) are first generated as three feature sources. Secondly, for each feature source, the sliding window technique is employed to extract the matrix-format feature of each residue. Then, we design two strategies, i.e., SR and AVE, to separately transform PSFM-based and two predicted feature source-based, i.e., PSS-based and PPLBR-based, matrix-format features of each residue into three cube-format features. Finally, after serially combining the three cube-format features, the ensemble classifier is generated via applying bagging strategy to multiple base classifiers built by the framework of 2D convolutional neural network. Experimental results demonstrate that PredDBR outperforms several state-of-the-art sequenced-based DNA-binding residue predictors.

Details

ISSN :
15579964
Database :
OpenAIRE
Journal :
IEEE/ACM transactions on computational biology and bioinformatics
Accession number :
edsair.doi.dedup.....22f30d8db488e2fbe85dc5896a77de67