Protein-peptide binding residue prediction based on protein language models and cross-attention mechanism.
- Source :
- Analytical biochemistry [Anal Biochem] 2024 Nov; Vol. 694, pp. 115637. Date of Electronic Publication: 2024 Aug 08.
- Publication Year :
- 2024
Abstract
- Accurate identification of protein-peptide binding residues is essential for understanding protein-peptide interactions and advancing drug discovery. To address this problem, extensive research efforts have been made to design more discriminative feature representations. However, extracting these explicit features usually depends on third-party tools, which results in low computational efficiency and limits predictive performance. In this study, we design an end-to-end deep learning-based method, E2EPep, for protein-peptide binding residue prediction using protein sequence only. E2EPep first employs and fine-tunes two state-of-the-art pre-trained protein language models that can extract two different high-latent feature representations from protein sequences relevant for protein structures and functions. A novel feature fusion module is then designed in E2EPep to fuse and optimize the above two feature representations of binding residues. In addition, we also design E2EPep+, which integrates the E2EPep and PepBCL models, to improve the prediction performance. Experimental results on two independent testing data sets demonstrate that E2EPep and E2EPep+ achieve average AUC values of 0.846 and 0.842 while achieving an average Matthews correlation coefficient value that is significantly higher than that of most existing sequence-based methods and comparable to that of the state-of-the-art structure-based predictors. Detailed data analysis shows that the primary strength of E2EPep lies in the effectiveness of its feature representation, which uses a cross-attention mechanism to fuse the embeddings generated by two fine-tuned protein language models.
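The fusion idea described in the abstract can be illustrated with a minimal sketch of single-head cross-attention between two per-residue embedding matrices. This is not the E2EPep implementation: the embeddings below are random stand-ins for language-model outputs, and every name, dimension, and weight matrix (`emb_a`, `emb_b`, `w_q`, `w_k`, `w_v`, `w_out`) is a hypothetical chosen for illustration. The actual architecture is documented in the paper and the repository.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the maximum along the axis for numerical stability.
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_attention(query_source, context_source, w_q, w_k, w_v):
    """Single-head cross-attention: each residue in `query_source`
    attends over all residues in `context_source`."""
    q = query_source @ w_q        # (L, d_model)
    k = context_source @ w_k      # (L, d_model)
    v = context_source @ w_v      # (L, d_model)
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (L, L) scaled dot products
    weights = softmax(scores, axis=-1)        # each row sums to 1
    return weights @ v, weights


rng = np.random.default_rng(0)

# Toy sizes (assumptions): 6 residues, two embedding widths, one shared width.
seq_len, dim_a, dim_b, dim_model = 6, 8, 10, 4

# Random stand-ins for the embeddings of two protein language models.
emb_a = rng.normal(size=(seq_len, dim_a))
emb_b = rng.normal(size=(seq_len, dim_b))

# Randomly initialised projections; in a real model these are learned.
w_q = rng.normal(size=(dim_a, dim_model))
w_k = rng.normal(size=(dim_b, dim_model))
w_v = rng.normal(size=(dim_b, dim_model))

fused, attn = cross_attention(emb_a, emb_b, w_q, w_k, w_v)

# Hypothetical logistic head mapping each fused vector to a binding score.
w_out = rng.normal(size=(dim_model,))
probs = 1.0 / (1.0 + np.exp(-(fused @ w_out)))

print(fused.shape, attn.shape, probs.shape)
```

Here queries come from one embedding and keys and values from the other, so each residue's fused vector is a weighted mix of the second model's features. A symmetric variant would run the attention in both directions and combine the two results.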
The standalone package of E2EPep and E2EPep+ can be obtained at https://github.com/ckx259/E2EPep.git for academic use only.
Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
(Copyright © 2024 Elsevier Inc. All rights reserved.)
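The Matthews correlation coefficient cited in the abstract is a common choice for binding-residue prediction because binding residues are usually a small minority of all residues, which makes plain accuracy misleading. The sketch below computes it from a binary confusion matrix. The labels are made-up toy data, not results from the paper, and the helper name `matthews_cc` is hypothetical.

```python
import math


def matthews_cc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (1 = binding)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: return 0 when any marginal count is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Toy example: 10 residues, 2 of which truly bind (imbalanced classes).
y_true = [1, 0, 0, 0, 0, 0, 0, 1, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
mcc = matthews_cc(y_true, y_pred)

print(f"accuracy = {accuracy:.3f}, MCC = {mcc:.3f}")
# prints "accuracy = 0.800, MCC = 0.375"
```

In this toy case accuracy looks high because most residues are correctly called non-binding, while the MCC reflects that only one of the two binding residues was recovered.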
Details
- Language :
- English
- ISSN :
- 1096-0309
- Volume :
- 694
- Database :
- MEDLINE
- Journal :
- Analytical biochemistry
- Publication Type :
- Academic Journal
- Accession number :
- 39121938
- Full Text :
- https://doi.org/10.1016/j.ab.2024.115637