Back to Search Start Over

Decoding functional proteome information in model organisms using protein language models.

Authors :
Barrios-Núñez I
Martínez-Redondo GI
Medina-Burgos P
Cases I
Fernández R
Rojas AM
Source :
NAR genomics and bioinformatics [NAR Genom Bioinform] 2024 Jul 02; Vol. 6 (3), pp. lqae078. Date of Electronic Publication: 2024 Jul 02 (Print Publication: 2024).
Publication Year :
2024

Abstract

Protein language models have been tested and proved to be reliable when used on curated datasets but have not yet been applied to full proteomes. Accordingly, we tested how two different machine learning-based methods performed when decoding functional information from the proteomes of selected model organisms. We found that protein language models are more precise and informative than deep learning methods for all the species tested and across the three gene ontologies studied, and that they better recover functional information from transcriptomic experiments. The results obtained indicate that these language models are likely to be suitable for large-scale annotation and downstream analyses, and we recommend a guide for their use.<br /> (© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)

Details

Language :
English
ISSN :
2631-9268
Volume :
6
Issue :
3
Database :
MEDLINE
Journal :
NAR genomics and bioinformatics
Publication Type :
Academic Journal
Accession number :
38962255
Full Text :
https://doi.org/10.1093/nargab/lqae078