Back to Search Start Over

A catalog of small proteins from the global microbiome

Authors :
Yiqian Duan
Célio Dias Santos-Júnior
Thomas Sebastian Schmidt
Anthony Fullam
Breno L. S. de Almeida
Chengkai Zhu
Michael Kuhn
Xing-Ming Zhao
Peer Bork
Luis Pedro Coelho
Source :
Nature Communications, Vol 15, Iss 1, Pp 1-11 (2024)
Publication Year :
2024
Publisher :
Nature Portfolio, 2024.

Abstract

Abstract Small open reading frames (smORFs) shorter than 100 codons are widespread and perform essential roles in microorganisms, where they encode proteins active in several cell functions, including signal pathways, stress response, and antibacterial activities. However, the ecology, distribution and role of small proteins in the global microbiome remain unknown. Here, we construct a global microbial smORFs catalog (GMSC) derived from 63,410 publicly available metagenomes across 75 distinct habitats and 87,920 high-quality isolate genomes. GMSC contains 965 million non-redundant smORFs with comprehensive annotations. We find that archaea harbor more smORFs proportionally than bacteria. We moreover provide a tool called GMSC-mapper to identify and annotate small proteins from microbial (meta)genomes. Overall, this publicly-available resource demonstrates the immense and underexplored diversity of small proteins.

Subjects

Subjects :
Science

Details

Language :
English
ISSN :
20411723
Volume :
15
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Nature Communications
Publication Type :
Academic Journal
Accession number :
edsdoj.06a73a03ddbd4d009f53501faebba7e2
Document Type :
article
Full Text :
https://doi.org/10.1038/s41467-024-51894-6