Back to Search Start Over

MGnify: the microbiome sequence data analysis resource in 2023.

Authors :
Richardson L
Allen B
Baldi G
Beracochea M
Bileschi ML
Burdett T
Burgin J
Caballero-Pérez J
Cochrane G
Colwell LJ
Curtis T
Escobar-Zepeda A
Gurbich TA
Kale V
Korobeynikov A
Raj S
Rogers AB
Sakharova E
Sanchez S
Wilkinson DJ
Finn RD
Source :
Nucleic acids research [Nucleic Acids Res] 2023 Jan 06; Vol. 51 (D1), pp. D753-D759.
Publication Year :
2023

Abstract

The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. The platform provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased the breadth of analyses provided, such as the analysis of long-read sequences. The MGnify protein database now exceeds 2.4 billion non-redundant sequences predicted from metagenomic assemblies. This collection is now organised into a relational database making it possible to understand the genomic context of the protein through navigation back to the source assembly and sample metadata, marking a major improvement. To extend beyond the functional annotations already provided in MGnify, we have applied deep learning-based annotation methods. The technology underlying MGnify's Application Programming Interface (API) and website has been upgraded, and we have enabled the ability to perform downstream analysis of the MGnify data through the introduction of a coupled Jupyter Lab environment.<br /> (© The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.)

Details

Language :
English
ISSN :
1362-4962
Volume :
51
Issue :
D1
Database :
MEDLINE
Journal :
Nucleic acids research
Publication Type :
Academic Journal
Accession number :
36477304
Full Text :
https://doi.org/10.1093/nar/gkac1080