1. A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data
- Author
-
William W. L. Cheung, Debajyoti Chowdhury, Zhenmiao Zhang, Zhaoxiang Bian, Aiping Lu, Chao Yang, and Lu Zhang
- Subjects
CNN, convolutional neural network ,Metagenome-assembled genomes ,Computer science ,Metagenome binning ,Gene functional annotation ,Taxonomic classification ,Gene prediction ,PacBio, Pacific Biosciences ,Microbial abundance profiling ,Sequencing data ,MAGs, metagenome-assembled genomes ,Biophysics ,Sequence assembly ,LCA, lowest common ancestor ,Review Article ,Computational biology ,ORFs, open reading frames ,KEGG, Kyoto Encyclopedia of Genes and Genomes ,Biochemistry ,Genome ,TNFs, tetranucleotide frequencies ,Annotation ,Structural Biology ,Genetics ,Quantitative Biology - Genomics ,ComputingMethodologies_COMPUTERGRAPHICS ,Genomics (q-bio.GN) ,Profiling (computer programming) ,Genome assembly ,OLC, overlap-layout consensus ,HMM, Hidden Markov Model ,ONT, Oxford Nanopore Technologies ,Biological classification ,SLR, synthetic long reads ,Computer Science Applications ,QC, quality control ,LPA, label propagation algorithm ,Metagenomics ,FOS: Biological sciences ,GTDB, Genome Taxonomy Database ,DBG, De Bruijn graph ,Metagenomic sequencing ,TP248.13-248.65 ,Biotechnology - Abstract
Graphical abstract, Metagenomic sequencing provides a culture-independent avenue to investigate the complex microbial communities by constructing metagenome-assembled genomes (MAGs). A MAG represents a microbial genome by a group of sequences from genome assembly with similar characteristics. It enables us to identify novel species and understand their potential functions in a dynamic ecosystem. Many computational tools have been developed to construct and annotate MAGs from metagenomic sequencing, however, there is a prominent gap to comprehensively introduce their background and practical performance. In this paper, we have thoroughly investigated the computational tools designed for both upstream and downstream analyses, including metagenome assembly, metagenome binning, gene prediction, functional annotation, taxonomic classification, and profiling. We have categorized the commonly used tools into unique groups based on their functional background and introduced the underlying core algorithms and associated information to demonstrate a comparative outlook. Furthermore, we have emphasized the computational requisition and offered guidance to the users to select the most efficient tools. Finally, we have indicated current limitations, potential solutions, and future perspectives for further improving the tools of MAG construction and annotation. We believe that our work provides a consolidated resource for the current stage of MAG studies and shed light on the future development of more effective MAG analysis tools on metagenomic sequencing.
- Published
- 2021