Back to Search Start Over

Review, Evaluation, and Directions for Gene-Targeted Assembly for Ecological Analyses of Metagenomes.

Authors :
Guo, Jiarong
Guo, Jiarong
Quensen, John F
Sun, Yanni
Wang, Qiong
Brown, C Titus
Cole, James R
Tiedje, James M
Guo, Jiarong
Guo, Jiarong
Quensen, John F
Sun, Yanni
Wang, Qiong
Brown, C Titus
Cole, James R
Tiedje, James M
Publication Year :
2019

Abstract

Shotgun metagenomics has greatly advanced our understanding of microbial communities over the last decade. Metagenomic analyses often include assembly and genome binning, computationally daunting tasks especially for big data from complex environments such as soil and sediments. In many studies, however, only a subset of genes and pathways involved in specific functions are of interest; thus, it is not necessary to attempt global assembly. In addition, methods that target genes can be computationally more efficient and produce more accurate assembly by leveraging rich databases, especially for those genes that are of broad interest such as those involved in biogeochemical cycles, biodegradation, and antibiotic resistance or used as phylogenetic markers. Here, we review six gene-targeted assemblers with unique algorithms for extracting and/or assembling targeted genes: Xander, MegaGTA, SAT-Assembler, HMM-GRASPx, GenSeed-HMM, and MEGAN. We tested these tools using two datasets with known genomes, a synthetic community of artificial reads derived from the genomes of 17 bacteria, shotgun sequence data from a mock community with 48 bacteria and 16 archaea genomes, and a large soil shotgun metagenomic dataset. We compared assemblies of a universal single copy gene (rplB) and two N cycle genes (nifH and nirK). We measured their computational efficiency, sensitivity, specificity, and chimera rate and found Xander and MegaGTA, which both use a probabilistic graph structure to model the genes, have the best overall performance with all three datasets, although MEGAN, a reference matching assembler, had better sensitivity with synthetic and mock community members chosen from its reference collection. Also, Xander and MegaGTA are the only tools that include post-assembly scripts tuned for common molecular ecology and diversity analyses. Additionally, we provide a mathematical model for estimating the probability of assembling targeted genes in a metagenome for estimating requir

Details

Database :
OAIster
Notes :
application/pdf
Publication Type :
Electronic Resource
Accession number :
edsoai.on1391602901
Document Type :
Electronic Resource