Utilization of defined microbial communities enables effective evaluation of meta-genomic assemblies

Authors :: William W. Greenwald
Niels Klitgord
Victor Seguritan
Shibu Yooseph
J. Craig Venter
Chad Garner
Karen E. Nelson
Weizhong Li
Source :: BMC Genomics, Vol 18, Iss 1, Pp 1-11 (2017)
Publication Year :: 2017
Publisher :: BMC, 2017.
Abstract: Background: Metagenomics is the study of the microbial genomes isolated from communities found on our bodies or in our environment. By correctly determining the relation between human health and the human-associated microbial communities, novel mechanisms of health and disease can be found, thus enabling the development of novel diagnostics and therapeutics. Due to the diversity of the microbial communities, strategies developed for aligning human genomes cannot be utilized, and genomes of the microbial species in the community must be assembled de novo. However, in order to obtain the best metagenomic assemblies, it is important to choose the proper assembler. Due to the rapidly evolving nature of metagenomics, new assemblers are constantly created, and the field has not yet agreed on a standardized process. Furthermore, the truth sets used to compare these methods are either too simple (computationally derived diverse communities) or too complex (microbial communities of unknown composition), yielding results that are hard to interpret. In this analysis, we interrogate the strengths and weaknesses of five popular assemblers through the use of defined biological samples of known genomic composition and abundance. We assessed the performance of each assembler on its ability to reassemble genomes, call taxonomic abundances, and recreate open reading frames (ORFs).
Results: We tested five metagenomic assemblers (Omega, metaSPAdes, IDBA-UD, metaVelvet, and MEGAHIT) on known and synthetic metagenomic data sets. MetaSPAdes excelled in diverse sets, IDBA-UD performed well all around, metaVelvet had high accuracy in high-abundance organisms, and MEGAHIT was able to accurately differentiate similar organisms within a community. At the ORF level, metaSPAdes and MEGAHIT had the fewest missing ORFs within diverse and similar communities, respectively.
Conclusions: Depending on the metagenomics question asked, the correct assembler for the task at hand will differ. It is important to choose the appropriate assembler, and thus to clearly define the biological problem of an experiment, as different assemblers will give different answers to the same question.

Subjects :: Reference Genome
Human Microbiome Project
Mock Community
Longe Contig
Balance Community
Biotechnology
TP248.13-248.65
Genetics
QH426-470

Details

Language :: English
ISSN :: 14712164
Volume :: 18
Issue :: 1
Database :: Directory of Open Access Journals
Journal :: BMC Genomics
Publication Type :: Academic Journal
Accession number :: edsdoj.34673d91ae6240008ee7b68e595e613a
Document Type :: article
Full Text :: https://doi.org/10.1186/s12864-017-3679-5
