Back to Search Start Over

Exploring neighborhoods in large metagenome assembly graphs using spacegraphcats reveals hidden sequence diversity

Authors :
C. Titus Brown
Dominik Moritz
Michael P. O’Brien
Felix Reidl
Taylor Reiter
Blair D. Sullivan
Source :
Genome Biology, Vol 21, Iss 1, Pp 1-16 (2020)
Publication Year :
2020
Publisher :
BMC, 2020.

Abstract

Abstract Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/spacegraphcats under the 3-Clause BSD License.

Details

Language :
English
ISSN :
1474760X
Volume :
21
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Genome Biology
Publication Type :
Academic Journal
Accession number :
edsdoj.881e80dd0ab84c88bad111a94e32beb0
Document Type :
article
Full Text :
https://doi.org/10.1186/s13059-020-02066-4