Exploring neighborhoods in large metagenome assembly graphs using spacegraphcats reveals hidden sequence diversity
- Source :
- Genome Biology, Vol 21, Iss 1, Pp 1-16 (2020)
- Publication Year :
- 2020
- Publisher :
- BMC, 2020.
Abstract
- Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/spacegraphcats under the 3-Clause BSD License.
- Subjects :
- lcsh:QH426-470
Bioinformatics
Method
Sequence assembly
Variation (game tree)
Computational biology
Biology
Genome
Software implementation
03 medical and health sciences
0302 clinical medicine
Dominating set
Bounded expansion
Information and Computing Sciences
Genetics
DNA assembly
lcsh:QH301-705.5
030304 developmental biology
0303 health sciences
Sequence
Human Genome
Genetic Variation
Biological Sciences
lcsh:Genetics
ComputingMethodologies_PATTERNRECOGNITION
Strain variation
Networking and Information Technology R&D (NITRD)
lcsh:Biology (General)
Metagenomics
030217 neurology & neurosurgery
Algorithms
Software
Environmental Sciences
Details
- Language :
- English
- Volume :
- 21
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- Genome Biology
- Accession number :
- edsair.doi.dedup.....fa09e3b64eccdf289e0ac59ad70437a6