MinYS: mine your symbiont by targeted genome assembly in symbiotic communities
- Source :
- NAR Genomics and Bioinformatics, Oxford University Press, 2020, 2 (3), pp.1-11. ⟨10.1093/nargab/lqaa047⟩
- Publication Year :
- 2020
- Publisher :
- HAL CCSD, 2020.
Abstract
- Most metazoans are associated with symbionts. Characterizing the effect of a particular symbiont often requires access to its genome, which is usually obtained by sequencing the whole community. We present MinYS, a targeted assembly approach that assembles one particular genome of interest from such metagenomic data. First, taking advantage of a reference genome, a subset of the reads is assembled into a set of backbone contigs. Then, this draft assembly is completed using the whole metagenomic readset in a de novo manner. The resulting assembly is output as a genome graph, which makes it possible to distinguish different strains with potential structural variants coexisting in the sample. MinYS was applied to 50 pea aphid re-sequencing samples, with low and high diversity, in order to recover the genome sequence of the aphid's obligatory bacterial symbiont, Buchnera aphidicola. It returned high-quality assemblies (a single-contig assembly in 90% of the samples), even when using increasingly distant reference genomes, and retrieved large structural variations in the samples. Because of its targeted nature, it outperformed standard metagenomic assemblers in terms of both time and assembly quality.
- Subjects :
- Whole genome sequencing
0303 health sciences
[SDV.GEN]Life Sciences [q-bio]/Genetics
Contig
030302 biochemistry & molecular biology
Sequence assembly
food and beverages
Standard Article
Computational biology
biochemical phenomena, metabolism, and nutrition
Biology
biology.organism_classification
Genome
03 medical and health sciences
Metagenomics
[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
Buchnera
030304 developmental biology
Reference genome
Details
- Language :
- English
- ISSN :
- 26319268
- Database :
- OpenAIRE
- Journal :
- NAR Genomics and Bioinformatics, Oxford University Press, 2020, 2 (3), pp.1-11. ⟨10.1093/nargab/lqaa047⟩
- Accession number :
- edsair.doi.dedup.....09565bb040feb40528fb6304aba72c6f
- Full Text :
- https://doi.org/10.1093/nargab/lqaa047