Back to Search
Start Over
Chromosome assembly of large and complex genomes using multiple references
- Publication Year :
- 2016
- Publisher :
- Cold Spring Harbor Laboratory, 2016.
-
Abstract
- Despite the rapid development of sequencing technologies, assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout, a reference-assisted assembly tool that now works for large and complex genomes. Taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. Using Ragout, we transformed NGS assemblies of 15 differentMus musculusand oneMus spretusgenomes into sets of complete chromosomes, leaving less than 5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long PacBio reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. Additionally, we applied Ragout toMus caroliandMus paharigenomes, which exhibit karyotype-scale variations compared to other genomes from theMuridaefamily. Chromosome color maps confirmed most large-scale rearrangements that Ragout detected.
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....db39d473a1837d010d747870a6ba3911
- Full Text :
- https://doi.org/10.1101/088435