MTG-Link: leveraging barcode information from linked-reads to assemble specific loci
- Source :
- BMC Bioinformatics, 2023, 24, pp.17. ⟨10.1186/s12859-023-05395-w⟩
- Publication Year :
- 2022
- Publisher :
- HAL CCSD, 2022.
Abstract
- Background: Local assembly with short and long reads has proven to be very useful in many applications: reconstruction of the sequence of a locus of interest, gap-filling in draft assemblies, as well as alternative allele reconstruction of large insertion variants. Whereas linked-read technologies have a great potential to assemble specific loci as they provide long-range information while maintaining the power and accuracy of short-read sequencing, there is a lack of local assembly tools for linked-read data.
- Results: We present MTG-Link, a novel local assembly tool dedicated to linked-reads. The originality of the method lies in its read subsampling step which takes advantage of the barcode information contained in linked-reads mapped in flanking regions. We validated our approach on several datasets from different linked-read technologies. We show that MTG-Link is able to assemble successfully large sequences, up to dozens of Kb. We also demonstrate that the read subsampling step of MTG-Link considerably improves the local assembly of specific loci compared to other existing short-read local assembly tools. Furthermore, MTG-Link was able to fully characterize large insertion variants in a human genome and improved the contiguity of a 1.3 Mb locus of biological interest in several individual genomes of the mimetic butterfly (Heliconius numata).
- Conclusions: MTG-Link is an efficient local assembly tool designed for different linked-read sequencing technologies. MTG-Link source code is available at https://github.com/anne-gcd/MTG-Link and as a Bioconda package.
- Contact: anne.guichard@irisa.fr
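The read subsampling idea described in the abstract (collect the barcodes of linked-reads mapped in the flanking regions, then keep every read carrying one of those barcodes) can be illustrated with a minimal sketch. This is not MTG-Link's actual code: the function names, the dictionary-based read representation, and the `min_occurrences` threshold are all assumptions made for illustration.

```python
from collections import Counter


def barcodes_in_flanks(mapped_reads, flanks, min_occurrences=1):
    """Return the barcodes seen at least `min_occurrences` times in the flanks.

    `mapped_reads` is an iterable of dicts with keys "barcode", "chrom", "pos"
    (hypothetical representation). `flanks` is a list of (chrom, start, end)
    half-open intervals surrounding the target locus.
    """
    counts = Counter()
    for read in mapped_reads:
        for chrom, start, end in flanks:
            # A read counts once if its mapping position falls in any flank.
            if read["chrom"] == chrom and start <= read["pos"] < end:
                counts[read["barcode"]] += 1
                break
    return {bc for bc, n in counts.items() if n >= min_occurrences}


def subsample_reads(all_reads, barcodes):
    """Keep every read whose barcode was observed in the flanking regions.

    Reads mapped elsewhere, or not mapped at all, are retained as long as
    they share a selected barcode.
    """
    return [read for read in all_reads if read["barcode"] in barcodes]


# Toy data (fabricated for illustration only).
reads = [
    {"name": "r1", "barcode": "BX1", "chrom": "chr1", "pos": 950},
    {"name": "r2", "barcode": "BX1", "chrom": "chr1", "pos": 5000},
    {"name": "r3", "barcode": "BX2", "chrom": "chr1", "pos": 2100},
    {"name": "r4", "barcode": "BX3", "chrom": "chr2", "pos": 1000},
    {"name": "r5", "barcode": "BX2", "chrom": None, "pos": None},
]
flanks = [("chr1", 500, 1000), ("chr1", 2000, 2500)]

selected_barcodes = barcodes_in_flanks(reads, flanks)
selected_reads = subsample_reads(reads, selected_barcodes)
```

In this toy case, reads `r2` and `r5` are kept even though they do not map inside a flank, because they share a barcode with a read that does. The selected subset would then be handed to a local assembler.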
Details
- Language :
- English
- ISSN :
- 1471-2105
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics, 2023, 24, pp.17. ⟨10.1186/s12859-023-05395-w⟩
- Accession number :
- edsair.doi.dedup.....59c9d4596f4e41ebf618cc7c5e1690e6
- Full Text :
- https://doi.org/10.1186/s12859-023-05395-w