Back to Search
Start Over
Finishing bacterial genome assemblies with Mix
- Source :
- HAL, BMC Bioinformatics, BMC Bioinformatics, BioMed Central, 2013, pp.doi:10.1186/1471-2105-14-S15-S16, 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, Institut National de Recherche en Informatique et en Automatique (INRIA). FRA., Oct 2013, Lyon, France. ⟨10.1186/1471-2105-14-S15-S16⟩, Scopus-Elsevier, Eleventh Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics. (14 (suppl. 15))2013; 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, Lyon, FRA, 2013-10-17-2013-10-19, 1-11
- Publication Year :
- 2013
- Publisher :
- BioMed Central Ltd, 2013.
-
Abstract
- Among challenges that hamper reaping the benefits of genome assembly are both unfinished assemblies and the ensuing experimental costs. First, numerous software solutions for genome de novo assembly are available, each having its advantages and drawbacks, without clear guidelines as to how to choose among them. Second, these solutions produce draft assemblies that often require a resource intensive finishing phase. In this paper we address these two aspects by developing Mix , a tool that mixes two or more draft assemblies, without relying on a reference genome and having the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a set of paths in the extension graph that maximizes the cumulative contig length. We evaluate the performance of Mix on bacterial NGS data from the GAGE-B study and apply it to newly sequenced Mycoplasma genomes. Resulting final assemblies demonstrate a significant improvement in the overall assembly quality. In particular, Mix is consistent by providing better overall quality results even when the choice is guided solely by standard assembly statistics, as is the case for de novo projects. Mix is implemented in Python and is available at https://github.com/cbib/MIX , novel data for our Mycoplasma study is available at http://services.cbib.u-bordeaux2.fr/mix/ .
- Subjects :
- [SDV]Life Sciences [q-bio]
Sequence assembly
Bacterial genome size
Biology
computer.software_genre
Genome
Biochemistry
Market fragmentation
03 medical and health sciences
Mycoplasma
Structural Biology
Software Design
génome bactérien
[SDV.BV]Life Sciences [q-bio]/Vegetal Biology
Molecular Biology
ComputingMilieux_MISCELLANEOUS
030304 developmental biology
computer.programming_language
H20 - Maladies des plantes
0303 health sciences
Contig
030306 microbiology
business.industry
qualité
U10 - Informatique, mathématiques et statistiques
Applied Mathematics
High-Throughput Nucleotide Sequencing
Sequence Analysis, DNA
Python (programming language)
[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]
Biotechnology
Computer Science Applications
séquence du génome
Proceedings
Graph (abstract data type)
Data mining
[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
business
computer
Algorithms
Genome, Bacterial
Reference genome
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Database :
- OpenAIRE
- Journal :
- HAL, BMC Bioinformatics, BMC Bioinformatics, BioMed Central, 2013, pp.doi:10.1186/1471-2105-14-S15-S16, 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, Institut National de Recherche en Informatique et en Automatique (INRIA). FRA., Oct 2013, Lyon, France. ⟨10.1186/1471-2105-14-S15-S16⟩, Scopus-Elsevier, Eleventh Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics. (14 (suppl. 15))2013; 11. Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomics, Lyon, FRA, 2013-10-17-2013-10-19, 1-11
- Accession number :
- edsair.doi.dedup.....45c236426671aef226eabaea9fdbbb93
- Full Text :
- https://doi.org/10.1186/1471-2105-14-S15-S16