Back to Search Start Over

Detecting mutations by eBWT

Authors :
Prezza, Nicola
Pisanti, Nadia
Sciortino, Marinella
Rosone, Giovanna
Parida, L
Ukkonen, E
Prezza, Nicola
Pisanti, Nadia
Sciortino, Marinella
Rosone, Giovanna
Dipartimento di Informatica [Pisa] (DI)
University of Pisa - Università di Pisa
Equipe de recherche européenne en algorithmique et biologie formelle et expérimentale (ERABLE)
Inria Grenoble - Rhône-Alpes
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
Università degli studi di Palermo - University of Palermo
Source :
Workshop on Algorithms in Bioinformatics, Workshop on Algorithms in Bioinformatics, 2018, Helsinki, Finland. ⟨10.4230/LIPIcs.CVIT.2016.23⟩
Publication Year :
2018
Publisher :
Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2018.

Abstract

In this paper we develop a theory describing how the extended Burrows-Wheeler Transform (eBWT) of a collection of DNA fragments tends to cluster together the copies of nucleotides sequenced from a genome G. Our theory accurately predicts how many copies of any nucleotide are expected inside each such cluster, and how an elegant and precise LCP array based procedure can locate these clusters in the eBWT. Our findings are very general and can be applied to a wide range of different problems. In this paper, we consider the case of alignment-free and reference-free SNPs discovery in multiple collections of reads. We note that, in accordance with our theoretical results, SNPs are clustered in the eBWT of the reads collection, and we develop a tool finding SNPs with a simple scan of the eBWT and LCP arrays. Preliminary results show that our method requires much less coverage than state-of-the-art tools while drastically improving precision and sensitivity.<br />Comment: simplified Proposition 4; extended Thm 2 to ambiguous clusters

Details

Language :
English
Database :
OpenAIRE
Journal :
Workshop on Algorithms in Bioinformatics, Workshop on Algorithms in Bioinformatics, 2018, Helsinki, Finland. ⟨10.4230/LIPIcs.CVIT.2016.23⟩
Accession number :
edsair.doi.dedup.....2bf78b23dcd4d38e8d691fa4949be4f4
Full Text :
https://doi.org/10.4230/LIPIcs.CVIT.2016.23⟩