Back to Search Start Over

Approximate all-pairs suffix/prefix overlaps

Authors :
Välimäki, Niko
Ladra, Susana
Mäkinen, Veli
Source :
Information & Computation. Apr2012, Vol. 213, p49-58. 10p.
Publication Year :
2012

Abstract

Abstract: Finding approximate overlaps is the first phase of many sequence assembly methods. Given a set of strings of total length n and an error-rate ϵ, the goal is to find, for all-pairs of strings, their suffix/prefix matches (overlaps) that are within edit distance , where ℓ is the length of the overlap. We propose a new solution for this problem based on backward backtracking (Lam, et al., 2008) and suffix filters (Kärkkäinen and Na, 2008). Our technique uses bits of space, where is the k-th order entropy and σ the alphabet size. In practice, it is more scalable in terms of space, and comparable in terms of time, than q-gram filters (Rasmussen, et al., 2006). Our method is also easy to parallelize and scales up to millions of DNA reads. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
08905401
Volume :
213
Database :
Academic Search Index
Journal :
Information & Computation
Publication Type :
Academic Journal
Accession number :
73498957
Full Text :
https://doi.org/10.1016/j.ic.2012.02.002