Back to Search Start Over

Compressed text indexing with wildcards.

Authors :
Hon, Wing-Kai
Ku, Tsung-Han
Shah, Rahul
Thankachan, Sharma V.
Vitter, Jeffrey Scott
Source :
Journal of Discrete Algorithms; Mar2013, Vol. 19, p23-29, 7p
Publication Year :
2013

Abstract

Abstract: Let be a text of total length n, where characters of each are chosen from an alphabet Σ of size σ, and ϕ denotes a wildcard symbol. The text indexing with wildcards problem is to index T such that when we are given a query pattern P, we can locate the occurrences of P in T efficiently. This problem has been applied in indexing genomic sequences that contain single-nucleotide polymorphisms (SNP) because SNP can be modeled as wildcards. Recently Tam et al. (2009) and Thachuk (2011) have proposed succinct indexes for this problem. In this paper, we present the first compressed index for this problem, which takes only bits of space, where is the hth-order empirical entropy () of T. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
15708667
Volume :
19
Database :
Supplemental Index
Journal :
Journal of Discrete Algorithms
Publication Type :
Academic Journal
Accession number :
86394675
Full Text :
https://doi.org/10.1016/j.jda.2012.12.003