DAWGs for Parameterized Matching: Online Construction and Related Indexing Structures
- Publication Year :
- 2020
- Publisher :
- Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020.
- Abstract :
- Two strings x and y over Σ ∪ Π of equal length are said to parameterized match (p-match) if there is a renaming bijection f: Σ ∪ Π → Σ ∪ Π that is the identity on Σ and transforms x to y (or vice versa). The p-matching problem is to find the substrings of a text that p-match a given pattern. In this paper, we propose parameterized suffix automata (p-suffix automata) and parameterized directed acyclic word graphs (PDAWGs), which are the p-matching versions of suffix automata and DAWGs. While suffix automata and DAWGs are equivalent for standard strings, we show that p-suffix automata can have Θ(n²) nodes and edges but PDAWGs have only O(n) nodes and edges, where n is the length of the input string. We also give an O(n |Π| log (|Π| + |Σ|))-time, O(n)-space algorithm that builds the PDAWG in a left-to-right online manner. As a byproduct, it is shown that the parameterized suffix tree for the reversed string can also be built in the same time and space, in a right-to-left online manner.
- LIPIcs, Vol. 161, 31st Annual Symposium on Combinatorial Pattern Matching (CPM 2020), pages 26:1-26:14
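The p-match definition in the abstract can be illustrated with a minimal Python sketch. This is a naive check and an O(nm) scan written for illustration only; it is not the PDAWG construction proposed in the paper. The function names `p_match` and `p_match_occurrences` and the `static` argument (the set Σ; every other character is assumed to belong to Π) are hypothetical.

```python
def p_match(x, y, static):
    """Return True if x and y parameterized-match.

    `static` is the set of static characters (Sigma); every other
    character is treated as a parameter (Pi). This is an assumption
    of the sketch, not notation taken from the paper.
    """
    if len(x) != len(y):
        return False
    fwd, bwd = {}, {}  # partial renaming f and its inverse
    for a, b in zip(x, y):
        if a in static or b in static:
            # f is the identity on Sigma, so static characters must agree.
            if a != b:
                return False
            continue
        # Parameters must be renamed consistently and injectively.
        if fwd.setdefault(a, b) != b or bwd.setdefault(b, a) != a:
            return False
    return True


def p_match_occurrences(text, pattern, static):
    """Naive scan: start positions of substrings of text that p-match pattern."""
    m = len(pattern)
    return [i for i in range(len(text) - m + 1)
            if p_match(text[i:i + m], pattern, static)]
```

For example, with Σ = {a}, the strings `xaxy` and `yayz` p-match under the renaming x → y, y → z, whereas `xaxy` and `yayy` do not, because two distinct parameters would be mapped to the same one.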
- Subjects :
- parameterized matching
050101 languages & linguistics
05 social sciences
DAWGs
0202 electrical engineering, electronic engineering, information engineering
suffix trees
020201 artificial intelligence & image processing
0501 psychology and cognitive sciences
suffix automata
02 engineering and technology
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi...........b2557c6377e340310e56428e0eb36c4b
- Full Text :
- https://doi.org/10.4230/lipics.cpm.2020.26