Back to Search
Start Over
Fast construction of space-optimized recursive automaton
- Source :
- Software: Practice and Experience. 45:783-799
- Publication Year :
- 2014
- Publisher :
- Wiley, 2014.
-
Abstract
- Finite-state automata are important components in information retrieval and natural language processing software. A recursive automaton is the most compact representation of the acyclic deterministic finite-state automata. It is based on merging not only the equivalent states but also the identical substructures in an automaton. The LZ trie variant is the state-of-the-art in automata compression regarding space, but the time needed for its construction was, until now, quadratic, which has made it impractical for large inputs. In this paper, we present the first algorithm for LZ trie construction that runs in effectively linear time thereby making it an attractive choice for finite-state automata implementation. We achieve this goal by adding a new functionality to the enhanced suffix array data structure. We present two variants of the construction procedure - an optimal method regarding the final size and a method that sacrifices some compression for low intermediate memory usage. We have made the implementation of our algorithms available in an open source software package LzLex.Copyright © 2014 John Wiley & Sons, Ltd.
- Subjects :
- Theoretical computer science
Nested stack automaton
Computer science
Timed automaton
Büchi automaton
ω-automaton
law.invention
Nondeterministic finite automaton with ε-moves
DFA minimization
Deterministic automaton
law
Trie
Continuous spatial automaton
Quantum finite automata
Two-way deterministic finite automaton
Nondeterministic finite automaton
Time complexity
Recursion
Finite-state machine
Powerset construction
GrowCut algorithm
Continuous automaton
Pushdown automaton
Suffix array
Mobile automaton
Automaton
Deterministic finite automaton
Automata theory
Algorithm
Software
Subjects
Details
- ISSN :
- 00380644
- Volume :
- 45
- Database :
- OpenAIRE
- Journal :
- Software: Practice and Experience
- Accession number :
- edsair.doi...........63e86dff674bed693927ebcc405a8bdc