1. A weak approach to suffix automata simulation for exact and approximate string matching.
- Author
-
Faro, Simone and Scafiti, Stefano
- Subjects
- *
PATTERN matching , *AUTOMATIC speech recognition , *DATA compression , *SUFFIXES & prefixes (Grammar) , *COMPUTER science , *COMPUTATIONAL chemistry , *ROBOTS - Abstract
String matching is one of the most extensively studied problems in computer science, mainly due to its direct applications to such diverse areas as text, image and signal processing, speech analysis and recognition, information retrieval, data compression, computational biology and chemistry. In the last few decades a myriad of alternative solutions have been proposed, based on very different techniques. However, automata have always played a very important role in the design of efficient string matching algorithms. In this paper we present the Range Automaton , a weak yet efficient variant of the non-deterministic suffix automaton of a string whose configuration can be encoded in a very simple form and which is particularly suitable to be used for solving a multitude of text-searching problems. We will firstly develop our approach in the case of exact string matching and present an efficient algorithm, named Backward Range Automaton Matcher, which turns out to be very fast in many practical cases. Later, we will show how the Range Automaton can be adapted in an effective way also to non-standard string matching problems such as swap matching and multiple string matching. Experimental results suggest that our approach is flexible and effective for all three search problems addressed, especially in the case of long patterns. • We present a simple and efficient technique for simulating the suffix automaton of a string. • The new technique leads to the definition of an automaton, called Range Automaton, for the weak recognition of the suffixes of a string. • We apply the Range Automaton to the exact string matching problem, obtaining an efficient and flexible solution in practice, especially in the case of long patterns. • We extend the application of the Range Automaton to the swap matching problem, a special case of approximate string matching. • We extend the application of the Range Automaton to the problem of multiple pattern matching. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF