Back to Search
Start Over
On Table Arrangements, Scrabble Freaks, and Jumbled Pattern Matching
- Source :
- Lecture Notes in Computer Science ISBN: 9783642131219, FUN
- Publication Year :
- 2010
- Publisher :
- Springer, 2010.
-
Abstract
- Given a string s, the Parikh vector of s, denoted p(s), counts the multiplicity of each character in s. Searching for a match of Parikh vector q (a “jumbled string”) in the text s requires to find a substring t of s with p(t) = q. The corresponding decision problem is to verify whether at least one such match exists. So, for example for the alphabet Σ = {a, b, c}, the string s = abaccbabaaa has Parikh vector p(s) = (6,3,2), and the Parikh vector q = (2,1,1) appears once in s in position (1,4). Like its more precise counterpart, the renown Exact String Matching, Jumbled Pattern Matching has ubiquitous applications, e.g., string matching with a dyslectic word processor, table rearrangements, anagram checking, Scrabble playing and, allegedly, also analysis of mass spectrometry data. We consider two simple algorithms for Jumbled Pattern Matching and use very complicated data structures and analytic tools to show that they are not worse than the most obvious algorithm. We also show that we can achieve non-trivial efficient average case behavior, but that’s less fun to describe in this abstract so we defer the details to the main part of the article, to be read at the reader’s risk...well, at the reader’s discretion.
- Subjects :
- Discrete mathematics
Parikh vectors, jumbled pattern matching, scrabble, approximate pattern matching
Anagram
Parikh vectors
String searching algorithm
Approximate string matching
Decision problem
algorithms
Data structure
Jumbled Pattern Matching
Substring
scrabble
approximate pattern matching
String Matching
Wavelet Tree
Pattern matching
Mathematics
Subjects
Details
- Language :
- English
- ISBN :
- 978-3-642-13121-9
- ISBNs :
- 9783642131219
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science ISBN: 9783642131219, FUN
- Accession number :
- edsair.doi.dedup.....649c1fc6f0daebb0dfe6b6ad92d1ebf6