Back to Search Start Over

Linear Time Algorithms for Generalizations of the Longest Common Substring Problem.

Authors :
Arnold, Michael
Ohlebusch, Enno
Source :
Algorithmica. Aug2011, Vol. 60 Issue 4, p806-818. 13p. 1 Chart, 1 Graph.
Publication Year :
2011

Abstract

In its simplest form, the longest common substring problem is to find a longest substring common to two or multiple strings. Using (generalized) suffix trees, this problem can be solved in linear time and space. A first generalization is the k -common substring problem: Given m strings of total length n, for all k with 2≤ k≤ m simultaneously find a longest substring common to at least k of the strings. It is known that the k-common substring problem can also be solved in O( n) time (Hui in Proc. 3rd Annual Symposium on Combinatorial Pattern Matching, volume 644 of Lecture Notes in Computer Science, pp. 230-243, Springer, Berlin, ). A further generalization is the k -common repeated substring problem: Given m strings T, T,..., T of total length n and m positive integers x,..., x, for all k with 1≤ k≤ m simultaneously find a longest string ω for which there are at least k strings $T^{(i_{1})},T^{(i_{2})},\ldots,T^{(i_{k})}$ (1≤ i< i< ⋅⋅⋅< i≤ m) such that ω occurs at least $x_{i_{j}}$ times in $T^{(i_{j})}$ for each j with 1≤ j≤ k. (For x= ⋅⋅⋅= x=1, we have the k-common substring problem.) In this paper, we present the first O( n) time algorithm for the k-common repeated substring problem. Our solution is based on a new linear time algorithm for the k-common substring problem. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01784617
Volume :
60
Issue :
4
Database :
Academic Search Index
Journal :
Algorithmica
Publication Type :
Academic Journal
Accession number :
60392572
Full Text :
https://doi.org/10.1007/s00453-009-9369-1