Back to Search Start Over

The Rate of Convergence of the Mean Length of the Longest Common Subsequence

Authors :
Kenneth S. Alexander
Source :
Ann. Appl. Probab. 4, no. 4 (1994), 1074-1082
Publication Year :
1994
Publisher :
Institute of Mathematical Statistics, 1994.

Abstract

Given two i.i.d. sequences of $n$ letters from a finite alphabet, one can consider the length $L_n$ of the longest sequence which is a subsequence of both the given sequences. It is known that $EL_n$ grows like $\gamma n$ for some $\gamma \in \lbrack 0, 1\rbrack$. Here it is shown that $\gamma n \geq EL_n \geq \gamma n - C(n \log n)^{1/2}$ for an explicit numerical constant $C$ which does not depend on the distribution of the letters. In simulations with $n = 100,000, EL_n/n$ can be determined from $k$ such trials with 95% confidence to within $0.0055/\sqrt k$, and the results here show that $\gamma$ can then be determined with 95% confidence to within $0.0225 + 0.0055/\sqrt k$, for an arbitrary letter distribution.

Details

ISSN :
10505164
Volume :
4
Database :
OpenAIRE
Journal :
The Annals of Applied Probability
Accession number :
edsair.doi.dedup.....bc689db3b45667fb8e7d25ab10b4e56a
Full Text :
https://doi.org/10.1214/aoap/1177004903