Back to Search Start Over

Adaptive Computation of the Swap-Insert Correction Distance

Authors :
Barbay, J��r��my
P��rez-Lantero, Pablo
Publication Year :
2015
Publisher :
arXiv, 2015.

Abstract

The Swap-Insert Correction distance from a string $S$ of length $n$ to another string $L$ of length $m\geq n$ on the alphabet $[1..d]$ is the minimum number of insertions, and swaps of pairs of adjacent symbols, converting $S$ into $L$. Contrarily to other correction distances, computing it is NP-Hard in the size $d$ of the alphabet. We describe an algorithm computing this distance in time within $O(d^2 nm g^{d-1})$, where there are $n_��$ occurrences of $��$ in $S$, $m_��$ occurrences of $��$ in $L$, and where $g=\max_{��\in[1..d]} \min\{n_��,m_��-n_��\}$ measures the difficulty of the instance. The difficulty $g$ is bounded by above by various terms, such as the length of the shortest string $S$, and by the maximum number of occurrences of a single character in $S$. Those results illustrate how, in many cases, the correction distance between two strings can be easier to compute than in the worst case scenario.<br />16 pages, no figures, long version of the extended abstract accepted to SPIRE 2015

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........31b4a4c0ce5646cbf1391e05745e6314
Full Text :
https://doi.org/10.48550/arxiv.1504.07298