Back to Search
Start Over
Sample-Based Distance-Approximation for Subsequence-Freeness.
- Source :
-
Algorithmica . Aug2024, Vol. 86 Issue 8, p2519-2556. 38p. - Publication Year :
- 2024
-
Abstract
- In this work, we study the problem of approximating the distance to subsequence-freeness in the sample-based distribution-free model. For a given subsequence (word) w = w 1 ... w k , a sequence (text) T = t 1 ... t n is said to contain w if there exist indices 1 ≤ i 1 < ⋯ < i k ≤ n such that t i j = w j for every 1 ≤ j ≤ k . Otherwise, T is w-free. Ron and Rosin (ACM Trans Comput Theory 14(4):1–31, 2022) showed that the number of samples both necessary and sufficient for one-sided error testing of subsequence-freeness in the sample-based distribution-free model is Θ (k / ϵ) . Denoting by Δ (T , w , p) the distance of T to w-freeness under a distribution p : [ n ] → [ 0 , 1 ] , we are interested in obtaining an estimate Δ ^ , such that | Δ ^ - Δ (T , w , p) | ≤ δ with probability at least 2/3, for a given error parameter δ . Our main result is a sample-based distribution-free algorithm whose sample complexity is O ~ (k 2 / δ 2) . We first present an algorithm that works when the underlying distribution p is uniform, and then show how it can be modified to work for any (unknown) distribution p. We also show that a quadratic dependence on 1 / δ is necessary. [ABSTRACT FROM AUTHOR]
- Subjects :
- *ALGORITHMS
*GUMS & resins
*PROBABILITY theory
Subjects
Details
- Language :
- English
- ISSN :
- 01784617
- Volume :
- 86
- Issue :
- 8
- Database :
- Academic Search Index
- Journal :
- Algorithmica
- Publication Type :
- Academic Journal
- Accession number :
- 178655125
- Full Text :
- https://doi.org/10.1007/s00453-024-01233-4