Back to Search Start Over

Sequence similarity measures based on bounded hamming distance.

Authors :
Apostolico, Alberto
Guerra, Concettina
Landau, Gad M.
Pizzi, Cinzia
Source :
Theoretical Computer Science. Jul2016, Vol. 638, p76-90. 15p.
Publication Year :
2016

Abstract

A growing number of measures of sequence similarity are being based on some underlying notion of relative compressibility. Within this paradigm, similar sequences are expected to share a large number of common substrings, or subsequences, or more complex patterns or motifs , and so on. In this paper, measures of sequence similarity are introduced and studied in which patterns in a pair are considered similar if they coincide up to a preset number of mismatches, that is, within a bounded Hamming distance. It is shown here that for some such measures bounds are achievable that are slightly better than O ( n 2 ) . Preliminary experiments demonstrate the potential applicability to phylogeny and classification of these similarity measures. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
03043975
Volume :
638
Database :
Academic Search Index
Journal :
Theoretical Computer Science
Publication Type :
Academic Journal
Accession number :
115941908
Full Text :
https://doi.org/10.1016/j.tcs.2016.01.023