1. Improved Lower Bounds for Graph Edit Distance
- Author
-
David Blumenthal and Johann Gamper
- Subjects
Graph database ,Matching (graph theory) ,Computer science ,Approximation algorithm ,02 engineering and technology ,Strength of a graph ,computer.software_genre ,Upper and lower bounds ,Graph ,Computer Science Applications ,Combinatorics ,Graph bandwidth ,Computational Theory and Mathematics ,Graph power ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Graph edit distance ,020201 artificial intelligence & image processing ,Edit distance ,Graph operations ,computer ,Information Systems - Abstract
The problem of deriving lower and upper bounds for the edit distance between undirected, labeled graphs has recently received increasing attention. However, only one algorithm has been proposed that allegedly computes not only an upper but also a lower bound for non-uniform edit costs and incorporates information about both node and edge labels. In this paper, we demonstrate that this algorithm is incorrect. We present a corrected version $\mathsf {B\scriptstyle{RANCH}}$ that runs in $\mathcal{O}(n^2\Delta ^3+n^3)$ time, where $\Delta$ is the maximum of the maximum degrees of input graphs $G$ and $H$ . We also develop a speed-up $\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$ that runs in $\mathcal{O}(n^2\Delta ^2+n^3)$ time and computes an only slightly less accurate lower bound. The lower bounds produced by $\mathsf {B\scriptstyle{RANCH}}$ and $\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$ are shown to be pseudo-metrics on a collection of graphs. Finally, we suggest an anytime algorithm $\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$ that iteratively improves $\mathsf {B\scriptstyle{RANCH}}$ ’s lower bound. $\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$ runs in $\mathcal{O}(n^3\Delta ^2+I(n^2\Delta ^3+n^3))$ time, where the number of iterations $I$ is controlled by the user. A detailed experimental evaluation shows that all suggested algorithms are Pareto optimal, that they are very effective when used as filters for edit distance range queries, and that they perform excellently when used within classification frameworks.
- Published
- 2018