Back to Search Start Over

Efficient probabilistic supergraph search

Authors :
Xuemin Lin
Ying Zhang
Gaoping Zhu
Wenjie Zhang
Ke Zhu
Source :
ICDE
Publication Year :
2016
Publisher :
IEEE, 2016.

Abstract

Given a query graph $q$ , retrieving the data graphs $g$ from a set $D$ of data graphs such that $q$ contains $g$ , namely supergraph containment search, is fundamental in graph data analysis with a wide range of real applications. It is very challenging due to the NP-Completeness of subgraph isomorphism testing. Driven by many real applications, in this paper, we study the problem of probabilistic supergraph search; that is, given a set $D$ of uncertain data graphs, a certain query graph $q$ and a probability threshold $\theta$ , we retrieve the data graphs $g^{u}$ from $D$ such that the probability of $q$ containing $g^{u}$ is not smaller than $\theta$ . We show that besides the NP-Completeness of subgraph isomorphism testing, the problem of calculating probabilities is #P-Complete; thus, it is even more challenging than the supergraph containment search. To tackle the computational hardness, we first propose two novel pruning rules, based on probabilistic connectivity and features, respectively, to efficiently prune non-promising data graphs. Then, efficient verification algorithms are developed with the aim of sharing computation and terminating non-promising computation as early as possible. Extensive performance studies on both real and synthetic data demonstrate the efficiency and effectiveness of our techniques in practice.

Details

Database :
OpenAIRE
Journal :
2016 IEEE 32nd International Conference on Data Engineering (ICDE)
Accession number :
edsair.doi.dedup.....ded735f8640cd5c63a4f402cd5f977f9
Full Text :
https://doi.org/10.1109/icde.2016.7498415