Back to Search
Start Over
Efficient probabilistic supergraph search
- Source :
- ICDE
- Publication Year :
- 2016
- Publisher :
- IEEE, 2016.
-
Abstract
- Given a query graph $q$ , retrieving the data graphs $g$ from a set $D$ of data graphs such that $q$ contains $g$ , namely supergraph containment search, is fundamental in graph data analysis with a wide range of real applications. It is very challenging due to the NP-Completeness of subgraph isomorphism testing. Driven by many real applications, in this paper, we study the problem of probabilistic supergraph search; that is, given a set $D$ of uncertain data graphs, a certain query graph $q$ and a probability threshold $\theta$ , we retrieve the data graphs $g^{u}$ from $D$ such that the probability of $q$ containing $g^{u}$ is not smaller than $\theta$ . We show that besides the NP-Completeness of subgraph isomorphism testing, the problem of calculating probabilities is #P-Complete; thus, it is even more challenging than the supergraph containment search. To tackle the computational hardness, we first propose two novel pruning rules, based on probabilistic connectivity and features, respectively, to efficiently prune non-promising data graphs. Then, efficient verification algorithms are developed with the aim of sharing computation and terminating non-promising computation as early as possible. Extensive performance studies on both real and synthetic data demonstrate the efficiency and effectiveness of our techniques in practice.
- Subjects :
- 0301 basic medicine
Theoretical computer science
Computer science
Subgraph isomorphism problem
02 engineering and technology
Tree-depth
law.invention
03 medical and health sciences
law
020204 information systems
Line graph
0202 electrical engineering, electronic engineering, information engineering
Graph isomorphism
Universal graph
Forbidden graph characterization
Discrete mathematics
Epigraph
Containment (computer programming)
Uncertain data
Probabilistic logic
Graph
Computer Science Applications
030104 developmental biology
Computational Theory and Mathematics
Enhanced Data Rates for GSM Evolution
MathematicsofComputing_DISCRETEMATHEMATICS
Information Systems
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2016 IEEE 32nd International Conference on Data Engineering (ICDE)
- Accession number :
- edsair.doi.dedup.....ded735f8640cd5c63a4f402cd5f977f9
- Full Text :
- https://doi.org/10.1109/icde.2016.7498415