Back to Search
Start Over
Querying Web-Scale Information Networks Through Bounding Matching Scores
- Source :
- WWW
- Publication Year :
- 2015
- Publisher :
- International World Wide Web Conferences Steering Committee, 2015.
-
Abstract
- Web-scale information networks containing billions of entities are common nowadays. Querying these networks can be modeled as a subgraph matching problem. Since information networks are incomplete and noisy in nature, it is important to discover answers that match exactly as well as answers that are similar to queries. Existing graph matching algorithms usually use graph indices to improve the efficiency of query processing. For web-scale information networks, it may not be feasible to build the graph indices due to the amount of work and the memory/storage required. In this paper, we propose an efficient algorithm for finding the best k answers for a given query without precomputing graph indices. The quality of an answer is measured by a matching score that is computed online. To speed up query processing, we propose a novel technique for bounding the matching scores during the computation. By using bounds, we can efficiently prune the answers that have low qualities without having to evaluate all possible answers. The bounding technique can be implemented in a distributed environment, allowing our approach to efficiently answer the queries on web-scale information networks. We demonstrate the effectiveness and the efficiency of our approach through a series of experiments on real-world information networks. The result shows that our bounding technique can reduce the running time up to two orders of magnitude comparing to an approach that does not use bounds.
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 24th International Conference on World Wide Web
- Accession number :
- edsair.doi...........6610869b8aedc2512cf7e5b8bec31a8c
- Full Text :
- https://doi.org/10.1145/2736277.2741131