Start Over

Querying Web-Scale Information Networks Through Bounding Matching Scores

Authors :: Jiahui Jin
Junzhou Luo
Lixin Gao
Samamon Khemmarat
Source :: WWW
Publication Year :: 2015
Publisher :: International World Wide Web Conferences Steering Committee, 2015.
Abstract: Web-scale information networks containing billions of entities are common nowadays. Querying these networks can be modeled as a subgraph matching problem. Since information networks are incomplete and noisy in nature, it is important to discover answers that match exactly as well as answers that are similar to queries. Existing graph matching algorithms usually use graph indices to improve the efficiency of query processing. For web-scale information networks, it may not be feasible to build the graph indices due to the amount of work and the memory/storage required. In this paper, we propose an efficient algorithm for finding the best k answers for a given query without precomputing graph indices. The quality of an answer is measured by a matching score that is computed online. To speed up query processing, we propose a novel technique for bounding the matching scores during the computation. By using bounds, we can efficiently prune the answers that have low qualities without having to evaluate all possible answers. The bounding technique can be implemented in a distributed environment, allowing our approach to efficiently answer the queries on web-scale information networks. We demonstrate the effectiveness and the efficiency of our approach through a series of experiments on real-world information networks. The result shows that our bounding technique can reduce the running time up to two orders of magnitude comparing to an approach that does not use bounds.

Subjects :: Speedup
Theoretical computer science
Matching (graph theory)
Bounding overwatch
Computer science
Computation
Graph (abstract data type)
Information networks
Data mining
computer.software_genre
computer
Graph

Details

Database :: OpenAIRE
Journal :: Proceedings of the 24th International Conference on World Wide Web
Accession number :: edsair.doi...........6610869b8aedc2512cf7e5b8bec31a8c
Full Text :: https://doi.org/10.1145/2736277.2741131