Back to Search
Start Over
Querying Web-Scale Knowledge Graphs Through Effective Pruning of Search Space
- Source :
- IEEE Transactions on Parallel and Distributed Systems. 28:2342-2356
- Publication Year :
- 2017
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2017.
-
Abstract
- Web-scale knowledge graphs containing billions of entities are common nowadays. Querying these graphs can be modeled as a subgraph matching problem. Since knowledge graphs are incomplete and noisy in nature, it is important to discover answers matching exactly as well as answers similar to queries. Existing graph matching algorithms usually use graph indices to accelerate query processing. For billion-node graphs, it may be infeasible to build the graph indices due to the amount of work and the memory/storage required. In this paper, we propose an efficient algorithm for finding the best $k$ answers for a given query without precomputing graph indices. An answer’s quality is measured by a matching score that is computed online. To accelerate query processing, we propose a novel technique for bounding the matching scores during the computation. By using bounds, the low quality answers can be efficiently pruned. The bounding technique can be implemented in a distributed environment, allowing our approach to efficiently query web-scale knowledge graphs. We evaluate the effectiveness and the efficiency of our approach on real-world datasets. The result shows that our bounding technique can reduce the running time up to two orders of magnitude comparing to an approach without using bounds.
- Subjects :
- Theoretical computer science
Matching (graph theory)
Computer science
Computation
Knowledge engineering
02 engineering and technology
computer.file_format
computer.software_genre
Graph
Computational Theory and Mathematics
Knowledge graph
Hardware and Architecture
020204 information systems
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Data mining
RDF
computer
Computer Science::Databases
Subjects
Details
- ISSN :
- 10459219
- Volume :
- 28
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Parallel and Distributed Systems
- Accession number :
- edsair.doi...........0601a43c57f3a78c584eeb87f5d1686b
- Full Text :
- https://doi.org/10.1109/tpds.2017.2665478