Back to Search Start Over

Memory-Aware Framework for Efficient Second-Order Random Walk on Large Graphs

Authors :
Bin Cui
Shiyue Huang
Xupeng Miao
Yingxia Shao
Lei Chen
Source :
SIGMOD Conference
Publication Year :
2020
Publisher :
ACM, 2020.

Abstract

Second-order random walk is an important technique for graph analysis. Many applications use it to capture higher-order patterns in the graph, thus improving the model accuracy. However, the memory explosion problem of this technique hinders it from analyzing large graphs. When processing a billion-edge graph like Twitter, existing solutions (e.g., alias method) of the second-order random walk may take up 1796TB memory. Such high memory overhead comes from the memory-unaware strategies for node sampling across the graph. In this paper, to clearly study the efficiency of various node sampling methods in the context of second-order random walk, we design a cost model, and then propose a new node sampling method following the acceptance-rejection paradigm to achieve a better balance between memory and time cost. Further, to guarantee the efficiency of the second-order random walk within arbitrary memory budgets, we propose a memory-aware framework on the basis of the cost model. The framework applies a cost-based optimizer to assign desirable node sampling method for each node in the graph within a memory budget while minimizing the time cost. Finally, we provide general programming interfaces for users to benefit from the memory-aware framework easily. The empirical studies demonstrate that our memory-aware framework is robust with respect to memory and is able to achieve considerable efficiency by reducing 90% of the memory cost.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Accession number :
edsair.doi...........532c932699bee8e084aa5c20cacad042
Full Text :
https://doi.org/10.1145/3318464.3380562