Back to Search Start Over

Parallel graph partitioning framework for solving the maximum clique problem using Hadoop

Authors :
Suqi Zhang
Jingjin Guo
Xiaowei Liu
Xing Gao
Source :
2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA).
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

Graph pattern mining is an important part of the emerging social network science, and the research of the maximum clique problem is one of the most important research branches. In the big data environment, the mass of nodes and complexity of edges in the graph set a higher requirement on the speed and accuracy of the maximum clique (MCP) study. In this paper, we present a parallel graph partitioning framework for solving the maximum clique problem(PPMC) based on Hadoop. Firstly, This paper presents a new graph partition method based on degree sorting(GPD), the subgraph scale is greatly reduced. In order to balance the workload of solving the maximum clique problems. Secondly, the improved ant algorithm proposed by our team is used to solve the maximum clique of each subgraph. Finally, the framework is deployed on Hadoop distributed cloud computing platform and tested by Stanford large-scale datasets. The experimental results show that GPD algorithm can greatly reduce the subgraph scale, the subgraph number, and the complexity of time and space of solving the maximum clique problem of large-scale instances. And the efficiency and scalability of the Hadoop based parallel graph partitioning framework for solving the maximum clique problems has been further proved.

Details

Database :
OpenAIRE
Journal :
2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)
Accession number :
edsair.doi...........a7dc8120dd947b1a626fa2dfc9bf6d2c
Full Text :
https://doi.org/10.1109/icbda.2017.8078804