Computer Science ›› 2013, Vol. 40 ›› Issue (10): 190-193.

Previous Articles     Next Articles

MR-GSpar:A Distributed Large Graph Sparsification Algorithm Based on MapReduce

CHEN De-hua,ZHOU Meng,SUN Yan-qing and ZHENG Liang-liang   

  • Online:2018-11-16 Published:2018-11-16

Abstract: As an important data pre-processing operation,graph sparsification has attracted wide attentions from the area of database.Nowadays the graph data is becoming popular and scale.Thus this paper proposed an efficient parallel graph sparsification algorithm,namely the MR-GSpar algorithm.The MR-GSpar algorithm is presented by reforming the traditional Minhash algorithm into a parallel and distributed algorithm using MapReduce framework,which can archive efficient sparsification on large-scale graph data in a large machine cluster environment.Experiments on real datasets show that the algorithm is feasible and effective.

Key words: Graph sparsification,Minhsh,MapReduce framework,MR-GSpar algorithm

[1] Satuluri V,Parthasarathy S.Scalable graph clustering using stochastic flows:applications to community discovery[C]∥ACM SIGKDD.2009:737-746
[2] Kulis B,Basu S,Dhillon I,et al.Semi-supervised Graph Clustering:A Kernel Approach[J].Machine Learning,2009,74(1):1-22
[3] Satuluri V,Parthasarathy S,Ruan Y.Local graph Sparsification for Scalable Clustering[C]∥SIGMOD.2011:737-746
[4] 李建江,崔健,王聃,等.MapReduce并行编程模型研究综述[J].电子学报,2011,39(11):2635-2642
[5] Lin J,Schatz M.Design patterns for efficient graph algorithms in mapreduce[C]∥MLG.2010:78-85
[6] 尹丹,高宏,邹兆年,等.一种新的高效图聚集算法[J].计算机研究与发展,2011,48(10):1831-1841
[7] Lv Qin,Josephson W,Wang Zhe,et al.Multi-probe LSH:efficient indexing for high-dimensional similarity search[C]∥Proc of the 33rd Int Conf on Very Large Data Bases(VLDB’07). Vien-na Austria:VLDB Endowment,2007:950-961

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!