计算机科学 ›› 2013, Vol. 40 ›› Issue (8): 165-171.

• 软件与数据库技术 • 上一篇    下一篇

基于概念分层的图汇总算法

孙翀,卢炎生   

  1. 华中科技大学计算机学院 武汉430074;华中科技大学计算机学院 武汉430074
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家“十一五”部委预研基金(513150402)资助

Clustering-based Algorithms to Semantic Summarizing Graph with Multi-attributes’ Hierarchical Structures

SUN Chong and LU Yan-sheng   

  • Online:2018-11-16 Published:2018-11-16

摘要: 将原始图中节点分配到多个分组并根据原始边来确立分组间关系,这样得到的图称作汇总图。汇总图的规模可以由用户设定,用户可以通过浏览小规模的汇总图来获得原始图的相关信息。K-SGS方法是一种新的基于节点概念分层的图汇总算法,它解决了传统K-SNAP算法的汇总图规模参数受限问题。为了解决该问题,算法引入了节点的属性值概念分层,从而增强了图汇总过程中节点分组的灵活性:不仅可以合并同值的节点,还可合并具有相似值的节点。除了关注汇总过程中边的信息损失外,K-SGS方法还关注节点的信息损失,它将图汇总问题建模成多目标规划问题,并通过分层序列法和基于分级的统一评价函数两种不同策略来解决该问题。算法上,提出了快速的层次聚类方法,使得每轮可以合并多个聚类,从而提高效率。经数据集上的实验表明,新算法能生产各种规模参数的汇总图,并具有较好的汇总质量。

关键词: 图汇总,概念分层,多目标规划,层次凝聚

Abstract: This paper allocated the raw nodes of graph into some different groups and established relationships among the groups by considering the raw edges of graph.We called this procedure graph summarization and the newly built graph was called graph summary.Users can set the scale value of graph summary and obtain the information of raw graph with the help of it.However,by using the classic method,we can only build the graph summaries whose sizes are above a certain scale lower bound,which cannot be acceptable in many applications.K-SGS is a novel graph summarization method which solves the scale limits.By using the concept hierarchy of the nodes’ attributes,K-SGS can group the nodes in a flexible way.It groups the nodes not only with same values but also with similar values.Besides the edges’ information loss,it also considers the nodes’ information loss during the summarization and models the summarization as multi-objective planning.We proposed two hierarchical agglomerative algorithms.One is based on forbearing stratified sequencing method and the other is based on unified objective function method.The experiment on real life dataset shows that our methods can solve the problem and get the graph summaries with good quality.

Key words: Graph summarization,Concept hierarchy,Multi-objective planning,Hierarchical agglomerative

[1] Chakrabarti D,Faloutsos C.Graph mining:Laws,generators,and algorithms[J].ACM.Comput.Surv.,2006,38(1)
[2] Chakrabarti D,Faloutsos C,Zhan Y.Visualization of large networks with min-cut plots,A-plots and R-MAT[J].Int.J.Hum.-Comput.Stud.,2007,65(5):434-445
[3] Newman M E J.The structure and function of complex net-works[J].SIAM Review,2003,45:167-256
[4] Huan J,Wang W,Prins J,et al.SPIN:Mining maximal frequent subgraphs from graph databases[C]∥Proceedings of KDD’04. 2004:581-586
[5] Wang W,Wang C,Zhu Y,et al.GraphMiner:A structural pattern-mining system for large disk-based graph databases and its applications[C]∥Proceedings of SIGMOD’05.2005:879-881
[6] Sun J,Xie Y,Zhang H,et al.Less is more:Sparse graph mining with compact matrix decomposition[J].Stat.Anal.Data Min.,2008,1(1):6-22
[7] Xu X,Yuruk N,Feng Z,et al.SCAN:A structural clustering algorithm for networks[C]∥Proceedings of KDD’07.2007:824-833
[8] Battista G,Eades P,Tamassia R,et al.Graph Drawing:Algo-rithms for the Visualization of Graphs[M].PrenticeHall,1999
[9] Herman I,Melancon G,Marshall M S.Graph visualization and navigation in information visualization:A survey[J].IEEETrans.Vis.Comput.Graph.,2000,6(1):24-43
[10] Navlakha S,Rastogi R,Shrivastava N.Graph summarizationwith bounded error[C]∥Proceedings of SIGMOD’08.2008: 567-580
[11] Tian Y,Hankins R,Patel J M.Efficient aggregation for graph summarization[C]∥Proceedings of SIGMOD’08.2008:419-432
[12] Zhang N,Tian Y,Patel J M.Discovery-Driven Graph Summarization[M].Data Engineering (ICDE),2010
[13] 林锉云.多目标优化的方法与理论[M].长春:吉林教育出版社,1992

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!