Computer Science ›› 2020, Vol. 47 ›› Issue (10): 75-82.doi: 10.11896/jsjkx.190800002

• Database & Big Data & Data Science • Previous Articles     Next Articles

Overlapping Community Detection Method Based on Rough Sets and Distance Dynamic Model

ZHANG Qin, CHEN Hong-mei, FENG Yun-fei   

  1. School of Information Science and Technology,Southwest Jiaotong University,Chengdu 611756,China
    Key Laboratory of Cloud Computing and Intelligent Technology,Southwest Jiaotong University,Chengdu 611756,China
  • Received:2019-07-31 Revised:2019-11-11 Online:2020-10-15 Published:2020-10-16
  • About author:ZHANG Qin,born in 1995,postgra-duate,is a member of China Computer Federation.Her main research interests include database technology and data mining.
    CHEN Hong-mei,born in 1971,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.Her main research interests include granular calculation,rough sets and intelligent information processing.
  • Supported by:
    National Natural Science Foundation of China (61572406,61976182) and Key Program for International Science and Technology Innovation Cooperation of Sichuan Province,China (2019YFH0097)

Abstract: The real world is considered to be composed of many different complex systems.In order to model and analyze the hidden rules and functions among individuals in complex systems,the complex system may be abstracted as a complex network composed of nodes and edges.Mining community structures in complex networks has important theoretical significance and practical value in content recommendation,behavior prediction and disease spread.With the continuous changes of individuals in complex systems,overlapping nodes appear among multiple communities.How to effectively and accurately mine the overlapping nodes in communities has brought some challenges.In order to effectively detect the overlapping nodes in the community,an overlapping community detection method based on rough sets and distance dynamic model (OCDRDD) is proposed in this paper.First of all,according to the topology of the network,it selects K core nodes by combining node degree centrality and distance,then initializes the approximation sets and the boundary region of the community according to the distance ratio relationship.Combined with the distance dynamic model,the distances between boundary region nodes and the lower approximation set nodes are changed iteratively.During each iteration,boundary region nodes that conform to the defined distance ratio relationship are classified into the lower approximation of the community,and the boundary region nodes are reduced until the optimal overlapping community structure is found.Finally,the “pseudo” overlapping nodes are processed according to the two rules defined in this paper.NMI and overlapping module degree EQ are taken as evaluation indexes on real network datasets and LFR Benchmark artificial network datasets.The OCDRDD method is compared with other typical community detection methods in recent years both on real network datasets and LFR Benchmark artificial network datasets.The experimental results show that OCDRDD method is better than other community detection algorithms on the whole.The results show that the proposed algorithm is effective and feasible.

Key words: Boundary region, Distance dynamic model, Overlapping community detection, Rough set

CLC Number: 

  • TP391
[1]COSCIA M,GIANNOTTI F,PEDRESCHI D.A classificationfor community discovery methods in complex networks[J].Statistical Analysis & Data Mining the Asa Data Science Journal,2011,4(5):512-546.
[2]KELLEY S,GOLDBERG M,MAGDON-ISMAIL M,et al.Defining and discovering communities in social networks[M] // Thai M T,Pardalos P M.Handbook of Optimization in Complex Network.New York:Springer,2012:139-168.
[3]CUI W,XIAO Y,WANG H,et al.Local search of communities in large graphs[C]//ACM SIGMOD International Conference on Management.2014:991-1002.
[4]XIE J,KELLEY S,SZYMANSKI B K.Overlapping community detection in networks:the State of the Art and Comparative Study[J].ACM Computing Surveys,2011,45(4):1-35.
[5]BAI X,YANG P,SHI X,et al.An overlapping community detection algorithm based on density peaks[J].Neurocomputing,2016,226:7-15.
[6]ZHOU X,LIU Y,WANG J,et al.A density based link clustering algorithm for overlapping community detection in networks[J].Physica A Statistical Mechanics & Its Applications,2017,486:65-78.
[7]SUN H,LIU J,HUANG J,et al.A link-based label propagation algorithm for overlapping community detection in networks[J].Computational Intelligence,2016,33(2):308-331.
[8]WU H,GAO L,DONG J,et al.Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks[J].Plos One,2013,9(3):e91856.
[9]SUN Z,WANG B,SHENG J,et al.Overlapping community detection based on information dynamics[J].IEEE Access,2018,6:70919-70934.
[10]SHAO J,HAN Z,YANG Q,et al.Community detection based on distance dynamics[C]//Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2015:1075-1084.
[11]CHEN L,ZHANG J,CAI L J.Overlapping community detection based on link graph using distance dynamics[J].International Journal of Modern Physics B,2017,32(3):1850015.
[12]PAWLAK Z.Rough sets[J].InternationalJournal of Computer &Information Sciences,1982,11(5):341-356.
[13]ZHANG Y,WU B,LIU Y,et al.A novel community detection method based on rough set K-means[J].Journal of Electronics &Information Technology,2017,39(4):770-777.
[14]DWYER T.Visual analysis of network centralities[C]//Proceedings of Asia-Pacific Symposium on Information Visualization (APVIS 2006).2006:189-197.
[15]HENNIG C,HAUSDORF B.Design of dissimilarity measures:a new dissimilarity between species distribution areas[C]//Data Science and Classification.Berlin:Springer,2006:29-37.
[16]HUANG L,WANG G,WANG Y,et al.A link density clustering algorithm based on automatically selecting density peaks for overlapping community detection[J].International Journal of Modern Physics B,2016,30(24):15.
[17]DING J,HE X,YUAN J,et al.Community detection by propagating the label of center[J].Physica A:Statistical Mechanics and its Applications,2018,503:675-686.
[18]NEWMAN M.Network data[EB/OL].[2013-04-19].http://www personal.umich.edu/~mejn/netdata/.
[19]LANCICHINETTI A,FORTUNATO S.Community detectionalgorithms:A comparative analysis[J].Physical Review E,2009,80(5):056117.
[20]ZACHARY W.Information-flow model for conflict and fission in small groups[J].Journal of Anthropological Research,1977,33:452-473.
[21]LUSSEAU D,SCHNEIDER K,BOISSEAU O,et al.The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations[J].Behavioral Ecology & Sociobiology,2003,54(4):396-405.
[22]KREBS V.Books about US Politics[EB/OL].http://www.orgnet.com/.
[23]GIRVAN M,NEWMAN M E J.Community structure in social and biological networks[EB/OL].http://arxiv.org/abs/cond-mat/0112110/.
[24]ADAMIC L A,GLANCE N.The political blogosphere and the 2004 US Election[C]//Proceedings of the WWW-2005 Workshop on the Weblogging Ecosystem.2005.
[25]KNUTH D E.The stanford graphBase:A platform for combinatorial computing[M].Addison-Wesley,1993.
[26]ZHU M,MENG F R,ZHOU Y.Density-based link clusteringalgorithm for overlapping community detection[J].Journal of Computer Research and Development.,2013,50(12):2520-2530.
[27]LANCICHINETTI A,FORTUNATO S,KERTÉSZ J.Detec-ting the overlapping and hierarchical community structure in complex networks[J].New Journal of Physics,2009,11(3):033015.
[28]NICOSIA V,MANGIONI G,CARCHIOLO V,et al.Extending modularity definition for directed graphs with overlapping communities[OL].https://wenku.baidu.com/view/f1432a2d0066f5335a81219b.html.
[1] CHENG Fu-hao, XU Tai-hua, CHEN Jian-jun, SONG Jing-jing, YANG Xi-bei. Strongly Connected Components Mining Algorithm Based on k-step Search of Vertex Granule and Rough Set Theory [J]. Computer Science, 2022, 49(8): 97-107.
[2] XU Si-yu, QIN Ke-yun. Topological Properties of Fuzzy Rough Sets Based on Residuated Lattices [J]. Computer Science, 2022, 49(6A): 140-143.
[3] FANG Lian-hua, LIN Yu-mei, WU Wei-zhi. Optimal Scale Selection in Random Multi-scale Ordered Decision Systems [J]. Computer Science, 2022, 49(6): 172-179.
[4] CHEN Yu-si, AI Zhi-hua, ZHANG Qing-hua. Efficient Neighborhood Covering Model Based on Triangle Inequality Checkand Local Strategy [J]. Computer Science, 2022, 49(5): 152-158.
[5] SUN Lin, HUANG Miao-miao, XU Jiu-cheng. Weak Label Feature Selection Method Based on Neighborhood Rough Sets and Relief [J]. Computer Science, 2022, 49(4): 152-160.
[6] WANG Zi-yin, LI Lei-jun, MI Ju-sheng, LI Mei-zheng, XIE Bin. Attribute Reduction of Variable Precision Fuzzy Rough Set Based on Misclassification Cost [J]. Computer Science, 2022, 49(4): 161-167.
[7] WANG Zhi-cheng, GAO Can, XING Jin-ming. Three-way Approximate Reduction Based on Positive Region [J]. Computer Science, 2022, 49(4): 168-173.
[8] XUE Zhan-ao, HOU Hao-dong, SUN Bing-xin, YAO Shou-qian. Label-based Approach for Dynamic Updating Approximations in Incomplete Fuzzy Probabilistic Rough Sets over Two Universes [J]. Computer Science, 2022, 49(3): 255-262.
[9] CHEN Xiang-tao, ZHAO Mei-jie, YANG Mei. Overlapping Community Detection Algorithm Based on Subgraph Structure [J]. Computer Science, 2021, 48(9): 244-250.
[10] LI Yan, FAN Bin, GUO Jie, LIN Zi-yuan, ZHAO Zhao. Attribute Reduction Method Based on k-prototypes Clustering and Rough Sets [J]. Computer Science, 2021, 48(6A): 342-348.
[11] NING Yi-xin, XIE Hui, JIANG Huo-wen. Survey of Graph Neural Network in Community Detection [J]. Computer Science, 2021, 48(11A): 11-16.
[12] XUE Zhan-ao, SUN Bing-xin, HOU Hao-dong, JING Meng-meng. Optimal Granulation Selection Method Based on Multi-granulation Rough Intuitionistic Hesitant Fuzzy Sets [J]. Computer Science, 2021, 48(10): 98-106.
[13] XUE Zhan-ao, ZHANG Min, ZHAO Li-ping, LI Yong-xiang. Variable Three-way Decision Model of Multi-granulation Decision Rough Sets Under Set-pair Dominance Relation [J]. Computer Science, 2021, 48(1): 157-166.
[14] XUE Lei, TANG Xu-qing. Algorithm for Detecting Overlapping Communities Based on Centered Cliques [J]. Computer Science, 2020, 47(8): 157-163.
[15] SANG Bin-bin, YANG Liu-zhong, CHEN Hong-mei, WANG Sheng-wu. Incremental Attribute Reduction Algorithm in Dominance-based Rough Set [J]. Computer Science, 2020, 47(8): 137-143.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!