Computer Science ›› 2025, Vol. 52 ›› Issue (1): 160-169.doi: 10.11896/jsjkx.231100117

• Database & Big Data & Data Science • Previous Articles     Next Articles

Path-masked Autoencoder Guiding Unsupervised Attribute Graph Node Clustering

DING Xinyu1, KONG Bing1, CHEN Hongmei1, BAO Chongming2, ZHOU Lihua1   

  1. 1 School of Information Science and Engineering,Yunnan University,Kunming 650504,China
    2 School of Software,Yunnan University,Kunming 650504,China
  • Received:2023-11-19 Revised:2024-04-27 Online:2025-01-15 Published:2025-01-09
  • About author:DING Xinyu,born in 1997,postgra-duate,is a member of CCF(No.R3213G).His main research interests include deep graph clustering and data mining.
    KONG Bing,born in 1968,Ph.D,asso-ciate professor.His main research in-terests include social network analysis and machine learning.
  • Supported by:
    National Natural Science Foundation of China(62062066,61762090,61966036,62276227),Yunnan Fundamental Research Projects(202201AS070015),Young and Middle-aged Academic and Technical Leaders Reserve Talent Project in Yunnan Province(202205AC160033),Yunnan Key Laboratory of Intelligent Systems and Computing(202205AG070003) and Practical Innovation Project of Postgraduate Students in the Professional Degree of Yunnan University(ZC-23234311).

Abstract: The purpose of graph clustering is to discover the community structure of the network.Aiming at the problem that the current clustering methods can not well obtain the deep potential community information of the network,and can not make sui-table information integration of the features,resulting in unclear semantics of the node community,a path-masked autoencoder guiding unsupervised attribute graph node clustering(PAUGC)model is proposed.This model utilizes an autoencoder to deeply dig the network topology structure by randomly masking network paths,thereby obtaining excellent global structural semantic information.Utilizing a normative method for information integration of the features,so that the node features are able to better characterize the class information of the features.In addition,the model combines modularity maximization to capture the under-lying community clusters information in the whole graph,aiming to more reasonably fuse it into the low-dimensional node features.Finally,the model iteratively optimizes and updates the clustering representation through self-training clustering to obtain the final node features.By conducting extensive experiments and comparisons with 11 classical methods on 8 benchmark datasets,PAUGC has been proven to be effective compared to current mainstream methods.

Key words: Deep graph clustering, Unsupervised learning, Feature integration, Module maximization, Self-training for clustering

CLC Number: 

  • TP391
[1]LIU C,WEN L,KANG Z,et al.Self Supervised Consensus Rep-resentation Learning for Attributed Graph[C]//Proceedings of the 29th ACM International Conference on Multimedia.New York:ACM Press,2021:2654-2662.
[2]WANG M,WANG C,YU J X,et al.Community Detection inSocial Networks:An In Depth Benchmarking Study with A Procedure Oriented Framework[J].Proceedings of the VLDB Endowment,2015,8(10):998-1009.
[3]GARCIA J O,ASHOURVAN A,MULDOON S,et al.Applications of Community Detection Techniques to Brain Graphs:Algorithmic Considerations and Implications for Neural Function[J].Proceedings of the IEEE,2018,106(5):846-867.
[4]KRISHNAMURTHY B,WANG J.On Network-Aware Clustering of Web Clients[C]//Proceedings of the Conference on Applications,Technologies,Architectures,and Protocols for Computer Communication.New York:ACM Press,2000:97-110.
[5]CHIANG M M T,MIRKIN B.Intelligent Choice of the Number of Clusters in K-Means Clustering:An Experimental Study with Different Cluster Spreads[J].Journal of Classification,2010,27:3-40.
[6]MCINNES L,HEALY J,ASTELS S.HDBSCAN:HierarchicalDensity Based Clustering[J].Open Source Softw,2017,2(11):205.
[7]KIPF T N,WELLING M.Semi-Supervised ClassificAtion with Graph Convolutional Networks[J].arXiv:1609.02907,2016.
[8]TANG J M,HAN H,HUANG L.Coarse grained and Fine-grained Features Extraction Based on Unsupervised Learning in Pedestrian Reidentification[J].Computer Engineering,2022,48(4):269-275,283.
[9]WANG C,PAN S,LONG G,et al.Mgae:Marginalized GraphAutoencoder for Graph Clustering[C]//Proceedings of the 2017 ACM on Conference on Information and Knowledge Management.Singapore:ACM Press,2017:889-898.
[10]PARK J,LEE M,CHANG H J,et al.Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning[C]//Proceedings of the IEEE International Confe-rence on Computer Vision.Seoul:IEEE Press,2019:6519-6528.
[11]PAN S,HU R,LONG G,et al.Adversarially Regularized Graph Autoencoder for Graph Embedding[C]//Proceedings of the 27th International Joint Conference on Artificial Intelligence.Stockholm:IJCAI Press,2018:2609-2615.
[12]WANG C,PAN S,HU R,et al.Attributed Graph Clustering:A Deep Attentional Embedding Approach[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence.Macao:IJCA Press,2019:3670-3676.
[13]SALEHI A,DAVULCU H.Graph Attention Auto-Encoders[C]//Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence.ELECTR NETWORK:IEEE Press,2020:989-996.
[14]HUI B,ZHU P,HU Q.Collaborative Graph Convolutional Networks:Unsupervised Learning Meets Semi-Supervised Learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence.New York:AAAI Press,2020:4215-4222.
[15]ZHOU X,SU L,LI X,et al.Community Detection Based on Unsupervised Attributed Network Embedding[J].Expert Systems with Applications,2023,213:118937.
[16]NEWMAN M E J.Modularity and Community Structure in Networks[J].Proceedings of the National Academy of Sciences,2006,103(23):8577-8582.
[17]YANG X,LIU Y,ZHOU S,et al.Cluster Guided Contrastive Graph Clustering Network[C]//Proceedings of the 37th AAAI Conference on Artificial Intelligence.Washington:AAAI Press,2023:10834-10842.
[18]ZHANG S,PANG J,LIAO M X.Graph Clustering Algorithm Based on Hybrid Feature Selection[J].Journal of Chinese Computer Systems,2024,45(3):606-612.
[19]SCHAUB M T,DELVENNE J C,ROSVALL M,et al.TheMany Facets of Community Detection in Complex Networks[J].Applied Network Science,2017,2(1):1-13.
[20]FORTUNATO S.Community Detection in Graphs[J].Physics Reports,2010,486(3/4/5):75-174.
[21]FORTUNATO S,HRIC D.Community Detection in Networks:A User Guide[J].Physics Reports,2016,100(659):1-44.
[22]AGARWAL G,KEMPE D.Modularity Maximizing Graph Communities via Mathematical Programming[J].The European Physical Journal B,2008,66(3):409-418.
[23]KIPF T N,WELLING M.Variational Graph Auto-Encoders[J].arXiv:1611.07308,2016.
[24]QIU C,HUANG Z,XU W,et al.VGAER:Graph Neural Network Reconstruction Based Community Detection[J].arXiv:2201.04066,2022.
[25]LI J,YU J,LI J,et al.Dirichlet Graph Variational Autoencoder[J].Advances in Neural Information Processing Systems,2020,33:5274-5283.
[26]TIAN Y,DONG K,ZHANG C,et al.Heterogeneous GraphMasked Autoencoders[C]//Proceedings of the 37th AAAI Conference on Artificial Intelligence.Washington:AAAI Press,2023:9997-10005.
[27]XU H,XIA W,GAO Q,et al.Graph Embedding Clustering:Graph Attention Auto Encoder with Cluster-Specificity Distribution[J].Neural Networks,2021,142:221-230.
[28]CUI G,ZHOU J,YANG C,et al.Adaptive Graph Encoder for Attributed Graph Embedding[C]//Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Disco-very & Data Mining.Electr Network:KDD Press,2020:976985.
[29]PENG Z,LIU H,JIA Y,et al.Deep Attention Guided Graph Clustering with Dual Self Supervision[J].IEEE Transactions on Circuits and Systems for Video Technology,2023,33(7):3296-3307.
[30]BO D,WANG X,SHI C,et al.Structural Deep Clustering Network[C]//Proceedings of the 29th World Wide Web Confe-rence.2020:1400-1410.
[31]TU W,ZHOU S,LIU X,et al.Deep Fusion Clustering Network[C]//Proceedings of the AAAI Conference on Artificial Intelligence.Electr Network:AAAI Press,2021:9978-9987.
[32]HASSANI K,KHASAHMADI A H.Contrastive Multi-ViewRepresentation Learning on Graphs[C]//Proceedings of the 37th International Conference on Machine Learning.Electr Network:ICML,2020:4074-4084.
[33]GONG L,ZHOU S,TU W,et al.Attributed Graph Clustering with Dual Redundancy Reduction[C]//Proceedings of the 31st International Joint Conference on Artificial Intelligence.Vienna:IJCAI Press,2022:3015-3021.
[34]LEE N,LEE J,PARK C.Augmentation-Free Self-SupervisedLearning on Graphs[C]//Proceedings of the 36th AAAI Conference on Artificial Intelligence.Electr Network:AAAI Press,2022:7372-7380.
[35]ZHAO H,YANG X,WANG Z,et al.Graph Debiased Contrastive Learning with Joint Representation Clustering[C]//Proceedings of the 30th International Jo int Conference on Artificial Intelligence.Electr Network:IJCAI Press,2021:3434-3440.
[1] LI Dongyang, NIE Rencan, PAN Linna, LI He. UMGN:An Infrared and Visible Image Fusion Network Based on Unsupervised Significance MaskGuidance [J]. Computer Science, 2024, 51(6A): 230600170-5.
[2] LOU Ren, HE Renqiang, ZHAO Sanyuan, HAO Xin, ZHOU Yueqi, WANG Xinyuan, LI Fangfang. Single Stage Unsupervised Visible-infrared Person Re-identification [J]. Computer Science, 2024, 51(6A): 230600138-7.
[3] HE Yifan, HE Yulin, CUI Laizhong, HUANG Zhexue. Subspace-based I-nice Clustering Algorithm [J]. Computer Science, 2024, 51(6): 153-160.
[4] CAI Jiacheng, DONG Fangmin, SUN Shuifa, TANG Yongheng. Unsupervised Learning of Monocular Depth Estimation:A Survey [J]. Computer Science, 2024, 51(2): 117-134.
[5] KONG Senlin, ZHANG Hui, HUANG Zhennan, LIU Youwu, TAO Yan. Asymmetric Teacher-Student Network Model for Industrial Image Anomaly Detection [J]. Computer Science, 2024, 51(11A): 240200069-7.
[6] JING Yeyiran, YU Zeng, SHI Yunxiao, LI Tianrui. Review of Unsupervised Domain Adaptive Person Re-identification Based on Pseudo-labels [J]. Computer Science, 2024, 51(1): 72-83.
[7] XU Jie, WANG Lisong. Contrastive Clustering with Consistent Structural Relations [J]. Computer Science, 2023, 50(9): 123-129.
[8] LIANG Yunhui, GAN Jianwen, CHEN Yan, ZHOU Peng, DU Liang. Unsupervised Feature Selection Algorithm Based on Dual Manifold Re-ranking [J]. Computer Science, 2023, 50(7): 72-81.
[9] SHEN Yuancheng, BAN Rui, CHEN Xin, HUA Runduo, WANG Yunhai. Anomaly Detection Algorithm for Network Device Configuration Based on Configuration Statement Tree [J]. Computer Science, 2023, 50(11A): 230200128-10.
[10] XIE Zhuo, KANG Le, ZHOU Lijuan, ZHANG Zhihong. Clustering Method Based on Contrastive Learning for Multi-relation Attribute Graph [J]. Computer Science, 2023, 50(11): 62-70.
[11] WANG Bin, LIANG Yudong, LIU Zhe, ZHANG Chao, LI Deyu. Study on Unsupervised Image Dehazing and Low-light Image Enhancement Algorithms Based on Luminance Adjustment [J]. Computer Science, 2023, 50(1): 123-130.
[12] SONG Jie, LIANG Mei-yu, XUE Zhe, DU Jun-ping, KOU Fei-fei. Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level [J]. Computer Science, 2022, 49(9): 64-69.
[13] ZHANG Yu-xin, CHEN Yi-qiang. Driver Distraction Detection Based on Multi-scale Feature Fusion Network [J]. Computer Science, 2022, 49(11): 170-178.
[14] HOU Hong-xu, SUN Shuo, WU Nier. Survey of Mongolian-Chinese Neural Machine Translation [J]. Computer Science, 2022, 49(1): 31-40.
[15] LI Jin-xia, ZHAO Zhi-gang, LI Qiang, LV Hui-xian and LI Ming-sheng. Improved Locality and Similarity Preserving Feature Selection Algorithm [J]. Computer Science, 2020, 47(6A): 480-484.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!