Computer Science ›› 2023, Vol. 50 ›› Issue (6A): 220600230-9.doi: 10.11896/jsjkx.220600230

• Artificial Intelligence • Previous Articles     Next Articles

Graph Attention Networks Based on Causal Inference

ZHANG Tao, CHENG Yifei, SUN Xinxu   

  1. School of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China;
    Hebei Key Laboratory of Information Transmission and Signal Processing,Qinhuangdao,Hebei 066004,China
  • Online:2023-06-10 Published:2023-06-12
  • About author:ZHANG Tao,born in 1979,Ph.D,professor,doctoral supervisor,is a member of China Computer Federation.His main research interests include the causal inference,machine learning,and formal concept analysis.
  • Supported by:
    National Natural Science Foundation of China(62176229) and Natural Science Foundation of Hebei Province,China(F2020203010).

Abstract: Graph attention network(GAT) is an important graph neural network with a wide range of applications in classification tasks.However,when the neighborhood nodes in the graph are disturbed,the model classification accuracy will be affected and degraded.In response,a graph attention network based on causal inference named causal graph attention network(C-GAT) is proposed to improve the robustness of the network.The model first calculates the causal weights between the neighborhood of the target node and its label and uses them to sample the neighborhood.Then the attention coefficient between the sampled neighborhood and the target node is calculated.Finally,the embedding features of the target nodes are obtained by weighted aggregation of the neighborhood information based on the attention coefficients.Experimental results on the Cora,Pubmed and Citeseer datasets show that the classification performance of C-GAT is on par with the classical model in the case of no perturbation.In the presence of perturbations,the classification accuracy of C-GAT improves by at least 6% compared to the classical model,with a better balance of robustness and time cost.

Key words: Graph attention networks, Causal inference, Attention mechanism, Causal weight

CLC Number: 

  • TP183
[1]VELIKOVI P,CUCURULL G,CASANOVA A,et al.GraphAttention Networks[C]//The 6th International Conference on Learning Representations.2018:1-12.
[2]TAN Y,WANG J,ZHANG C.Review of Text ClassificationMethods Based on Graph Convolutional Network[J].Computer Science,2022,49(8):205-216.
[3]XU S J,LIU Q Y,SHI Y,et al.Person Re-Identification Based on Diversified Local Attention Network[J].Journal of Electroni-cs & Information Technology,2022,44(1):211-220.
[4]JUNG J,HEO H S,YU H J,et al.Graph Attention Networks for Speaker Verification[C]//IEEE International Conference on Acoustics,Speech and Signal Processing.Ontario:IEEE,2021:6149-6153.
[5]HAN H,WU Y H,QIN X Y.An Interactive Graph AttentionNetworks Model for Aspect-level Sentiment Analysis[J].Journal of Electronics & Information Technology,2021,43(11):3282-3290.
[6]ZHANG J,LI M X,GAO K S,et al.Word and Graph Attention Networks for Semi-Supervised Classification[J].Knowledge and Information Systems,2021,63:2841-2859.
[7]CAI W W,WEI Z G.Remote Sensing Image Classification Based on a Cross-Attention Mechanism and Graph Convolution[J].IEEE Geoscience and Remote Sensing Letters,2022,19:1-5.
[8]ZHOU H,YANG Y Z,LUO T J,et al.A Unified Deep Sparse Graph Attention Network for Scene Graph Generation[J].Pattern Recognition,2022,123:108367.
[9]YANG Y D,WANG X C,SONG M L,et al.SPAGAN:Shortest Path Graph Attention Network[C]//International Joint Confe-rence on Artificial Intelligence.2019:4099-4105.
[10]ZHOU A Z,LI Y F.Structural Attention Network for Graph[J].Applied Intelligence,2021,51(8):6255-6264.
[11]BAI J,DING B X,XIAO Z,et al,Hyperspectral Image Classification Based on Deep Attention Graph Convolutional Network[J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-16.
[12]GU W W,GAO F,LOU X D,et al.Discovering Latent Node Information by Graph Attention Network[J].Scientific Reports,2021,11(1):6979.
[13]ZHANG K,ZHU Y,WANG J,et al.Adaptive Structural Fin-gerprints for Graph Attention Networks[C]//The 8th International Conference on Learning Representations.2020.
[14]YANG L,LI W X,GUO Y F,et al.Graph-CAT:Graph Co-Attention Networks Via Local and Global Attribute Augmentations[J].Future Generation Computer Systems,2021,118:170-179.
[15]XIE Y,ZHANG Y Q,GONG M G,et al.Multi-View Graph Attention Networks[J].Neural Networks,2020,132:180-189.
[16]ZHANG Z C,LI M,LIN X,et al.Multistep Speed Prediction on Traffic Networks:A Deep Learning Approach ConsideringSpatio-Temporal Dependencies[J].Transportation Research Part C:Emerging Technologies,2019,105:297-322.
[17]JI C J,WANG R X,ZHU R X,et al.Hop-Aware SupervisionGraph Attention Networks for Sparsely Labeled Graphs[J/OL].https://arxiv.org/abs/2004.04333.
[18]ZHANG H M,XU M.Graph Neural Networks with MultipleKernel Ensemble Attention[J].Knowledge-Based Systems,2021,229:107299.
[19]BAI S,ZHANG F H,TORR P H S.Hypergraph Convolution and Hypergraph Attention[J].Pattern Recognition,2021,110:107637.
[20]WANG Z H,SHEN H W,CAO Q,et al.Survey on Graph Classification[J].Journal of Software,2022,33(1):171-192.
[21]FENG W Z,ZHANG J,DONG Y X,et al.Graph Random Neural Networks for Semi-supervised Learning on Graphs[C]//Advances in Neural Information Processing Systems,2020,33:22092-22103.
[22]JUDEA P.The Seven Tools of Causal Inference with Reflections on Machine Learning[J].Communications of the ACM,2019,62(3):54-60.
[23]JUDEA P.Causal Inference[M].Causality:Objectives and Assessment,2010:39-58.
[24]SCHOLKOPF B,LOCATELLO F,BAUER S,et al.TowardCausal Representation Learning[J].Proceedings of the IEEE,2021,109(5):612-634.
[25]RICHENS J G,LEE C M,JOHRI S.Improving the Accuracy of Medical Diagnosis with Causal Machine Learning[J].Nature Communications,2020,11:47-54.
[26]LITTLE M A,BADAWY R.Causal Bootstrapping[J/OL].https://arxiv.org/abs/1910.09648.
[27]LIU L,WANG S,HU B,et al.Learning structures of interval-based Bayesian networks in probabilistic generative model for human complex activity recognition[J].Pattern Recognition,2018,81:545-561
[28]ZHANG T,LIU M Q,LIU W Y.The Causality Research Between Syndrome Elements by Attribute Topology[J].Computational and Mathematical Methods in Medicine,2018,2018:1-12.
[29]KINGMA D P,BA J L.Adam:A Method for Stochastic Optimization[C]//ICIR 2015.2015.
[30]KIPF T N,WELLING M.Semi-supervised Classification withGraph Convolutional Networks[C]//International Conference on Learning Representations.2017.
[31]HAMILTON W L,YING R,LESKOVEC J.Inductive Representation Learning on Large Graphs[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017:1025-1035.
[32]XIE T,WANG B,JAY K C C.GraphHop:An Enhanced Label Propagation Method for Node Classification[J/OL].https://arxiv.org/abs/2101.02326v1.
[1] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[2] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[3] WANG Jiahao, ZHONG Xin, LI Wenxiong, ZHAO Dexin. Human Activity Recognition with Meta-learning and Attention [J]. Computer Science, 2023, 50(8): 193-201.
[4] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[5] YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[6] GAO Xiang, TANG Jiqiang, ZHU Junwu, LIANG Mingxuan, LI Yang. Study on Named Entity Recognition Method Based on Knowledge Graph Enhancement [J]. Computer Science, 2023, 50(6A): 220700153-6.
[7] CUI Lin, CUI Chenlu, LIU Zhengwei, XUE Kai. Speech Emotion Recognition Based on Improved MFCC and Parallel Hybrid Model [J]. Computer Science, 2023, 50(6A): 220800211-7.
[8] DUAN Jianyong, YANG Xiao, WANG Hao, HE Li, LI Xin. Document-level Relation Extraction of Graph Attention Convolutional Network Based onInter-sentence Information [J]. Computer Science, 2023, 50(6A): 220800189-6.
[9] YANG Xing, SONG Lingling, WANG Shihui. Remote Sensing Image Classification Based on Improved ResNeXt Network Structure [J]. Computer Science, 2023, 50(6A): 220100158-6.
[10] ZHANG Shunyao, LI Huawang, ZHANG Yonghe, WANG Xinyu, DING Guopeng. Image Retrieval Based on Independent Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220300092-6.
[11] LIU Haowei, YAO Jingchi, LIU Bo, BI Xiuli, XIAO Bin. Two-stage Method for Restoration of Heritage Images Based on Muti-scale Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220600129-8.
[12] LI Fan, JIA Dongli, YAO Yumin, TU Jun. Graph Neural Network Few Shot Image Classification Network Based on Residual and Self-attention Mechanism [J]. Computer Science, 2023, 50(6A): 220500104-5.
[13] SUN Kaiwei, WANG Zhihao, LIU Hu, RAN Xue. Maximum Overlap Single Target Tracking Algorithm Based on Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220400023-5.
[14] WU Liuchen, ZHANG Hui, LIU Jiaxuan, ZHAO Chenyang. Defect Detection of Transmission Line Bolt Based on Region Attention Mechanism andMulti-scale Feature Fusion [J]. Computer Science, 2023, 50(6A): 220200096-7.
[15] ZHANG Shuaiyu, PENG Li, DAI Feifei. Person Re-identification Method Based on Progressive Attention Pyramid [J]. Computer Science, 2023, 50(6A): 220200084-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!