Computer Science ›› 2024, Vol. 51 ›› Issue (6A): 230900018-5.doi: 10.11896/jsjkx.230900018

• Artificial Intelligenc • Previous Articles     Next Articles

Text Classification Based on Invariant Graph Convolutional Neural Networks

HUANG Rui1, XU Ji2   

  1. 1 College of Computer Science and Technology,Guizhou University,Guiyang 550025,China
    2 State Key Laboratory of Public Big Data,Guizhou University,Guiyang 550025,China
  • Published:2024-06-06
  • About author:HUANG Rui,born in 1999,postgra-duate,is a member of CCF(No.N8705G).Her main research interests include machine learning and natural language processing.
    XU Ji,born in 1979,Ph.D,professor,is a member of CCF(No.12919M).His main research interests include data mining,granular computing and machine learning.
  • Supported by:
    National Natural Science Foundation of China(61966005,62366008).

Abstract: Text classification is a basic and important task in natural language processing,and graph neural networks have been applied to this task in recent years.However,the graph representation learning using graph neural networks can not well satisfy the generalization learning of new words in the task involving text classification.It is generally assumed that training and testing data come from the same distribution,which is often invalid in reality.To overcome these problems,this paper puts forward the Invariant-GCN,which is used for text categorization by GCN reported.First,to build a single figure for each document,use GCN to learn fine-grained word representation according to its local structure,which can effectivelygenerate embeddings for words not seen in the new document and then merge the word nodes as document embeddings.And then extract the maximum limit retained within the same class information expectations subgraph,use the graph to study is not affected by the distribution change.Finally,the text classification is completed by graph classification method.In four benchmark datasets,the the Invariant-GCN is compared with five classification methods,and the experimental results show that it has a good effect of text categorization.

Key words: Text classification, Graph convolutional neural network, Casual learning, Text graph construction

CLC Number: 

  • TP391
[1]ROY P K,SINGH J P,BANERJEE S.Deep learning to filterSMS Spam[J].Future Generation Computer Systems,2020,102:524-533.
[2]ZHANG L,WANG S,LIU B.Deep learning for sentiment analysis:A survey[J].Wiley Interdisciplinary Reviews:Data Mining and Knowledge Discovery,2018,8(4):e1253.
[3]ZENG Z,HE K,YAN Y,et al.Modeling Discriminative Repre-sentations for Out-of-Domain Detection with SupervisedCon-trastive Learning[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing(Volume 2:Short Papers).2021:870-878.
[4]JOHNSON R,ZHANG T.Deep pyramid convolutional neural networks for text categorization[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).2017:562-570.
[5]ZHANG Y,LIU Q,SONG L.Sentence-State LSTM for Text Representation[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).2018:317-327.
[6]WAN S,PAN S,YANG J,et al.Contrastive and generativegraph convolutional networks for graph-based semi-supervised learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021,35(11):10049-10057.
[7]AHUJA K,CABALLERO E,ZHANG D,et al.Invariance principle meets information bottleneck for out-of-distribution gene-ralization[J].Advances in Neural Information Processing Systems,2021,34:3438-3450.
[8]MCCALLUM A,NIGAM K.A comparison of event models fornaive bayes text classification[C]//AAAI-98 Workshop on Learning for Text Categorization.1998:41-48.
[9]JOACHIMS T.Text categorization with support vector ma-chines:Learning with many relevant features[C]//European Conference on Machine Learning.Berlin,Springer,1998:137-142.
[10]COVER T,HART P.Nearest neighbor pattern classification[J].IEEE Transactions on Information Theory,1967,13(1):21-27.
[11]KIM Y.Convolutional Neural Networks for Sentence Classification[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing(EMNLP).Association for Computational Linguistics,2014.
[12]LIU P,QIU X,HUANG X.Recurrent neural network for text classification with multi-task learning[C]//Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence.2016:2873-2879.
[13]WANG P,XU B,XU J,et al.Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification[J].Neurocomputing,2016,174:806-814.
[14]YAO L,MAO C,LUO Y.Graph convolutional networks fortext classification[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:7370-7377.
[15]HUANG L,MA D,LI S,et al.Text Level Graph Neural Network for Text Classification[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP).2019:3444-3450.
[16]ZHANG Y,YU X,CUI Z,et al.Every Document Owns ItsStructure:Inductive Text Classification via Graph Neural Networks[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:334-339.
[17]LIU X,YOU X,ZHANG X,et al.Tensor graph convolutional networks for text classification[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:8409-8416.
[18]JOULIN A,GRAVE É,BOJANOWSKI P,et al.Bag of Tricks for Efficient Text Classification[C]//Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics:Volume 2,Short Papers.2017:427-431.
[19]SHEN D,WANG G,WANG W,et al.Baseline Needs MoreLove:On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Vo-lume 1:Long Papers).Association for Computational Linguistics,2018.
[1] LI Guo, CHEN Chen, YANG Jing, QUN Nuo. Study on Tibetan Short Text Classification Based on DAN and FastText [J]. Computer Science, 2024, 51(6A): 230700064-5.
[2] WEI Niannian, HAN Shuguang. New Solution for Traveling Salesman Problem Based on Graph Convolution and AttentionNeural Network [J]. Computer Science, 2024, 51(6A): 230700222-8.
[3] LIU Hui, JI Ke, CHEN Zhenxiang, SUN Runyuan, MA Kun, WU Jun. Malicious Attack Detection in Recommendation Systems Combining Graph Convolutional Neural Networks and Ensemble Methods [J]. Computer Science, 2024, 51(6A): 230700003-9.
[4] MENG Xiangfu, REN Quanying, YANG Dongshen, LI Keqian, YAO Keyu, ZHU Yan. Literature Classification of Individual Reports of Adverse Drug Reactions Based on BERT and CNN [J]. Computer Science, 2024, 51(6A): 230400049-6.
[5] LI Zichen, YI Xiuwen, CHEN Shun, ZHANG Junbo, LI Tianrui. Government Event Dispatch Approach Based on Deep Multi-view Network [J]. Computer Science, 2024, 51(5): 216-222.
[6] ZHANG Liying, SUN Haihang, SUN Yufa , SHI Bingbo. Review of Node Classification Methods Based on Graph Convolutional Neural Networks [J]. Computer Science, 2024, 51(4): 95-105.
[7] YAN Wenjie, YIN Yiying. Human Action Recognition Algorithm Based on Adaptive Shifted Graph Convolutional Neural
Network with 3D Skeleton Similarity
[J]. Computer Science, 2024, 51(4): 236-242.
[8] YI Liu, GENG Xinyu, BAI Jing. Hierarchical Multi-label Text Classification Algorithm Based on Parallel Convolutional Network Information Fusion [J]. Computer Science, 2023, 50(9): 278-286.
[9] HUANG Yujiao, CHEN Mingkai, ZHENG Yuan, FAN Xinggang, XIAO Jie, LONG Haixia. Text Classification Based on Weakened Graph Convolutional Networks [J]. Computer Science, 2023, 50(6A): 220700039-5.
[10] ZHANG Xue, ZHAO Hui. Sentiment Analysis Based on Multi-event Semantic Enhancement [J]. Computer Science, 2023, 50(5): 238-247.
[11] SHAO Yunfei, SONG You, WANG Baohui. Study on Degree of Node Based Personalized Propagation of Neural Predictions forSocial Networks [J]. Computer Science, 2023, 50(4): 16-21.
[12] WANG Zhendong, DONG Kaikun, HUANG Junheng, WANG Bailing. SemFA:Extreme Multi-label Text Classification Model Based on Semantic Features and Association Attention [J]. Computer Science, 2023, 50(12): 270-278.
[13] CHI Tang, CHE Chao. Entity Alignment Method Combining Iterative Relationship Graph Matching and Attribute Semantic Embedding [J]. Computer Science, 2023, 50(11A): 230200041-6.
[14] ZHANG Kaixuan, CAI Guoyong, ZHU Kunri. Image Aesthetics-enhanced Visual Perception Recommendation System [J]. Computer Science, 2023, 50(11A): 221100083-8.
[15] MA Handa, FANG Yuqing. Dynamic Negative Sampling for Graph Convolution Network Based Collaborative Filtering Recommendation Model [J]. Computer Science, 2023, 50(11A): 230200149-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!