Computer Science ›› 2023, Vol. 50 ›› Issue (9): 278-286.doi: 10.11896/jsjkx.221200133

• Artificial Intelligence • Previous Articles     Next Articles

Hierarchical Multi-label Text Classification Algorithm Based on Parallel Convolutional Network Information Fusion

YI Liu, GENG Xinyu, BAI Jing   

  1. School of Computer Science,Southwest Petroleum University,Chengdu 610000,China
  • Received:2022-12-23 Revised:2023-04-07 Online:2023-09-15 Published:2023-09-01
  • About author:YI Liu,born in 1995,postgraduate.His main research interests include natural language processing and text classification.
    GENG Xinyu,born in 1964,professor.His main research interests include data mining and artificial neural networks.
  • Supported by:
    Sichuan Science and Technology Program(2022NSFSC0555).

Abstract: Natural language processing(NLP) is an important research direction in the field of artificial intelligence and machine learning,which aims to use computer technology to analyze,understand,and process natural language.One of the main research areas in NLP is to obtain information from textual content and automatically classify and label textual content based on a certain labeling system or standard.Compared to single-label text classification,multi-label text classification has the characteristic that a data element belongs to multiple labels,which makes it more difficult to obtain multiple categories of data features from textual information.Hierarchical classification of multi-label texts isa special category,whichdivides the information contained in the text into different category labeling systems,and each category labeling system has an interdependent hierarchical relationship.Therefore,the use of the hierarchical relationship in the internal labeling system to more accurately classify the text into corresponding labels becomes the key to solving the problem.To this end,this paper proposes a hierarchical classification algorithm for multi-label texts based on the fusion of parallel convolutional network information.First,the algorithm uses the BERT model for word integration in textual information,then it enhances the semantic features of textual information using a self-attention mechanism and extracts the features of textual data using different convolutional kernels.The multi-faceted semantic information of the text is more effectively used for the task of a hierarchical classification of multi-label texts by using a threshold-controlled tree structure to establish inter-node relationships between higher and lower bits.The results obtained on the Kanshan-Cup public dataset and the CI enterprise information dataset show that the algorithm outperforms TextCNN,TextRNN,FastTex and other comparative models in three evaluation measures,namely macro-precision,macro-recall,and micro F1 value,and has a better cascade multi-label text classification effect.

Key words: Hierarchical multi-label text classification, Pre-training model, Attention mechanism, Convolutional neural network, Tree structure

CLC Number: 

  • TP391
[1]WU S,GAO M,XIAO Q,et al.A topic-enhanced recurrent autoencoder model for sentiment analysis of short texts[J].International Journal of Internet Manufacturing and Services,2020,7(4):393-406.
[2]BIN N,WU J W,HU F.Spam message classification based on theNaïve Bayes classification algorithm[J].IAENG International Journal of Computer Science,2019,46(1):46-53.
[3]CHEN J,HE J,SHEN Y,et al.End-to-end learning of LDA by mirror-descent back propagation over a deep architecture[J].arXiv:1508.03398,2015.
[4]MINAEE S,KALCHBRENNER N,CAMBRIA E,et al.Deeplearning--based text classification:a comprehensive review[J].ACM Computing Surveys(CSUR),2021,54(3):1-40.
[5]TAN C.Short Text Classification Based on LDA and SVM [J].International Journal of Applied Mathematics & Stats,2013,51(22):205-214.
[6]YIN C,SHI L,WANG J.Short Text Classification Technology Based on KNN+Hierarchy SVM [C] // International Confe-rence on Multimedia and Ubiquitous Engineering International Conference on Future Information Technology.2017:633-639.
[7]JIANG T,WANG D,SUN L,et al.Transformer with DynamicNegative Sampling for High-Performance Extreme Multi-label Text Classification[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:7987-7994.
[8]JOHNSON R,ZHANG T.Effective use of word order for text categorization with convolutional neural networks[J].arXiv:1412.1058,2014.
[9]GARGIULO F,SILVESTRI S,CIAMPI M,et al.Deep neuralnetwork for hierarchical extreme multi-label text classification[J].Applied Soft Computing,2019,79:125-138.
[10]LIU J,CHANG W C,WU Y,et al.Deep learning for extreme multi-label text classification [C]//Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval.2017:115-124.
[11]KIM Y.Convolutional Neural Networks for Sentence Classification [C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing.2014:1746-1751.
[12]GRAVES A,MOHAMED A,HINTON G.Speech recognitionwith deep recurrent neural networks[C]//Proceedings of the IEEE International Conference on Acoustics,Speech and Signal Processing.2013:6645-6649.
[13]JOULIN A,GRAVE E,BOJANOWSKI P,et al.FastText.zip:Compressing text classification models[J].arXiv:1612.03651,2016.
[14]GARGIULO F,SILVESTRI S,CIAMPI M,et al.Deep neuralnetwork for hierarchical extreme multi-label text classification[J].Applied Soft Computing,2019,79:125-138.
[15]ZHENG C,HONG T T,XUE M Y.BLSTM_MLPCNN Model For short Text Classification [J].Computer Science,2019,46(6):206-211.
[16]DUAN D D,TANG J S,WEN Y,et al.Chinese short text classification algorithm based on BERT model[J].Computer engineering,2021,47(1):79-86.
[17]LAN Z,CHEN M,GOODMAN S,et al.Albert:A lite bert for self-supervised learning of language representations[J].arXiv:1909.11942,2019.
[18]GARGIULO F,SILVESTRI S,CIAMPI M,et al.Deep neural network for hierarchical extreme multi-label text classification[J].Applied Soft Computing,2019,79:125-138.
[19]SOUCY P,MINEAU G W.A simple KNN algorithm for text categorization[C]//Proceedings 2001 IEEE International Conference on Data Mining.IEEE,2001:647-648.
[20]CAI L,HOFMANN T.Hierarchical document categorizationwith support vector machines[C]//Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management.2004:78-87.
[1] ZHU Ye, HAO Yingguang, WANG Hongyu. Deep Learning Based Salient Object Detection in Infrared Video [J]. Computer Science, 2023, 50(9): 227-234.
[2] LUO Yuanyuan, YANG Chunming, LI Bo, ZHANG Hui, ZHAO Xujian. Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension [J]. Computer Science, 2023, 50(9): 287-294.
[3] LI Ke, YANG Ling, ZHAO Yanbo, CHEN Yonglong, LUO Shouxi. EGCN-CeDML:A Distributed Machine Learning Framework for Vehicle Driving Behavior Prediction [J]. Computer Science, 2023, 50(9): 318-330.
[4] WANG Jiahao, ZHONG Xin, LI Wenxiong, ZHAO Dexin. Human Activity Recognition with Meta-learning and Attention [J]. Computer Science, 2023, 50(8): 193-201.
[5] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[6] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[7] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[8] YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[9] ZHAO Ran, YUAN Jiabin, FAN Lili. Medical Ultrasound Image Super-resolution Reconstruction Based on Video Multi-frame Fusion [J]. Computer Science, 2023, 50(7): 143-151.
[10] ZHANG Shunyao, LI Huawang, ZHANG Yonghe, WANG Xinyu, DING Guopeng. Image Retrieval Based on Independent Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220300092-6.
[11] LIU Haowei, YAO Jingchi, LIU Bo, BI Xiuli, XIAO Bin. Two-stage Method for Restoration of Heritage Images Based on Muti-scale Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220600129-8.
[12] LI Fan, JIA Dongli, YAO Yumin, TU Jun. Graph Neural Network Few Shot Image Classification Network Based on Residual and Self-attention Mechanism [J]. Computer Science, 2023, 50(6A): 220500104-5.
[13] SUN Kaiwei, WANG Zhihao, LIU Hu, RAN Xue. Maximum Overlap Single Target Tracking Algorithm Based on Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220400023-5.
[14] WU Liuchen, ZHANG Hui, LIU Jiaxuan, ZHAO Chenyang. Defect Detection of Transmission Line Bolt Based on Region Attention Mechanism andMulti-scale Feature Fusion [J]. Computer Science, 2023, 50(6A): 220200096-7.
[15] ZHANG Shuaiyu, PENG Li, DAI Feifei. Person Re-identification Method Based on Progressive Attention Pyramid [J]. Computer Science, 2023, 50(6A): 220200084-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!