计算机科学 ›› 2022, Vol. 49 ›› Issue (3): 246-254.doi: 10.11896/jsjkx.201200073

• 人工智能 • 上一篇    下一篇

融合双重权重机制和图卷积神经网络的微博细粒度情感分类

李浩, 张兰, 杨兵, 杨海潇, 寇勇奇, 王飞, 康雁   

  1. 云南大学软件学院 昆明650504
  • 收稿日期:2020-12-07 修回日期:2021-06-08 出版日期:2022-03-15 发布日期:2022-03-15
  • 通讯作者: 康雁(562530855@qq.com)
  • 作者简介:(lihao707@ynu.edu.cn)
  • 基金资助:
    国家自然科学基金(61762092);云南省软件工程重点实验室开放基金项目(2020SE303);云南省重大科技专项(202002AB080001);《材料基因工程-基于Metcloud的集成计算功能模块计算软件开发》(2019CLJY06);云南省稀贵金属材料基因工程-稀贵金属材料高通量集成计算和数据分析技术研发及示范应用(2019ZE001-1,202002AB080001)

Fine-grained Sentiment Classification of Chinese Microblogs Combining Dual Weight Mechanismand Graph Convolutional Neural Network

LI Hao, ZHANG Lan, YANG Bing, YANG Hai-xiao, KOU Yong-qi, WANG Fei, KANG Yan   

  1. School of Software,Yunnan University,Kunming 650504,China
  • Received:2020-12-07 Revised:2021-06-08 Online:2022-03-15 Published:2022-03-15
  • About author:LI Hao,born in 1970,professor,Ph.D,His main research interests include distributed computing,grid and cloud computing research.
    KANG Yan,born in 1972,Ph.D,asso-ciate professor.Her main research inte-rests include software engineering,system optimization,big data processing and mining.
  • Supported by:
    National Natural Science Foundation of China(61762092),Open Fund Project of Key Laboratory of Software Engineering in Yunnan Province(2020SE303),Major Science and Technology Projects in Yunnan Province(202002AB080001),Material Genetic Engineering-Calculation Software Development of Integrated Calculation Function Module Based on Metcloud(2019CLJY06) and Gene Enginee-ring of Rare and Precious Metal Materialsin Yunnan Province-R & D and Demonstration Application of High-throughput Integrated Computing and Data Analysis Technology for Rare and Precious Metal Materials(2019ZE001-1,202002AB080001).

摘要: 利用深度学习模型和注意力机制对微博文本进行细粒度情感分类,已成为研究的热点,但是现有注意力机制只考虑单词对单词的影响,对单词本身的多种维度特性(如词义、词性、语义等特征信息)缺乏有效的融合。为了解决这个问题,文中提出了一种双重权重机制WDWM(Word and Dimension Weight Mechanism),并将其与基于解析依赖树的GCN模型相结合,通过选择每条微博中含有关键信息的单词,来抽取单词的重要维度特性,对单词的多种维度特性进行有效融合,从而捕获更加丰富的特征信息。在针对微博细粒度情感分类的实验中,融合双重权重机制和图卷积神经网络的微博细粒度情感分类模型(WDWM-GCN)的F测度达到了84.02%,比2020年提出的最新的算法高出1.7%,这进一步证明,WDWM-GCN能对单词的多维度特性进行有效的融合,能够捕获丰富的特征信息。在对搜狗新闻数据集进行分类的实验中,BERT模型在加入WDWM后,其分类效果得到了进一步提升,这充分证明 WDWM对所提分类模型有明显的改进效果。

关键词: 双重权重机制, 图卷积神经网络, 细粒度情感分类, 注意力机制

Abstract: Using deep learning models and attention mechanisms to classify fine-grained emotions of Chinese microblogs has become a research hotspot.However,the existing attention mechanisms consider the impact of words on words,and lack effective integration of the various dimensional characteristics of the words themselves (such as word meaning,part of speech,semantics and other characteristic information).In order to solve this problem,the paper proposes a dual weight mechanism WDWM (word and dimension weight mechanism),and combines it with the GCN model based on the analytical dependency tree,so that it can not only select the words that contain key information in each microblog,but also extract the important dimensional characteristics of the word and effectively integrate multiple dimensional characteristics of words,so as to capture more rich feature information.The F measure of fine-grained sentiment classification of Chinese microblogs combining dual weight mechanism and graph convolutional neural network(WDWM-GCN) reaches 84.02%,which is 1.7% higher than the latest algorithm proposed by WWW in 2020,which further proves that WDWM-GCN can effectively integrate the multi-dimensional characteristics of words and capture rich feature information.In the experiment on the classification of Sogou news data set,after the BERT model is addedto the WDWM mechanism,the classification effect is further improved,which fully provs that the WDWM has a significant improvement on the text classification model.

Key words: Attention mechanism, Dual weight mechanism, Fine-grained emotion classification, Graph convolutional neural network

中图分类号: 

  • TP391
[1]HU M,LIU B.Mining and Summarizing Customer Reviews[C]//Tenth Acm Sigkdd International Conference on Know-ledge Discovery & Data Mining ACM.2004:168-177.
[2]LI R,LIN Z,LIN H L,et al.A Review of Text Sentiment Ana-lysis[J].Journal of Computer Research and Development,2018,55(1):30-52.
[3]ZHAO J Q,GUI X L,ZHANG X J.Deep Convolution Neural Networks for Twitter Sentiment Analysis[J].IEEE Access,2018,6(1):23253-23260.
[4]WEI X,TAO L.Aspect Based Sentiment Analysis with Gated Convolutional Networks[J].arXiv:1805.07043,2018.
[5]ALIAKSEI S,MOSCHITTI A.Twitter Sentiment Analysiswith Deep Convolutional Neural Networks[C]//International ACM Sigir Conference.ACM,2015:959-962.
[6]PWINT OO S H,THEERAMUNKONG T,HUNG N D.Sentence Sentiment Classification Using Convolutional Neural Network in Myanmar Texts[C]//2020 2nd International Confe-rence on Image,Video and Signal Processing(IVSP ’20).2020.
[7]SOCHER R,PERELYGIN A,WU Y J,et al.Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank[C]//Conference on Empirical Methods in Natural Language Processing(EMNLP 2013).2013.
[8]ABDUL-MAGEED M,UNGAR L.Emonet:Fine-grained Emo-tion Detection with Gated Recurrent Neural Networks[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistic (ACL’17).2017:718-728.
[9]SACHIN S,TRIPATHI A,MAHAJAN N,et al.SentimentAnalysis Using Gated Recurrent Neural Networks[J].SN Computer Science,2020,1(2):1-13.
[10]BAZIOTIS C,PELEKIS N,DOULKERIDIS C.Datastories atsemeval-2017 Task 4:Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis [C]//Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017).2017.
[11]LU Y,ZHOU J,DAI H N,et al.Sentiment Analysis of Chinese Microblog Based on Stacked Bidirectional LSTM[C]//2018 15th International Symposium on Pervasive Systems,Algorithms and Networks (I-SPAN).2018.
[12]QIAN Q,HUANG M,LEI J,et al.Linguistically RegularizedLSTM for Sentiment Classification[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics.2017.
[13]BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate[J].arXiv:1409.0473,2014.
[14]TRAN N K,NIEDEREÉE C.Multihop Attention Networks for Question Answer Matching[C]//The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR’18).2018:325-334.
[15]LIU P,LIU T W,SHI J,et al.Aspect level sentiment classification with unbiased attention and target enhanced representations[C]//Proceedings of the 35th Annual ACM Symposium on Applied Computing (SAC ’20).2020:843-850.
[16]LEI Z,YANG Y,YANG M,et al.A Multi-sentiment-resource Enhanced Attention Network for Sentiment Classification[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Short Papers).2018.
[17]SHAO X L.Fine-Grained Sentiment Analysis Based on Hierarchical Attention Networks[J].Computer Ence and Application,2019,9(11):2143-2153.
[18]WANG Y,HUANG M,ZHU X,et al.Attention-based LSTM for Aspect-level Sentiment Classification[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing.2016.
[19]WU X H,CHEN L,WEI T T,et al.Sentiment analysis of Chinese short text based on Self-Attention and Bi-LSTM[J].Journal of Chinese Information Processing,2019,33(6):100-107.
[20]WANG Z T,YU Z W,GUO B,et al.Sentiment analysis of Chinese microblog based on lexicon and rule set[J].Computer Engineering & Applications,2015,51(8):218-225.
[21]VILARES D,ALONSO M A,GÓMEZ-RODRÍGUEZ C.Supervised Sentiment Analysis in Multilingual Environments[J].Information Processing & Management,2017,53(3):595-607.
[22]GAMON M.Sentiment classification on customer feedback data[C]//Proceedings of the 20th International Conference on Computational Linguistics.2004:841-847.
[23]LIU P F,MENG H L.Conditional random fields labeling andmaximum entropy classification for aspect based sentiment ana-lysis[C]//International Workshop on Semantic Evaluation.2014.
[24]BO P,LILLIAN L,SHIVAKUMAR V.Thumbs up? Sentiment Classification using Machine Learning Techniques[C]//EMNLP.2002:79-86.
[25]CHEN K,LIANG B,KE W D,et al.Sentiment Analysis of Chinese Weibo Based on Multi-channel Convolutional Neural Network[J].Computer Research and Development,2018,55(5):945-957.
[26]ZHENG S M,YANG M.A New Method of Improving BERT for Text Classification [C]//International Conference on Intelligent Science and Big Data Engineering.2019:442-452.
[27]ZHENG Z,LI L,JING C.Deeply Hierarchical Bi-directionalLSTM for Sentiment Classification[J].Computer Science,2018,45(8):213-217.
[28]WANG J L,YANG Y X,WANG X M.ResLCNN model for short text classification[J].Journal of Software,2017,28(s2):61-69.
[29]YANG Z C,YANG D Y,DYER C,et al.Hierarchical attention networks for document classification[C]//ACL.2016:1480-1489.
[30]LUONG M,PHAM H,MANNING C.Effective approaches to attention-based neural machine translation[C]//EMNLP.2015.
[31]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Proceedings of the 31st International Confe-rence on Neural Information Processing Systems.2017:6000-6010.
[32]DU C,HUANG L.Text Classification Research with Attention-based Recurrent Neural Networks[J].International Journal of Computers,Communications & Control (IJCCC),2018,13(1):50.
[33]ZHANG J,DUAN L G,LI A P,et al.Fine-grained SentimentAnalysis Based on Combination of Attention and Gated Mechanism[J].Computer Science,2021,48(8):226-233.
[34]KIP F T,WELLING M.Semi-supervised classification withgraph convolutional networks[C]//ICLR.2016.
[35]LAI Y,ZHANG L,HAN D,et al.Fine-Grained Emotion Classification of Chinese Microblogs Based on Graph Convolution Networks[J].World Wide Web,2020,23(4):1-17.
[36]WEN S,WAN X.Emotion Classification in Microblog TextsUsing Class Sequential Rules[C]//AAAI.2014:187-193.
[37]JIANG F,LIU Y,LUAN H,et al.Microblog Sentiment Analysis with Emoticon Space Model[C]//Chinese National Confe-rence on Social Media Processing.Berlin:Springer,2014.
[38]YE P,KUMAR J,KANG L,et al.Unsupervised feature lear-ning framework for no-reference image quality assessment[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition.2012:1098-1105.
[39]YOON K.Convolutional Neural Networks for Sentence Classification[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing.2014.
[40]LIU P F,QIU X P,HUANG X J,et al.Recurrent Neural Network for Text Classification with Multi-Task Learning[C]//Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence.2016:2873-2879.
[41]JOULIN A,GRAVE E,BOJANOWSKI P,et al.Bag of Tricks for Efficient Text Classification[J].arXiv:1607.01759,2016.
[42]JOHNSON R,ZHANG T.Deep Pyramid Convolutional Neural Networks for Text Categorization[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics(Long Papers).2017.
[43]DEVLIN J,CHANG M W,LEE K,et al.Pre-training of Deep Bidirectional Transformers for Language Understanding[J].arXiv:1810.04805,2018.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[3] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[4] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[5] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[6] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[7] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[10] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[11] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[12] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[13] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[14] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[15] 徐鸣珂, 张帆.
Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法
Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition
计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!