计算机科学 ›› 2021, Vol. 48 ›› Issue (5): 217-224.doi: 10.11896/jsjkx.200500076

• 人工智能 • 上一篇    下一篇

基于ATT-DGRU的文本方面级别情感分析

尹久, 池凯凯, 宦若虹   

  1. 浙江工业大学计算机科学与技术学院 杭州310023
  • 收稿日期:2020-05-18 修回日期:2020-07-30 出版日期:2021-05-15 发布日期:2021-05-09
  • 通讯作者: 池凯凯(kkchi@zjut.edu.cn)
  • 基金资助:
    国家自然科学基金(61872322)

Aspect-level Sentiment Analysis of Text Based on ATT-DGRU

YIN Jiu, CHI Kai-kai, HUAN Ruo-hong   

  1. School of Computer Science and Technology,Zhejiang University of Technology,Hangzhou 310023,China
  • Received:2020-05-18 Revised:2020-07-30 Online:2021-05-15 Published:2021-05-09
  • About author:YIN Jiu,born in 1996,postgraduate.Her main research interest include machine learning and so on.
    CHI Kai-kai,born in 1980,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include wireless networks and machine learning.
  • Supported by:
    National Natural Science Foundation of China(61872322).

摘要: 方面级别情感分类是针对给定文本、分析其在给定方面所表达出的情感极性。现有的主流解决方案中,基于注意力机制的循环神经网络模型忽略了关键词邻近上下文信息的重要性,而结合卷积神经网络(Convolutional Neural Network,CNN)的多层模型不擅长捕捉句子级别的长距离依赖信息。因此,提出了一种基于截断循环神经网络(Disconnected Gated Recurrent Units,DGRU)和注意力机制的方面级别情感分类网络模型(Attention-Disconnected Gated Recurrent Units,ATT-DGRU)。DGRU网络综合了循环神经网络和CNN的优点,既能捕捉文本的长距离依赖语义信息,又可以很好地抽取关键短语的语义信息。注意力机制在推断方面情感极性时捕获每一个单词与给定方面的关联程度,同时生成一个情感权重向量用于可视化。ATT-DGRU模型在中文酒店评论数据集上进行ACSA任务,任务结果表明,其二分类、三分类准确率分别达到91.53%,86.61%;在SemEval2014-Restaurant数据集进行ATSA任务,任务结果表明,其二分类、三分类准确率分别可达90.06%,77.21%。

关键词: 方面情感分析, 截断循环神经网络, 深度学习

Abstract: Aspect-level sentiment classification is to analyze the sentiment polarity in a given aspect for a given text.In the exis-ting mainstream solutions,the attention mechanism-based cyclic neural network model ignores the importance of keyword pro-ximity context information,and the CNN multilayer model is not good at capturing sentence-level long-distance dependency information.This paper proposes an aspect-level emotion classification network model based on disconnected gated recurrent units(DGRU) and attention mechanism,which is abbreviated as ATT-DGRU.The DGRU network used in this model integrates the advantages of circular neural network and CNN.It can not only capture the long-distance dependent semantic information of text,but also extract the semantic information of key phrases.Attention mechanism is used to capture the importance of each word to a specific aspect when deducing the sentiment polarity of a specific aspect,meanwhile generates an emotional weight vector,which can be visualized.Accuracies of two-class and three-class of ATT-DGRU model constructed in this paper can reach 91.11% and 87.76% respectively in ACSA task on Chinese hotel review datasets.Accuracies of two-class and three class of ATT-DGRU model can reach 77.21% and 90.06% respectively in ATSA task on SemEval2014-Restaurant dataset.

Key words: Aspect-level sentiment classification, Deep learning, Disconnected gated recurrent unit

中图分类号: 

  • TP391
[1]PANG B,LEE L,VAITHYANATHAN S,et al.Thumbs up? Sentiment Classification using Machine Learning Techniques[C]//The ACL-02 Conference on Empirical Methods in Natural Language Processing.2002:79-86.
[2]ZHANG L,LIU B.Sentiment Analysis and Opinion Mining[M].Morgan:Claypool Publishers,2017:1-50.
[3]PANG B,LEE L.Opinion Mining and Sentiment Analysis[J].Foundations and Trends in Information Retrieval,2008,2(1):1-135.
[4]PONTIKI M,GALANIS D,PAVLOPOULOS J,et al.SemEval-2014 Task 4:Aspect Based Sentiment Analysis[C]//International Conference on Computational Linguistics.2014:27-35.
[5]GOODFELLOW I,BENGIO Y,COURVILLE A.Deep Learning[M].Massachusetts:The MIT Press,2016:24-31.
[6]WANG B.Disconnected Recurrent Neural Networks for TextCategorization[C]//the 56th Annual Meeting of the Association for Computational Linguistics.2018:2311-2320.
[7]CHO K,VAN MERRIENBOER B,GULCEHRE C,et al.Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[J].arXiv:1406.1078,2014.
[8]GERS F A,SCHMIDHUBER J,CUMMINS F.Learning to Forget:Continual Prediction with LSTM[J].Neural Computation,2000,12(10):2451-2471.
[9]ZHANG Z,ZOU Y,GAN C.Textual sentiment analysis viathree different attention convolutional neural networks and cross-modality consistent regression[J].Neurocomputing,2018,275:1407-1415.
[10]AKHTAR M S,GUPTA D,EKBAL A,et al.Feature selection and ensemble construction:A two-step method for aspect based sentiment analysis[J].Knowledge Based Systems,2017,125(1):116-135.
[11]HAN H,ZHANG J,YANG J,et al.Generate domain-specificsentiment lexicon for review sentiment analysis[J].Multimedia Tools & Applications,2018,77(16):21265-21280.
[12]COLLOBERT R,WESTON J,BOTTOU L,et al.Natural Language Processing(Almost) from Scratch[J].Journal of Machine Learning Research,2011,12(1):2493-2537.
[13]KALCHBRENNER N,GREFENSTETTE E,BLUNSOM P,et al.A Convolutional Neural Network for Modelling Sentences[C]//Meeting of the Association for Computational Linguistics.2014:655-665.
[14]WANG L,CAO Z,DE MELO G,et al.Relation Classification via Multi-Level Attention CNNs[C]//the 54th Annual Meeting of the Association for Computational Linguistics,2016:1298-1307.
[15]TANG D,QIN B,LIU T,et al.Document Modeling with Gated Recurrent Neural Network for Sentiment Classification[C]//The 2015 Conference on Empirical Methods in Natural Language Processing.2015:1422-1432.
[16]TANG D,QIN B,FENG X,et al.Effective LSTMs for Target-Dependent Sentiment Classification[J].arXiv:1512.01100v2,2016.
[17]ZHANG M,ZHANG Y,VO D T,et al.Gated neural networks for targeted sentiment analysis[C]//Thirtieth AAAI Confe-rence on Artificial Intelligence,2016:3087-3093.
[18]SHUANG K,REN X,YANG Q,et al.AELA-DLSTMs:Attention-enabled and location-aware double LSTMs for aspect-level sentiment classification[J].Neurocomputing,2019,334:25-34.
[19]MA D,LI S,ZHANG X,et al.Interactive attention networks for aspect-level sentiment classification[J].arXiv:1709.00893,2017.
[20]XUE W,LI T.Aspect based sentiment analysis with gated convo-lutional networks[J].arXiv:1805.07043,2018.
[21]SHI Y,YAO K,TIAN L,et al.Deep LSTM based Feature Mapping for Query Classification[C]//The 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:1501-1511.
[22]YANG M,TU W,WANG J,et al.Attention Based LSTM for Target Dependent Sentiment Classification[C]//Thirty-First AAAI Conference on Artificial Intelligence.2017:5013-5014.
[23]CHEN P,SUN Z,BING L,et al.Recurrent attention network on memory for aspect sentiment analysis[C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing.2017:452-461.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[3] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[8] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[9] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[10] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[11] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[12] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[13] 刘伟业, 鲁慧民, 李玉鹏, 马宁.
指静脉识别技术研究综述
Survey on Finger Vein Recognition Research
计算机科学, 2022, 49(6A): 1-11. https://doi.org/10.11896/jsjkx.210400056
[14] 孙福权, 崔志清, 邹彭, 张琨.
基于多尺度特征的脑肿瘤分割算法
Brain Tumor Segmentation Algorithm Based on Multi-scale Features
计算机科学, 2022, 49(6A): 12-16. https://doi.org/10.11896/jsjkx.210700217
[15] 康雁, 徐玉龙, 寇勇奇, 谢思宇, 杨学昆, 李浩.
基于Transformer和LSTM的药物相互作用预测
Drug-Drug Interaction Prediction Based on Transformer and LSTM
计算机科学, 2022, 49(6A): 17-21. https://doi.org/10.11896/jsjkx.210400150
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!