Computer Science, 2021, Vol. 48, Issue (5): 217-224. doi: 10.11896/jsjkx.200500076
YIN Jiu (尹久), CHI Kai-kai (池凯凯), HUAN Ruo-hong (宦若虹)
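Abstract: Aspect-level sentiment classification determines the sentiment polarity that a given text expresses toward a given aspect. Among current mainstream solutions, attention-based recurrent neural network models overlook the importance of the context immediately surrounding key words, while multi-layer models that incorporate convolutional neural networks (CNNs) are poor at capturing sentence-level long-distance dependencies. This paper therefore proposes an aspect-level sentiment classification network, Attention-Disconnected Gated Recurrent Units (ATT-DGRU), built on a disconnected recurrent neural network (Disconnected Gated Recurrent Units, DGRU) and an attention mechanism. The DGRU network combines the strengths of recurrent neural networks and CNNs: it captures long-distance semantic dependencies in text while also extracting the semantics of key phrases well. When inferring the sentiment polarity of an aspect, the attention mechanism captures how strongly each word relates to the given aspect and also produces a sentiment weight vector that can be visualized. On the aspect-category sentiment analysis (ACSA) task with a Chinese hotel review dataset, ATT-DGRU reaches binary and three-class accuracies of 91.53% and 86.61%, respectively; on the aspect-term sentiment analysis (ATSA) task with the SemEval2014-Restaurant dataset, it reaches binary and three-class accuracies of 90.06% and 77.21%, respectively.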
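The two ingredients named in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes the usual disconnected-RNN formulation, in which the state at position t is rebuilt from only the last k inputs, and it assumes a plain dot-product score between each hidden state and an aspect vector, since the abstract does not state the scoring function. The names `gru_step`, `dgru`, `aspect_attention`, and `random_params` are hypothetical, the weights are random rather than trained, and input and hidden sizes are set equal for brevity.

```python
import math
import random


def random_params(d, seed=0):
    """Random (untrained) GRU weights; d is both the input and the hidden size (assumption)."""
    rng = random.Random(seed)

    def mat():
        return [[rng.uniform(-0.5, 0.5) for _ in range(d)] for _ in range(d)]

    p = {}
    for gate in "zrh":  # update gate, reset gate, candidate state
        p["W" + gate] = mat()
        p["U" + gate] = mat()
        p["b" + gate] = [0.0] * d
    return p


def gru_step(h, x, p):
    """One standard GRU update on plain Python lists."""
    d = len(h)

    def affine(W, U, b, hv):
        return [
            sum(W[i][j] * x[j] for j in range(d))
            + sum(U[i][j] * hv[j] for j in range(d))
            + b[i]
            for i in range(d)
        ]

    def sigmoid(v):
        return [1.0 / (1.0 + math.exp(-a)) for a in v]

    z = sigmoid(affine(p["Wz"], p["Uz"], p["bz"], h))
    r = sigmoid(affine(p["Wr"], p["Ur"], p["br"], h))
    reset_h = [r[i] * h[i] for i in range(d)]
    cand = [math.tanh(a) for a in affine(p["Wh"], p["Uh"], p["bh"], reset_h)]
    return [(1.0 - z[i]) * h[i] + z[i] * cand[i] for i in range(d)]


def dgru(xs, p, k):
    """Disconnected GRU: the state at position t sees only inputs t-k+1 .. t."""
    d = len(xs[0])
    states = []
    for t in range(len(xs)):
        h = [0.0] * d  # restart from a zero state for every window
        for x in xs[max(0, t - k + 1): t + 1]:
            h = gru_step(h, x, p)
        states.append(h)
    return states


def aspect_attention(states, aspect):
    """Softmax over dot-product scores (assumed scoring) between states and the aspect."""
    scores = [sum(a * b for a, b in zip(h, aspect)) for h in states]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # subtract max for numerical stability
    total = sum(exps)
    weights = [e / total for e in exps]
    context = [
        sum(w * h[i] for w, h in zip(weights, states)) for i in range(len(aspect))
    ]
    return weights, context
```

Calling `dgru(xs, random_params(3), k=2)` on a list of 3-dimensional word vectors returns one state per word; passing those states to `aspect_attention` yields one weight per word, which is the kind of vector the abstract describes visualizing. A small k makes the layer behave like a convolution over phrases, while a large k approaches an ordinary GRU.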