计算机科学 ›› 2021, Vol. 48 ›› Issue (8): 226-233.doi: 10.11896/jsjkx.200700058

所属专题: 自然语言处理 虚拟专题

• 人工智能 • 上一篇    下一篇

基于注意力与门控机制相结合的细粒度情感分析

张瑾, 段利国, 李爱萍, 郝晓燕   

  1. 太原理工大学信息与计算机学院 太原030024
  • 收稿日期:2020-07-09 修回日期:2020-08-14 发布日期:2021-08-10
  • 通讯作者: 段利国(463035793@qq.com)
  • 基金资助:
    山西省科技厅基础研究计划项目(201801D121137)

Fine-grained Sentiment Analysis Based on Combination of Attention and Gated Mechanism

ZHANG Jin, DUAN Li-guo, LI Ai-ping, HAO Xiao-yan   

  1. College of Information and Computer,Taiyuan University of Technology,Taiyuan 030024,China
  • Received:2020-07-09 Revised:2020-08-14 Published:2021-08-10
  • About author:ZHANG Jin,born in 1996,postgra-duate.Her main research interests include sentiment analysis and natural language processing.(295511703@qq.com)DUAN Li-guo,born in 1970,Ph.D,associate professor,is a senior member of China Computer Federation.His main research interests include automatic question answering system,text sentiment analysis,entity relationship extraction and knowledge mapping.
  • Supported by:
    Basic Research Project of Shanxi Province(201801D121137).

摘要: 细粒度情感分析(fine-grained sentiment analysis)是自然语言处理领域的关键问题之一,其通过学习文本的上下文信息来进行特定方面的情感分析,可以帮助用户和商家更好地了解用户评论特定方面的情感。针对基于用户评论的方面级别细粒度情感分析任务,提出了BiGRU-Attention与门控机制(gated mechanisms)相结合的文本情感分类模型。首先,通过整合现有的情感资源,将HOWNET评价情感词典作为种子情感词典,利用SO-PMI算法扩充用户评论情感词典,结合否定词典以及词性信息扩充用户评论情感知识,将用户评价情感知识作为用户评论情感特征信息;其次,引入字词特征与情感特征信息,将它们联合作为模型输入,使用BiGRU对文本进行深层次的特征提取;然后,结合门控机制以及注意力机制,根据获取的方面词信息进一步提取与方面词相关的上下文情感特征信息;最后,在输出层进行文本情感分析,经过softmax获得最终的情感极性。在AIchallenger2018细粒度情感分析中文数据集上,所提模型的Macro_F1_ score值达到了0.7218,性能超过基线系统,获得了较好的实验结果。

关键词: BiGRU, 门控机制, 情感分析, 深度学习, 注意力机制

Abstract: The fine-grained sentiment analysis is one of the key problems in the area of natural language processing.By learning contextual information of the text to conduct sentiment analysis on specific aspects,it can help users and businesses to better understand the sentiment information of specific aspects of users' comments.Aiming at the task of fine-grained sentiment analysis on users' comments,a text sentiment classification model combining BiGRU-attention and Gated Mechanisms is proposed.By integrating existing sentiment resources,HOWNET evaluation sentiment dictionary is used as the seed sentiment dictionary to expand the user comment sentiment dictionary through SO-PMI algorithm,the negative dictionary and part of speech information are combined to expand the user comment sentiment knowledge as the users' comment sentiment characteristic information.Introducing word,character and sentiment characteristics as the model of input infotmation,and using BiGRU to extract deep text features,then combined with gated mechanism as well as the attention mechanism,according to the acquired aspect word information to further extract the contextual sentiment characteristics related to aspect words,the final sentiment polarity is obtained by the softmax classfier.Experimental results show that the proposed model achieves better experimental results on the AIchallenger 2018 fine-grained sentiment analysis Chinese data sets,the Macro_F1_ score value reaches 0.7218,and the performance exceeds the baseline system.

Key words: Attention mechanism, BiGRU, Deep learning, Gated mechanism, Sentiment analysis

中图分类号: 

  • TP391
[1]LI Y,LI Z X,TENG L,et al.Comment Sentiment Analysis and Sentiment Words Detection Based on Attention Mechanism [J].Computer Science,2020,47(1):186-192.
[2]ZHAO Y Y,QIN B,LIU T.Sentimentanalysis[J].Journal of Software,2010,21(8):1834-1848.
[3]TANG X B,LIU G C.A Review of Studies on Fine-grained Sentiment Analysis[J].Library and Information Service,2017(5):132-140.
[4]DOHAIHA H H,PRASAD P,MAAG A,et al.Deep Learning for Aspect-Based Sentiment Analysis:A Comparative Review[J].Expert Systems with Applications,2019,118(1):272-299.
[5]LI Y H,XIE M,YI Y.Fine-grained Sentiment Analysis for Social Network Platform based on Deep-learning model [J].Application Research of Computers,2017,34(3):743-747.
[6]MALHOTRA P,VIG L,SHROFF G,et al.Long Short Term Memory Networks for Anomaly Detection in Time Series[C]//European Symposium on Artificial Neural Networks.2015.
[7]CHUNG J,GULCEHRE C,CHO K H,et al.Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling[J].arXiv:1412.3555,2014.
[8]ZHANG Z,ROBINSON D,TEPPER J.Detecting hate speech on Twitter using a convolution-GRU based deep neural network[C]//ESWC 2018.Springer,Cham,2018.
[9]RAFFEL C,ELLIS D P W.Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems[J].arXiv:1512.08756,2015.
[10]TANG D,QIN B,FENG X,et al.Effective LSTMs for Target-Dependent Sentiment Classification[J].arXiv:1512.01100,2015.
[11]WANG Y,HUANG M,ZHU X,et al.Attention-based LSTM for Aspect-level Sentiment Classification[C]//Conference on Empirical Methods in Natural Language Processing.2016.
[12]HUANG B,OU Y,CARLEY K M.Aspect Level SentimentClassification with Attention-over-Attention Neural Networks[J].arXiv:1804.06536,2018.
[13]CHENG J,ZHAO S,ZHANG J,et al.Aspect-level Sentiment Classification with HEAT (HiErarchical ATtention) Network[C]//ACM.2017:97-106.
[14]WEI X,TAO L.Aspect Based Sentiment Analysis with GatedConvolutional Networks[J].arXiv:1805.07043,2018.
[15]SHUANG K,REN X,YANG Q,et al.AELA-DLSTMs:Attention-Enabledand Location-Aware Double LSTMs for aspect-level sentiment classification[J].Neurocomputing,2019,334:25-34.
[16]TURNEY P D,LITTMAN M L.Measuring praise and criti-cism:inference of semantic orientation from association[J].Acm Transactions on Information Systems,2003,21(4):315-346.
[17]CHUNG J,GULCEHRE C,CHO K H,et al.Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling[J].arXiv:1412.3555,2014.
[18]JOZEFOWIC Z,RAFA L,ZAREMB A,et al.An Empirical Exploration of Recurrent Network Architectures[C]//Proceedings of the 32nd International Conference on Machine Learning.2015:2342-2350.
[19]DAUPHIN Y N,FAN A,AULI M,et al.Language Modeling with Gated Convolutional Networks[J].arXiv:1612.08083,2016.
[1] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[2] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[5] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[6] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[7] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[8] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[9] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[10] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[11] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[12] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[13] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[14] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[15] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!