计算机科学 ›› 2019, Vol. 46 ›› Issue (6A): 93-97.

• 智能计算 • 上一篇    下一篇

基于情绪特定词向量的情绪分类算法

张璐, 沈忱林, 李寿山   

  1. 苏州大学计算机科学与技术学院 江苏 苏州215006
  • 出版日期:2019-06-14 发布日期:2019-07-02
  • 通讯作者: 李寿山(1980-),男,教授,主要研究方向为自然语言处理、情感分析,E-mail:lishoushan.suda.edu.cn(通信作者)。
  • 作者简介:张 璐(1994-),女,硕士生,主要研究方向为自然语言处理、情感分析,E-mail:lzhang0107@stu.suda.edu.cn;沈忱林(1993-),男,硕士生,主要研究方向为自然语言处理、情感分析;
  • 基金资助:
    本文受国家自然科学基金(61331011,61375073)资助。

Emotion Classification Algorithm Based on Emotion-specific Word Embedding

ZHANG Lu, SHEN Chen-lin, LI Shou-shan   

  1. School of Computer Science & Technology,Soochow University,Suzhou,Jiangsu 215006,China
  • Online:2019-06-14 Published:2019-07-02

摘要: 情绪分析是自然语言处理领域的一个研究热点,其通过分析人们发布的文本推测人们的主观感受。情绪分类是情绪分析中的一个基本任务,旨在判断一个文本的情绪类别。对情绪分类来说,词语的表示具有决定性的作用。许多现有的词向量学习算法只对词语的上下文语义信息进行建模,而忽略了词语的情绪信息,这样会导致上下文相似但情绪相反的词语有相似的词向量。为了解决该问题,通过构建一个由两个基本网络(即文档-词网络和情绪图标-词网络)组成的异构网络来学习情绪特定的词向量。最后,在标注样本上训练一个LSTM分类器。实验结果表明了所提情绪特定词向量学习算法的有效性。

关键词: 词向量, 情感分析, 情绪分类

Abstract: Emotion analysis is a hot research issue in the field of NLP,and it infersthe feelings of individuals through analyzing the text they have published.Emotion classification is a fundamental task in emotion analysis,which aims to determine the emotion categories in a piece of text.The representation of words is a critical prerequisite for emotion classification.Many intuitive choices of learning word embedding are available,but these word embedding algorithms typically model the syntactic context of words but ignore the emotion information relevant to words.As a result,words with opposite emotion but similar syntactic context tend to be represented as close vectors.To address the problem,this paper proposeda a heterogeneous network composed of two basic networks,i.e.,document-word network and emoticon-word network to learn emotion-specific word embedding .Finally,an LSTM network was trained on the labeled data.Empirical studies demonstrate the effectiveness of the proposed approach to learn emotion-specific word embedding.

Key words: Emotion classification, Sentiment analysis, Word Embedding

中图分类号: 

  • TP391
[1]MIKOLOV T,SUTSKEVER I,CHEN K,et al.Distributed representations of words and phrases and their compositionality [J].Advances in Neural Information Processing Systems,2013,26:3111-3119.
[2]COLLOBERT R,WESTON J,BOTTOU L,et al.Natural language processing(almost) from scratch [J].Journal of Machine Learning Research,2011,12(1):2493-2537.
[3]TURIAN J,RATINOV L,BENGIO Y.Word representations:a simple and general method for semi-supervised learning[C]∥Proceedings of the Meeting of the Association for Computational Linguistics.2010:384-394.
[4]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space [J].arXiv:1301.3781.
[5]TANG J,QU M,WANG M,et al.LINE:large-scale information network embedding[C]∥Proceedings of the International World Wide Web Conference.2015:1067-1077.
[6]黄磊,李寿山,周国栋.基于句法信息的微博情绪识别方法研究 [J].计算机科学,2017,44(2):244-249.
[7]LIU H H,LI S S,ZHOU G D,et al.Joint modeling of news reader’s and comment writer’s emotions[C]∥Proceedings of the Meeting of the Association for Computational Linguistics.2013:511-515.
[8]ABDUL-MAGEED M,UNGAR L.EmoNet:fine-grained emotion detection with gated recurrent neural networks[C]∥Proceedings of the Meeting of the Association for Computational Linguistics.2017:718-728.
[9]LI S S,HUANG L,WANG R,et al.Sentence-level emotion classification with label and context dependence[C]∥Procee-dings of the Meeting of the Association for Computational Linguistics.2015:1045-1053.
[10]KOZAREVA Z,NAVARRO B,VAZQUEZ S,et al.UA-ZBSA:a headline emotion classification through web information[C]∥Proceedings of theInternational Workshop on Semantic Evaluations.Association for Computational Linguistics.2007:334-337.
[11]WEN S,WAN X.Emotion classification in microblog texts using class sequential rules[C]∥Proceedings of theAAAI Conference on Artificial Intelligence.2014:187-193.
[12]LI S S,HUANG L,WANG R,et al.Sentence-level emotion classification with label and context dependence[C]∥Procee-dings of the Meeting of the Association for Computational Linguistics.2015:1045-1053.
[13]ALM C C,ROTH D,SPROAT R.Emotions from text:machine learning for text-based emotion prediction[C]∥Proceedings of the Conference on Empirical Methods in Natural Language Processing.2005:579-586.
[14]LI C X,WU H M,JIN Q.Emotion classification of Chinese microblog text via fusion of bow and evector feature representations [C]∥Communications in Computer and Information Scie-nce.2014:217-228.
[15]LI S S,XU J,ZHANG D,et al.Two-view label propagation to semi-supervised reader emotion classification[C]∥Proceedings of theInternational Conference on Computational Linguistics.2016:2647-2655.
[16]BENGIO Y,DUCHARME R,VINCENT P,et al.A neural probabilistic language model[J].Journal of Machine Learning Research,2003,3:1137-1155.
[17]MNIH A,HINTON G.A scalable hierarchical distributed language model [C]∥Proceedings of theInternational Conference on Neural Information Processing Systems.2008:1081-1088.
[18]SOCHER R,BAUER J,MANNING C D,et al.Parsing with compositional vector grammars[C]∥Proceedings of the Mee-ting of the Association for Computational Linguistics.2013:455-165.
[19]TANG D Y,QIN B,LIU T,et al.Learning sentence representa-tion for emotion classification on microblogs[C]∥Proceedings of the Meeting of theNatural Language Processing and Chinese Computing.2013:212-223.
[20]XU R F,CHEN T,XIA Y Q,et al.Word embedding composition for data imbalances in sentiment and emotion classification [J].Cognitive Computation,2015,7(2):226-240.
[21]WANG Z Q,ZHANG Y,LEE S Y M,et al.A bilingual attention network for code-switched emotion prediction[C]∥Proceedings of theInternational Conference on Computational Linguistics.2016:1624-1634.
[22]LABUTOV I,LIPSON H.Re-embedding words[C]∥Procee-dings of the Meeting of the Association for Computational Linguistics.2013:489-493.
[23]HUANG L,LI S S,ZHOU G D.Emotion corpus construction on microblog[C]∥Proceedings of the Chinese Lexical Semantics Workshop.2015:204-212.
[24]NIU F,RECHT B,RE C,et al.Hogwild:a lock-free approach to parallelizing stochastic gradient descent[C]∥Proceedings of theInternational Conference on Neural Information Processing Systems.2011:693-701.
[25]TANG J,QU M,MEI Q Z.PTE:predictive text embedding through large-scale heterogeneous text networks[C]∥Procee-dings of the Knowledge Discovery in Database.2015:1165-1174.
[26]HOCHREITER S,SCHMIDHUBER J.Long short-term memory [J].Neural Computation,1997,9(8):1735-1780.
[27]GRAVES A.Generating sequences with recurrent neural net-works [J].arXiv:1308.0850.
[1] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[2] 姜胜腾, 张亦弛, 罗鹏, 刘月玲, 曹阔, 赵海涛, 魏急波.
语义通信系统的性能度量指标分析
Analysis of Performance Metrics of Semantic Communication Systems
计算机科学, 2022, 49(7): 236-241. https://doi.org/10.11896/jsjkx.211200071
[3] 韩红旗, 冉亚鑫, 张运良, 桂婕, 高雄, 易梦琳.
基于共同子空间分类学习的跨媒体检索研究
Study on Cross-media Information Retrieval Based on Common Subspace Classification Learning
计算机科学, 2022, 49(5): 33-42. https://doi.org/10.11896/jsjkx.210200157
[4] 刘硕, 王庚润, 彭建华, 李柯.
基于混合字词特征的中文短文本分类算法
Chinese Short Text Classification Algorithm Based on Hybrid Features of Characters and Words
计算机科学, 2022, 49(4): 282-287. https://doi.org/10.11896/jsjkx.210200027
[5] 丁锋, 孙晓.
基于注意力机制和BiLSTM-CRF的消极情绪意见目标抽取
Negative-emotion Opinion Target Extraction Based on Attention and BiLSTM-CRF
计算机科学, 2022, 49(2): 223-230. https://doi.org/10.11896/jsjkx.210100046
[6] 袁景凌, 丁远远, 盛德明, 李琳.
基于视觉方面注意力的图像文本情感分析模型
Image-Text Sentiment Analysis Model Based on Visual Aspect Attention
计算机科学, 2022, 49(1): 219-224. https://doi.org/10.11896/jsjkx.201000074
[7] 胡艳丽, 童谭骞, 张啸宇, 彭娟.
融入自注意力机制的深度学习情感分析方法
Self-attention-based BGRU and CNN for Sentiment Analysis
计算机科学, 2022, 49(1): 252-258. https://doi.org/10.11896/jsjkx.210600063
[8] 刘凯, 张宏军, 陈飞琼.
基于领域适应嵌入的军事命名实体识别
Name Entity Recognition for Military Based on Domain Adaptive Embedding
计算机科学, 2022, 49(1): 292-297. https://doi.org/10.11896/jsjkx.201100007
[9] 戴宏亮, 钟国金, 游志铭, 戴宏明.
基于Spark的舆情情感大数据分析集成方法
Public Opinion Sentiment Big Data Analysis Ensemble Method Based on Spark
计算机科学, 2021, 48(9): 118-124. https://doi.org/10.11896/jsjkx.210400280
[10] 张瑾, 段利国, 李爱萍, 郝晓燕.
基于注意力与门控机制相结合的细粒度情感分析
Fine-grained Sentiment Analysis Based on Combination of Attention and Gated Mechanism
计算机科学, 2021, 48(8): 226-233. https://doi.org/10.11896/jsjkx.200700058
[11] 史伟, 付月.
考虑语境的微博短文本挖掘:情感分析的方法
Microblog Short Text Mining Considering Context:A Method of Sentiment Analysis
计算机科学, 2021, 48(6A): 158-164. https://doi.org/10.11896/jsjkx.210200089
[12] 潘芳, 张会兵, 董俊超, 首照宇.
基于高效Transformer的中文在线课程评论方面情感分析
Aspect Sentiment Analysis of Chinese Online Course Review Based on Efficient Transformer
计算机科学, 2021, 48(6A): 264-269. https://doi.org/10.11896/jsjkx.200800116
[13] 杨进才, 曹元, 胡泉, 沈显君.
基于Transformer模型与关系词特征的汉语因果类复句关系自动识别
Relation Classification of Chinese Causal Compound Sentences Based on Transformer Model and Relational Word Feature
计算机科学, 2021, 48(6A): 295-298. https://doi.org/10.11896/jsjkx.200500019
[14] 张明阳, 王刚, 彭起, 张岩峰.
学术论文公开评审平台数据分析
Data Analysis of OpenReview
计算机科学, 2021, 48(6): 63-70. https://doi.org/10.11896/jsjkx.200500138
[15] 尹久, 池凯凯, 宦若虹.
基于ATT-DGRU的文本方面级别情感分析
Aspect-level Sentiment Analysis of Text Based on ATT-DGRU
计算机科学, 2021, 48(5): 217-224. https://doi.org/10.11896/jsjkx.200500076
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!