Computer Science (计算机科学), 2019, Vol. 46, Issue 6A: 93-97.
ZHANG Lu (张璐), SHEN Chen-lin (沈忱林), LI Shou-shan (李寿山)
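Abstract: Emotion analysis is an active research topic in natural language processing; it infers people's subjective feelings by analyzing the text they publish. Emotion classification, a basic task within emotion analysis, aims to determine the emotion category of a given text. For emotion classification, the representation of words plays a decisive role. Many existing word embedding learning algorithms model only the contextual semantic information of words and ignore their emotion information, so words that appear in similar contexts but carry opposite emotions end up with similar word vectors. To address this problem, emotion-specific word embeddings are learned by constructing a heterogeneous network composed of two basic networks: a document-word network and an emoticon-word network. Finally, an LSTM classifier is trained on labeled samples. Experimental results demonstrate the effectiveness of the proposed emotion-specific word embedding learning algorithm.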
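To make the two-network idea concrete, the following is a minimal sketch of how the edge weights of a document-word network and an emoticon-word network might be counted from a corpus. It is an illustration only, not the authors' implementation: the function name `build_heterogeneous_network`, the input layout (a list of `(tokens, emoticons)` pairs), and the use of raw co-occurrence counts as edge weights are all assumptions made for this example.

```python
from collections import Counter


def build_heterogeneous_network(documents):
    """Count edge weights for two bipartite networks.

    `documents` is a list of (tokens, emoticons) pairs. This input layout
    and the raw-count weighting are assumptions for the example; they are
    not taken from the paper.
    """
    # (doc_id, word) -> number of times the word occurs in that document
    doc_word = Counter()
    # (emoticon, word) -> number of word occurrences in documents
    # that contain the emoticon
    emoticon_word = Counter()

    for doc_id, (tokens, emoticons) in enumerate(documents):
        distinct_emoticons = set(emoticons)  # count each emoticon once per document
        for word in tokens:
            doc_word[(doc_id, word)] += 1
            for emo in distinct_emoticons:
                emoticon_word[(emo, word)] += 1
    return doc_word, emoticon_word


# Hypothetical toy corpus: two short posts, each with one emoticon.
corpus = [
    (["great", "movie", "great"], [":)"]),
    (["terrible", "movie"], [":("]),
]
doc_word, emoticon_word = build_heterogeneous_network(corpus)
```

In this toy corpus, "great" and "terrible" both co-occur with "movie", so context alone would place them close together, while the emoticon-word edges tie them to different emoticons. A network embedding method could then be trained jointly on both edge sets; that training step is outside the scope of this sketch.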