Computer Science ›› 2017, Vol. 44 ›› Issue (2): 244-249. doi: 10.11896/j.issn.1002-137X.2017.02.040

• Artificial Intelligence •

Research on Microblog Emotion Recognition Based on Syntactic Information

黄磊,李寿山,周国栋   

  1. School of Computer Science and Technology, Soochow University, Suzhou 215006
  • Publication date: 2018-11-13  Release date: 2018-11-13
  • Funding:
    This work was supported by the National Natural Science Foundation of China (61331011, 61375073, 61273320).

Emotion Recognition of Chinese Microblogs with Syntactic Information

HUANG Lei, LI Shou-shan and ZHOU Guo-dong   

  • Online:2018-11-13 Published:2018-11-13

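Abstract (translated from the Chinese): Emotion recognition aims to automatically identify whether a text contains emotion, and it is a basic task in sentiment analysis research. For this task, a method for recognizing emotion in microblog text based on syntactic information is proposed. The distinguishing feature of the method is that it takes full account of the syntactic information in microblog text. In the implementation, part-of-speech (POS) tag sequences and syntactic parse trees are first used to represent syntactic information, from which POS sequence patterns, rewrite rules, and bigrams of syntactic labels are extracted as features for text representation; a maximum entropy classifier is then applied to recognize emotion in microblog text. Experimental results show that the proposed method achieves good recognition performance.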

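Keywords (translated from the Chinese): natural language processing, microblog, emotion recognition, POS sequence pattern, syntactic tree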
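The three syntactic feature types named in the abstract (POS sequence patterns, rewrite rules, and bigrams of syntactic labels) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give exact definitions, so the sketch assumes that POS sequence patterns are approximated by POS bigrams, that rewrite rules are the parent-to-children productions of the parse tree, and that syntactic label bigrams are adjacent pairs of node labels in a pre-order traversal. The function names, the feature prefixes, and the toy bracketed tree (Penn-style tags on an English sentence) are all hypothetical.

```python
from collections import Counter


def parse_tree(bracketed):
    """Parse a bracketed tree such as '(S (NP (PRP I)))' into (label, children)."""
    tokens = bracketed.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        assert tokens[pos] == "("
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())       # nested constituent
            else:
                children.append(tokens[pos])  # a word (leaf)
                pos += 1
        pos += 1  # consume ")"
        return (label, children)

    return read()


def pos_tags(tree):
    """POS (preterminal) labels, left to right."""
    label, children = tree
    if all(isinstance(c, str) for c in children):
        return [label]
    return [t for c in children if not isinstance(c, str) for t in pos_tags(c)]


def rewrite_rules(tree):
    """Productions 'PARENT -> CHILD ...' above the POS level (lexical rules skipped)."""
    label, children = tree
    sub = [c for c in children if not isinstance(c, str)]
    if not sub:
        return []
    rules = [label + " -> " + " ".join(c[0] for c in sub)]
    for c in sub:
        rules.extend(rewrite_rules(c))
    return rules


def labels_preorder(tree):
    """All node labels in pre-order (assumed ordering for label bigrams)."""
    label, children = tree
    out = [label]
    for c in children:
        if not isinstance(c, str):
            out.extend(labels_preorder(c))
    return out


def bigrams(seq):
    return [(seq[i], seq[i + 1]) for i in range(len(seq) - 1)]


def syntactic_features(bracketed):
    """Count the three assumed feature types for one parsed sentence."""
    tree = parse_tree(bracketed)
    feats = Counter()
    for a, b in bigrams(pos_tags(tree)):
        feats["POS:" + a + "_" + b] += 1
    for rule in rewrite_rules(tree):
        feats["RULE:" + rule] += 1
    for a, b in bigrams(labels_preorder(tree)):
        feats["LABEL2:" + a + "_" + b] += 1
    return feats


# Hypothetical toy parse; a real pipeline would obtain trees from a parser.
toy = "(S (NP (PRP I)) (VP (VBP feel) (ADJP (RB so) (JJ happy))))"
features = syntactic_features(toy)
```

The resulting `features` counter maps feature strings to counts and could serve as the text representation passed to a classifier. In practice the parse would come from a Chinese word segmenter, tagger, and parser, none of which are specified in the abstract.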

Abstract: Emotion recognition aims to automatically determine whether a piece of text expresses emotion, and it is a basic task in sentiment analysis. This paper proposes an emotion recognition approach for Chinese microblogs based on syntactic information. The distinguishing feature of the approach is that it makes full use of the syntactic information in microblog text. Specifically, POS (part-of-speech) sequences and syntactic trees are used to represent syntactic information, from which POS sequence patterns, rewrite rules, and bigrams of syntactic labels are extracted as features for text representation. A maximum entropy classifier then performs the classification. Experimental results show that the approach is effective for emotion recognition.

Key words: Natural language processing, Microblog, Emotion recognition, POS sequence pattern, Syntax tree
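The classification step named in the abstract, maximum entropy classification, can be sketched for the two-class case (text contains emotion or not), where a maximum entropy model reduces to logistic regression over the feature counts. The sketch below is an illustration under stated assumptions, not the authors' setup: the abstract names no toolkit, optimizer, or hyperparameters, so plain stochastic gradient ascent on the log-likelihood is used, and the function names, the `BIAS` feature, the learning rate, the epoch count, and the four toy training examples are all hypothetical.

```python
import math


def score(weights, feats):
    """Linear score: sum of weight * feature value over active features."""
    return sum(weights.get(f, 0.0) * v for f, v in feats.items())


def predict_proba(weights, feats):
    """P(emotional | features) under the binary maximum entropy model."""
    return 1.0 / (1.0 + math.exp(-score(weights, feats)))


def train_maxent(data, epochs=200, lr=0.5):
    """Fit weights by stochastic gradient ascent on the log-likelihood.

    data: list of (feature_dict, label) pairs with label 1 (emotional) or 0.
    No regularization is applied, which is acceptable only for a toy example.
    """
    weights = {}
    for _ in range(epochs):
        for feats, y in data:
            p = predict_proba(weights, feats)
            for f, v in feats.items():
                # Gradient of the log-likelihood for one example: (y - p) * v
                weights[f] = weights.get(f, 0.0) + lr * (y - p) * v
    return weights


# Hypothetical toy training set of syntactic feature counts.
train = [
    ({"POS:RB_JJ": 1, "RULE:ADJP -> RB JJ": 1, "BIAS": 1}, 1),
    ({"POS:UH_PRP": 1, "BIAS": 1}, 1),
    ({"POS:NN_VBZ": 1, "RULE:NP -> DT NN": 1, "BIAS": 1}, 0),
    ({"POS:CD_NN": 1, "BIAS": 1}, 0),
]
weights = train_maxent(train)
is_emotional = predict_proba(weights, train[0][0]) > 0.5
```

For real data, an established library implementation with regularization and a held-out evaluation set would be the more sensible choice; this loop only shows how feature weights are adjusted toward the observed labels.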

