计算机科学 ›› 2021, Vol. 48 ›› Issue (6A): 158-164.doi: 10.11896/jsjkx.210200089

• 大数据&数据科学 • 上一篇    下一篇

考虑语境的微博短文本挖掘:情感分析的方法

史伟1, 付月2   

  1. 1 湖州师范学院经济管理学院 浙江 湖州313000
    2 湖州师范学院求真学院 浙江 湖州313000
  • 出版日期:2021-06-10 发布日期:2021-06-17
  • 通讯作者: 史伟(shiwei@zjhu.edu.cn)
  • 基金资助:
    国家社会科学基金一般项目“重大突发事件中网民情感状态演变规律及引导研究”的阶段性成果(20BXW013)

Microblog Short Text Mining Considering Context:A Method of Sentiment Analysis

SHI Wei1, FU Yue2   

  1. 1 School of Economics and Management,Huzhou University,Huzhou,Zhejiang 313000,China
    2 Qiuzhen College,Huzhou University,Huzhou,Zhejiang 313000,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:SHI Wei,born in 1981,Ph.D,professor.His main research interests include business intelligence and affective computing.
  • Supported by:
    General Program of National Social Science Foundation of China “Research on the Evolution Law and Guidance of Netizens' Sentiment State in Major Emergencies”(20BXW013).

摘要: 传统基于词典的情感分析方法中情感词语的极性和强度是固定和静态的,没有考虑情感词语随不同语义环境极性和强度的变化。为此,提出一种考虑语境的基于情感本体和情感圈的微博短文本情感分析方法。采用情感圈方法考虑不同语境中词语的共现模式,以捕获它们的语义并更新情感词语的极性和强度。结合已构建的情感本体和语义量化规则,建立考虑语义环境的微博短文本挖掘方法。实验结果表明,该方法从实体级和微博级两个层面,在精度、召回率、F值和准确率几个指标上都明显优于基线方法。

关键词: 情感本体, 情感分析, 情感圈, 微博短文本

Abstract: In the traditional dictionary based sentiment analysis,the polarity and intensity of sentiment words are fixed and static,without considering the change of polarity and intensity of sentiment words with different semantic environments.This paper proposes a sentiment analysis method of microblog short text based on sentiment ontology and sentiment circle considering context semantics.In order to capture their semantics and update the polarity and intensity of emotional words,we use the sentiment circle method to consider the co-occurrence patterns of words in different contexts.Combined with the constructed emotion ontology and semantic quantitative rules,a method of microblog short text mining considering semantic environment is established.The experimental results show that the proposed method is superior to the baseline method in terms of accuracy,recall,F value and accuracy from both entity level and microblog level.

Key words: Microblog short text, Sentiment analysis, Sentiment circle, Sentiment ontology

中图分类号: 

  • TP391.1
[1] LIU B.Sentiment analysis and subjectivity[J].Handbook ofNatural Language Processing,2010,2:568.
[2] SHI W,WANG H W,HE S Y.EOSentiMiner:an opinion-aware system based on emotion ontology for sentiment analysis of Chinese online reviews[J].Journal of Experimental & Theoretical Artificial Intelligence,2015,27(4):423-448.
[3] YI S S,LIU X F.Machine learning based customer sentimentanalysis for recommending shoppers,shops based on customers' review[J].Complex & Intelligent Systems,2020,6:621-634.
[4] FUSILIER D H,MONTES-Y-GOMEZ M,ROSSO P,et al.Detecting positive and negative deceptive opinions using PU-learning[J].Information processing & management,2015,51(4):433-443.
[5] 秦锋,王恒,郑啸.基于上下文语境的微博情感分析[J].计算机工程,2017,43(3):241-246,252.
[6] 卢欣,李旸,王素格.融合语言特征的卷积神经网络的反讽识别方法[J].中文信息学报,2019,33(5):31-38.
[7] 吴杰胜,陆奎.基于多部情感词典和规则集的中文微博情感分析研究[J].计算机应用与软件,2019,36(9):93-99.
[8] 李继东,王移芝.基于扩展词典与语义规则的中文微博情感分析[J].计算机与现代化,2018,270(2):90-95.
[9] AHMED M,CHEN Q,LI Z H.Constructing domain-dependent sentiment dictionary for sentiment analysis[J].Neural Computing and Applications,2020,32:14719-14732.
[10] ISMAIL H M,BELKHOUCHE B,ZAKI N.Semantic Twitter sentiment analysis based on a fuzzy thesaurus[J].Soft Computing,2018,22:6011-6024.
[11] 景丽,李曼曼,何婷婷.结合扩充词典与自监督学习的网络评论情感分类[J].计算机科学,2020,47(S2):78-82.
[12] SAIF H,FERNANDEZ M,HE Y,et al.Senticircles for contextual and conceptual semantic sentiment analysis of Twitter[C]//Proc. 11th Extended Semantic Web Conf.(ESWC).Crete,Greece,2014.
[13] WITTGENSTEIN L.Philosophical investigations[D].London,UK:Blackwell,1953.
[14] TURNEY P D,PANTEL P.From frequency to meaning:Vector space models of semantics[J].Journal of Artificial Intelligence Research,2010,37(1):141-188.
[15] 史伟,王洪伟,何绍义.基于微博的产品评论挖掘:情感分析的方法[J].情报学报,2014,33(12):1311-1321.
[16] 史伟,王洪伟,何绍义.基于微博的情感分析的电影票房预测研究[J].华中师范大学学报(自然科学版),2015,49(1):66-72.
[17] 史伟,王洪伟,何绍义.基于知网的模糊情感本体的构建研究[J].情报学报,2012,31(6):595-602.
[18] HowNet[R/OL].HowNet sHomePage.http ://www.keenage.com.
[1] 丁锋, 孙晓.
基于注意力机制和BiLSTM-CRF的消极情绪意见目标抽取
Negative-emotion Opinion Target Extraction Based on Attention and BiLSTM-CRF
计算机科学, 2022, 49(2): 223-230. https://doi.org/10.11896/jsjkx.210100046
[2] 袁景凌, 丁远远, 盛德明, 李琳.
基于视觉方面注意力的图像文本情感分析模型
Image-Text Sentiment Analysis Model Based on Visual Aspect Attention
计算机科学, 2022, 49(1): 219-224. https://doi.org/10.11896/jsjkx.201000074
[3] 胡艳丽, 童谭骞, 张啸宇, 彭娟.
融入自注意力机制的深度学习情感分析方法
Self-attention-based BGRU and CNN for Sentiment Analysis
计算机科学, 2022, 49(1): 252-258. https://doi.org/10.11896/jsjkx.210600063
[4] 戴宏亮, 钟国金, 游志铭, 戴宏明.
基于Spark的舆情情感大数据分析集成方法
Public Opinion Sentiment Big Data Analysis Ensemble Method Based on Spark
计算机科学, 2021, 48(9): 118-124. https://doi.org/10.11896/jsjkx.210400280
[5] 张瑾, 段利国, 李爱萍, 郝晓燕.
基于注意力与门控机制相结合的细粒度情感分析
Fine-grained Sentiment Analysis Based on Combination of Attention and Gated Mechanism
计算机科学, 2021, 48(8): 226-233. https://doi.org/10.11896/jsjkx.200700058
[6] 潘芳, 张会兵, 董俊超, 首照宇.
基于高效Transformer的中文在线课程评论方面情感分析
Aspect Sentiment Analysis of Chinese Online Course Review Based on Efficient Transformer
计算机科学, 2021, 48(6A): 264-269. https://doi.org/10.11896/jsjkx.200800116
[7] 张明阳, 王刚, 彭起, 张岩峰.
学术论文公开评审平台数据分析
Data Analysis of OpenReview
计算机科学, 2021, 48(6): 63-70. https://doi.org/10.11896/jsjkx.200500138
[8] 尹久, 池凯凯, 宦若虹.
基于ATT-DGRU的文本方面级别情感分析
Aspect-level Sentiment Analysis of Text Based on ATT-DGRU
计算机科学, 2021, 48(5): 217-224. https://doi.org/10.11896/jsjkx.200500076
[9] 李建兰, 潘岳, 李小聪, 刘子维, 王天宇.
基于CiteSpace的中文评论文本研究现状与趋势分析
Chinese Commentary Text Research Status and Trend Analysis Based on CiteSpace
计算机科学, 2021, 48(11A): 17-21. https://doi.org/10.11896/jsjkx.210300172
[10] 杨青, 张亚文, 朱丽, 吴涛.
基于注意力机制和BiGRU融合的文本情感分析
Text Sentiment Analysis Based on Fusion of Attention Mechanism and BiGRU
计算机科学, 2021, 48(11): 307-311. https://doi.org/10.11896/jsjkx.201000075
[11] 徐琳宏, 刘鑫, 原伟, 祁瑞华.
俄语多模态情感语料库的构建及应用
Construction and Application of Russian Multimodal Emotion Corpus
计算机科学, 2021, 48(11): 312-318. https://doi.org/10.11896/jsjkx.200900088
[12] 张严, 李天瑞.
面向评论的方面级情感分析综述
Review of Comment-oriented Aspect-based Sentiment Analysis
计算机科学, 2020, 47(6): 194-200. https://doi.org/10.11896/jsjkx.200200127
[13] 林敏鸿, 蒙祖强.
基于注意力神经网络的多模态情感分析
Multimodal Sentiment Analysis Based on Attention Neural Network
计算机科学, 2020, 47(11A): 508-514. https://doi.org/10.11896/jsjkx.191100041
[14] 李苑,李智星,滕磊,王化明,王国胤.
基于注意力机制的评论情感分析及情感词检测
Comment Sentiment Analysis and Sentiment Words Detection Based on Attention Mechanism
计算机科学, 2020, 47(1): 186-192. https://doi.org/10.11896/jsjkx.181002011
[15] 王立志,慕晓冬,刘宏岚.
采用改进粒子群优化的SVM方法实现中文文本情感分类
Using SVM Method Optimized by Improved Particle Swarm Optimization to Analyze Emotion of Chinese Text
计算机科学, 2020, 47(1): 231-236. https://doi.org/10.11896/jsjkx.181102130
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!