计算机科学 ›› 2018, Vol. 45 ›› Issue (12): 153-159.doi: 10.11896/j.issn.1002-137X.2018.12.024
张刚强, 刘群, 纪良浩
ZHANG Gang-qiang, LIU Qun, JI Liang-hao
摘要: 如何对评论数据进行正确的情感分类是情感分析中的重要研究内容。从粒计算和认知学角度,提出了一种基于序贯三支决策的多粒度中文评论情感分类方法。首先,基于评论数据集的特点,根据评论中情感信息量的多少,提出一种由粗到细的多粒度情感信息表示方法;然后,结合序贯三支决策的思想在不同粒度依据情感信息进行逐步计算,对边界域评论序贯地进行三支决策;最后,根据不同粒度的决策阈值和成本对评论做出最终的情感分类。对比实验结果表明,该方法在3个经典评论数据集上获得了更好的结果,具有更高的分类正确率和更强的鲁棒性。
中图分类号:
[1]LIU B.Sentiment Analysis and Opinion Mining[J].Synthesis Lectures on Human Language Technologies,2016,30(1):152-153. [2]CAMBRIA E.Affective Computing and Sentiment Analysis[J].IEEE Intelligent Systems,2016,31(2):102-107. [3]MEDHAT W,HASSAN A,KORASHY H.Sentiment analysis algorithms and applications:A survey[J].Ain Shams Enginee-ring Journal,2014,5(4):1093-1113. [4]RANA T A,CHEAH Y N.Aspect extraction in sentiment ana-lysis:comparative analysis and survey[J].Artificial Intelligence Review,2016,46(4):459-483. [5]TABOADA M,BROOKE J,TOFILOSKI M,et al.Lexicon-based methods for sentiment analysis[J].Computational Linguistics,2011,37(2):267-307. [6]ZOU H,TANG X,XIE B,et al.Sentiment Classification Using Machine Learning Techniques with Syntax Features[C]∥International Conference on Computatio-nal Science and Computational Intelligence.IEEE Computer Society,2015:175-179. [7]AGARWAL B,MITTAL N.Machine Learning Approach for Sentiment Analysis[M]∥Prominent Feature Extraction for Sentiment Analysis.Springer International Publishing,2016:21-45. [8]TRIPATHY A,AGRAWAL A,RATH S K.Classification ofSentiment Reviews using N-gram Machine Learning Approach[J].Expert Systems with Applications,2016,57(C):117-126. [9]YAO Y Y.An Outline of a Theory of Three-Way Decisions[C]∥International Conference on Rough Sets and Current Trends in Computing.Springer Berlin Heidelberg,2012:1-17. [10]ZHOU Z,SHANG L.A sentiment analysis method based on dynamic lexicon and three-way decision [J].Journal of Shandong University (Engineering Science),2015,45(1):19-23.(in Chinese) 周哲,商琳.一种基于动态词典和三支决策的情感分析方法[J].山东大学学报(工学版),2015,45(1):19-23. [11]ZHOU Z,ZHAO W,SHANG L.Sentiment Analysis with Automatically Constructed Lexicon and Three-Way Decision[M]∥Rough Sets and Knowledge Technology.Springer International Publishing,2014:777-788. [12]WANG L,HUANG H X,WU B,et al.Emotion analysis of text based on topics and three-way decisions [J].Computer Science,2015,42(6):93-96.(in Chinese) 王磊,黄河笑,吴兵,等.基于主题与三支决策的文本情感分析[J].计算机科学,2015,42(6):93-96. [13]ZHANG Z,WANG R.Applying Three-way Decisions to Sentiment Classification with Sentiment Uncertainty[C]∥International Conference on Rough Sets and Knowledge Technology.Springer International Publishing,2014:720-731. [14]YAO Y Y,DENG X.Sequential three-way decisions with probabilistic rough sets[C]∥IEEE International Conference on Cognitive Informatics & Cognitive Computing.IEEE,2011:120-125. [15]YAO Y Y.Three-Way Decisions and Cognitive Computing[J].Cognitive Computation,2016,8(4):543-554. [16]PAWLAK Z.Rough sets[J].International Journal of Computer &Information Sciences,1982,11(5):341-356. [17]YAO Y Y.Probabilistic approaches to rough sets[J].ExpertSystems,2003,20(5):287-297. [18]LIU D,YAO Y Y,LI T R.Three-way decisions-theoretic rough sets [J].Computer Science,2011,38(1):246-250.(in Chinese) 刘盾,姚一豫,李天瑞.三枝决策粗糙集[J].计算机科学,2011,38(1):246-250. [19]YAO Y Y.Three-way decisions with probabilistic rough sets[J].Information Sciences,2010,180(3):341-353. [20]YAO Y Y.Granular Computing and Sequential Three-Way Decisions[C]∥International Conference on Rough Sets and Knowledge Technology.Springer Berlin Heidelberg,2013:16-27. [21]WANG G Y,ZHANG Q H,HU J.An overview of granularcomputing[J].CAAI Transactions on Intelligent Systems,2007,2(6):8-26.(in Chinese) 王国胤,张清华,胡军.粒计算研究综述[J].智能系统学报,2007,2(6):8-26. [22]WU Q,TAN S.A two-stage framework for cross-domain sentiment classification[J].Expert Systems with Applications,2011,38(11):14269-14275. [23]ZHANG S,LIU H,YANG L,et al.A Cross-Domain Sentiment Classification Method Based on Extraction of Key Sentiment Sentence[M]∥Natural Language Processing and Chinese Computing.Springer International Publishing,2015:90-101. |
[1] | 秦琪琦, 张月琴, 王润泽, 张泽华. 基于知识图谱的层次粒化推荐方法 Hierarchical Granulation Recommendation Method Based on Knowledge Graph 计算机科学, 2022, 49(8): 64-69. https://doi.org/10.11896/jsjkx.210600111 |
[2] | 张源, 康乐, 宫朝辉, 张志鸿. 基于Bi-LSTM的期货市场关联交易行为检测方法 Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM 计算机科学, 2022, 49(7): 31-39. https://doi.org/10.11896/jsjkx.210400304 |
[3] | 林夕, 陈孜卓, 王中卿. 基于不平衡数据与集成学习的属性级情感分类 Aspect-level Sentiment Classification Based on Imbalanced Data and Ensemble Learning 计算机科学, 2022, 49(6A): 144-149. https://doi.org/10.11896/jsjkx.210500205 |
[4] | 杨斐斐, 沈思妤, 申德荣, 聂铁铮, 寇月. 面向数据融合的多粒度数据溯源方法 Method on Multi-granularity Data Provenance for Data Fusion 计算机科学, 2022, 49(5): 120-128. https://doi.org/10.11896/jsjkx.210300092 |
[5] | 李浩, 张兰, 杨兵, 杨海潇, 寇勇奇, 王飞, 康雁. 融合双重权重机制和图卷积神经网络的微博细粒度情感分类 Fine-grained Sentiment Classification of Chinese Microblogs Combining Dual Weight Mechanismand Graph Convolutional Neural Network 计算机科学, 2022, 49(3): 246-254. https://doi.org/10.11896/jsjkx.201200073 |
[6] | 潘志豪, 曾碧, 廖文雄, 魏鹏飞, 文松. 基于交互注意力图卷积网络的方面情感分类 Interactive Attention Graph Convolutional Networks for Aspect-based Sentiment Classification 计算机科学, 2022, 49(3): 294-300. https://doi.org/10.11896/jsjkx.210100180 |
[7] | 胡艳丽, 童谭骞, 张啸宇, 彭娟. 融入自注意力机制的深度学习情感分析方法 Self-attention-based BGRU and CNN for Sentiment Analysis 计算机科学, 2022, 49(1): 252-258. https://doi.org/10.11896/jsjkx.210600063 |
[8] | 王栋, 周大可, 黄有达, 杨欣. 基于多尺度多粒度特征的行人重识别 Multi-scale Multi-granularity Feature for Pedestrian Re-identification 计算机科学, 2021, 48(7): 238-244. https://doi.org/10.11896/jsjkx.200600043 |
[9] | 房婷, 宫傲宇, 张帆, 林艳, 贾林琼, 张一晋. 一种传输时限下认知无线电网络的动态广播策略 Dynamic Broadcasting Strategy in Cognitive Radio Networks Under Delivery Deadline 计算机科学, 2021, 48(7): 340-346. https://doi.org/10.11896/jsjkx.200900001 |
[10] | 黄梅根, 刘川, 杜欢, 刘佳乐. 基于知识图谱的认知诊断模型及其在教辅中的应用研究 Research on Cognitive Diagnosis Model Based on Knowledge Graph and Its Application in Teaching Assistant 计算机科学, 2021, 48(6A): 644-648. https://doi.org/10.11896/jsjkx.200700163 |
[11] | 李艳, 范斌, 郭劼, 林梓源, 赵曌. 基于k-原型聚类和粗糙集的属性约简方法 Attribute Reduction Method Based on k-prototypes Clustering and Rough Sets 计算机科学, 2021, 48(6A): 342-348. https://doi.org/10.11896/jsjkx.201000053 |
[12] | 霍帅, 庞春江. 基于Transformer和多通道卷积神经网络的情感分析研究 Research on Sentiment Analysis Based on Transformer and Multi-channel Convolutional Neural Network 计算机科学, 2021, 48(6A): 349-356. https://doi.org/10.11896/jsjkx.200800004 |
[13] | 王政, 姜春茂. 一种基于三支决策的云任务调度优化算法 Cloud Task Scheduling Algorithm Based on Three-way Decisions 计算机科学, 2021, 48(6A): 420-426. https://doi.org/10.11896/jsjkx.201000023 |
[14] | 吴广智, 郭斌, 丁亚三, 成家慧, 於志文. 假消息认知机理研究综述 Cognitive Mechanisms of Fake News 计算机科学, 2021, 48(6): 306-314. https://doi.org/10.11896/jsjkx.201200194 |
[15] | 吕乐宾, 刘群, 彭露, 邓维斌, 王崇宇. 结合多粒度信息的文本匹配融合模型 Text Matching Fusion Model Combining Multi-granularity Information 计算机科学, 2021, 48(6): 196-201. https://doi.org/10.11896/jsjkx.200700100 |
|