计算机科学 ›› 2020, Vol. 47 ›› Issue (8): 164-170.doi: 10.11896/jsjkx.190600153

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于依赖联系分析的观点词对协同抽取

赵威1, 2, 林煜明1, 王超强1, 蔡国永1   

  1. 1 桂林电子科技大学广西可信软件重点实验室 广西 桂林 541004
    2 华东师范大学数据科学与工程学院 上海 200062
  • 出版日期:2020-08-15 发布日期:2020-08-10
  • 通讯作者: 林煜明(ymlin@guet.edu.cn)
  • 作者简介:331205121@qq.com
  • 基金资助:
    广西自然科学基金(2018GXNSFDA281049);国家自然科学基金(61662015, U1711263);广西创新驱动发展专项资金项目(桂科AA19046004);桂林电子科技大学研究生教育创新计划资助项目(2018YJCX48);广西可信软件重点实验室研究课题(kx201916)

Opinion Word-pairs Collaborative Extraction Based on Dependency Relation Analysis

ZHAO Wei1, 2, LIN Yu-ming1, WANG Chao-qiang1, CAI Guo-yong1   

  1. 1 Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
    2 School of Data Science & Engineering, East China Normal University, Shanghai 200062, China
  • Online:2020-08-15 Published:2020-08-10
  • About author:ZHAO Wei, born in 1995, postgraduate.His main research interests include opinion mining and so on.
    LIN Yu-ming, born in 1978, Ph.D, professor.His main research interests include opinion mining, knowledge graph, and massive data management.
  • Supported by:
    This work was supported by the Guangxi Natural Science Foundation (2018GXNSFDA281049), National Natural Science Foundation of China (61662015, U1711263), Science and Technology Major Project of Guangxi Province (AA19046004) and Innovation Project of Guet Graduate Education (2018YJCX48) and Project of Guangxi Key Laboratory of Trusted Software(kx201916).

摘要: 同一类商品下, 观点词对中包含的观点目标和观点词通常有着很强的观点依赖联系, 因此可以通过对评论句子中单词间的观点依赖联系进行分析来提取观点词对。首先, 构建评论句子的依赖联系分析模型来获取评论句子中每个单词之间的依赖联系信息, 文中选择的基本模型是LSTM神经网络;然后, 假设评论句子中所包含的观点词对中的一项是已知的, 并将该已知项作为模型的注意力信息, 使得模型能够从评论句子中有重点地提取出与该已知项具有强观点依赖联系的单词或词组, 并将其作为观点词对中的另一未知项;最后, 将观点依赖联系得分最高的词对作为观点词对并输出。文中进一步设计了一种复合模型, 通过结合两种包含不同已知项信息的上述模型, 来实现在不需要提前知道已知项的情况下观点词对的挖掘。

关键词: 观点词对, 观点依赖联系分析, 神经网络, 注意力机制

Abstract: In the same category of commodities, opinion word-pairs usually have strong opinion dependence relation to the opinion targets and the opinion words contained in them.Therefore, in the extraction process of opinion word-pairs, they can be extracted by analyzing the opinion dependence relations among the words in the review sentences.Firstly, a dependency relation analysis model is constructed to obtain the dependency relation information of each word in a review sentence, and the basic model is defined as LSTM neural network.Secondly, it is assumed that one of the item that opinion word-pairs contained in review sentence is known, and the known item is used as the model’s attention information, so that the model can focus on extracting the words of phrases associated with the known item with strong opinion dependence from the review sentence as another unknown item in the opinion word-pairs.Finally, the word-pairs with the highest score of the opinion dependence relation are output as the opinion word-pairs.Then a compound model is designed to realize the mining of opinion word pairs without knowing the known items in advance by combining the two models which contain the information of different known items in the opinion word-pairs.

Key words: Attention mechanism, Neural network, Opinion dependency relation analysis, Opinion pair

中图分类号: 

  • TP391
[1]DING X, LIU B, YU P S.A holistic lexicon-based approach to opinion mining[C]∥International Conference on Web Search & Data Mining.2008:231-240.
[2]WU Y, ZHANG Q, HUANG X, et al.Phrase Dependency Parsing for Opinion Mining[C]∥Proceedings of the 2009 Confe-rence on Empirical Methods in Natural Language Processing.2009:1533-1541.
[3]XU L, LIU K, LAI S, et al.Mining Opinion Words and Opinion Targets in a Two-Stage Framework[C]∥Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics.2013:1764-1773.
[4]WU C, WU F, WU S, et al.A Hybrid Unsupervised Method for Aspect Term and Opinion Target Extraction[J].Knowledge-Based Systems, 2018:148:66-73.
[5]DANILO E, DENILSON G, IVES P, et al.Analysis of Document Pre-Processing Effects in Text and Opinion Mining[J].Information, 2018, 9(4):100.
[6]AGERRI R, RIGAU G.Language Independent Sequence Labelling for Opinion Target Extraction[J].Artificial Intelligence, 2019, 268:85-95.
[7]DING X, LIU B.Resolving Object and Attribute Coreference in Opinion Mining[C]∥The 23rd, International Conference on Computational Linguistics Proceedings of the Main Conference.2010:268-276.
[8]ALMEIDA M S C, PINTO C, et al.Martins.Aligning Opinions:Cross-Lingual Opinion Mining with Dependencies[C]∥Meeting of the Association for Computational Linguistics & Internatio-nal Joint Conference on Natural Language Processing.2015:408-418.
[9]NGUYEN Thi Thanh Thuy, NGO Xuan Bach, et al.Cross-Language Aspect Extraction for Opinion Mining[C]∥10th International Conference on Knowledge and Systems Engineering.2018:67-72.
[10]TU W, CHEUNG D, MAMOULIS N.Time-sensitive opinionmining for prediction[J].Association for the Advancement of Artificial Intelligence, 2015:4214-4215.
[11]ZHANG L, LIM S H, LIU B.Extracting and Ranking Product Features in Opinion Documents[C]∥International Conference on Computational Linguistics.2010:1462-1470.
[12]HAI Z, CHANG K, CONG G.One seed to find them all:mining opinion features via association[C]∥Proceedings of the 21st ACM International Conference on Information and Knowledge Management.2012:255-264.
[13]QIU G, LIU B, BU J, et al.Opinion Word Expansion and Target Extraction through Double Propagation[J].Computational Linguistics, 2011, 37(1):9-27.
[14]BISHAN YANG C C.Joint Inference for Fine-grained OpinionExtraction[C]∥Meeting of the Association for Computational Linguistics.2013:1640-1649.
[15]MOJICA L G, NG V.Fine-Grained Opinion Extraction with
Markov Logic Networks[C]∥2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).2015:271-276.
[16]JIANG X, LIN Y, LI Y, et al.Collective Extraction for Opinion Targets and Opinion Words from Online Reviews[C]∥7th International Conference on Cloud Computing and Big Data.2016:367-373.
[17]LIN Y, JIANG X, et al.Semi-supervised collective extraction of opinion target and opinion word from online reviews based on active labeling[J].Journal of Intelligent and Fuzzy Systems, 2017, 33(6):3949-3958.
[18]WANG H, ZHANG C, YIN H, et al.A Unified Framework for Fine-Grained Opinion Mining from Online Reviews[C]∥49th Hawaii International Conference on System Sciences (HICSS).2016:1134-1143.
[19]LADDHA A, MUKHERJEE A.Aspect Specific Opinion Ex-pression Extraction using Attention based LSTM-CRF Network[J].CoRR abs.2019:1902.02709.
[1] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[2] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[5] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[6] 宁晗阳, 马苗, 杨波, 刘士昌.
密码学智能化研究进展与分析
Research Progress and Analysis on Intelligent Cryptology
计算机科学, 2022, 49(9): 288-296. https://doi.org/10.11896/jsjkx.220300053
[7] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[8] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[9] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[10] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[11] 王润安, 邹兆年.
基于物理操作级模型的查询执行时间预测方法
Query Performance Prediction Based on Physical Operation-level Models
计算机科学, 2022, 49(8): 49-55. https://doi.org/10.11896/jsjkx.210700074
[12] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[13] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[14] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[15] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!