计算机科学 ›› 2024, Vol. 51 ›› Issue (4): 324-333.doi: 10.11896/jsjkx.230200195

• 人工智能 • 上一篇    下一篇

融合方面语义和网格标记的多语言意见元组抽取

古文霞1,2, 早克热·卡德尔1, 杨乾1,2, 艾山·吾买尔1,2   

  1. 1 新疆大学信息科学与工程学院 乌鲁木齐830046
    2 新疆多语种信息技术重点实验室 乌鲁木齐830046
  • 收稿日期:2023-02-25 修回日期:2023-06-30 出版日期:2024-04-15 发布日期:2024-04-10
  • 通讯作者: 艾山·吾买尔(Hasan1479@xju.edu.cn)
  • 作者简介:(xjstugreat@stu.xju.edu.cn)
  • 基金资助:
    基于深度学习的新疆旅游情感分析技术研究项目(2021D01C081)

Multilingual Opinion Factor Extraction Fusing Aspect Semantics and Grid Tagging

GU Wenxia1,2 , ZAOKERE Kadeer1, YANG Qian1,2, AISHAN Wumaier1,2   

  1. 1 School of Information Science and Engineering,Xinjiang University,Urumqi 830046,China
    2 Xinjiang Key Laboratory of Multilingual Information Technology,Xinjiang University,Urumqi 830046,China
  • Received:2023-02-25 Revised:2023-06-30 Online:2024-04-15 Published:2024-04-10
  • Supported by:
    Research on Xinjiang Tourism Sentiment Analysis Technology Based on Deep Learning(2021D01C081).

摘要: 面向方面的细粒度意见抽取(Aspect-oriented Fine-grained Opinion Extraction,AFOE)任务的目的是以意见对的形式抽取文本评论中的方面和意见词或者再抽取情感极性,形成意见三元组。以往的研究通常以管道方式抽取意见元素,容易出现错误传播的问题,而且大多数只关注方面词和意见词的单个子任务抽取,忽略了不同意见元素之间的相互影响和指示信息,导致意见挖掘任务不完整。此外,面向中文的意见元素抽取任务的研究较少。针对以上问题,文中提出了融合方面语义和网格标记的多语言意见元组抽取模型。首先,使用向内LSTM(Inward-LSTM)和向外LSTM(Outward-LSTM)编码方面词及其对应的上下文信息建立方面和候选意见词的关联,再结合全局信息生成特定方面语义特征的上下文表示,有利于提高下游意见元素抽取的性能。其次,使用网格标记方案的推理策略,利用方面和意见词之间的依赖指示信息进行更准确的抽取,以端到端的方式处理AFOE任务。相比基线模型,对于方面意见对抽取任务,改进的模型在中英文数据集上的F1值提高了0.89%~4.11%,对于三元组抽取任务提高了1.36%~3.11%,实验结果表明,改进的模型能有效地对中英文评论的意见元素进行抽取,性能显著优于基线模型。

关键词: 方面意见对抽取, 三元组抽取, 网格标记方案, 方面语义, 面向方面的细粒度意见抽取

Abstract: Aspect-oriented fine-grained opinion extraction(AFOE) aims to extract the aspect and opinion terms in the reviews in the form of opinion pairs or to extract the sentiment polarity to form opinion triplets.Previous studies usually extract opinion factors in a pipeline manner,which is prone to the problem of error propagation,most of them only focus on the single sub-task extraction of aspect terms or opinion terms,and ignore the mutually interactive and indicative information between different opinion factors,which lead to the problem that opinion excavation tasks are incomplete.In addition,the existing researches do not pay attention to the research of Chinese-oriented opinion factors extraction.To tackle the problems,this paper proposes multilingual opinion factors extraction model fusing aspect semantics and grid tagging.Firstly,inward LSTM(Inward-LSTM) and outward LSTM(Outward-LSTM) are exploited to encode aspect terms and corresponding left-right contexts to establish the association between aspect and candidate opinion terms,and then combine global context information to generate contextualized representation of specific aspect semantic features,which is beneficial to improve the performance of downstream opinion factors extraction.Secondly,the inference strategy of the grid tagging scheme is applied to decode the potential indications between aspect and opi-nion terms for more accurate extraction,the AFOE task is handled in an end-to-end manner.Compared with the baseline model,the F1 scores of the proposed model in the Chinese and English datasets increase by 0.89%~4.11% for the aspect opinion pair extraction task,and 1.36%~3.11% for the triplet extraction task.Experimental results show that the improved model can effectively extract the opinion factors of Chinese and English comments,the performance is significantly better than the baseline model.

Key words: Aspect-opinion pair extraction, Triplet extraction, Grid tagging scheme, Aspect semantics, Aspect-oriented fine-grained opinion extraction

中图分类号: 

  • TP391
[1]DAI H,SONG Y.Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:5268-5277.
[2]PENG H,XU L,BING L,et al.Knowing what,how andwhy:A near complete solution for aspect-based sentiment analysis[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020,34(5):8600-8607.
[3]LI X,BING L,LI P,et al.A unified model for opinion target extraction and target sentiment prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019,33(1):6714-6721.
[4]LI X,BING L,LI P,et al.Aspect term extraction with history attention and selective transformation[J].arXiv:1805.00760,2018.
[5]WANG W,PAN S J,DAHLMEIER D,et al.Coupled multi-la-yer attentions for co-extraction of aspect and opinion terms[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2017.
[6]WEI Z,HONG Y,ZOU B,et al.Don't eclipse your arts due to small discrepancies:Boundary repositioning with a pointer network for aspect extraction[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:3678-3684.
[7]LI K,CHEN C,QUAN X,et al.Conditional augmentation for aspect term extraction via masked sequence-to-sequence generation[J].arXiv:2004.14769,2020.
[8]KARAOĞLAN K M,FıNDıK O.Extended rule-based opinion target extraction with a novel text pre-processing method and ensemble learning[J].Applied Soft Computing,2022,118:108524.
[9]WANG K,SHEN W,YANG Y,et al.Relational Graph Atten-tion Network for Aspect-based Sentiment Analysis[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:3229-3238.
[10]DAI J,YAN H,SUN T,et al.Does syntax matter? a strong baseline for aspect-based sentiment analysis with roberta[J].arXiv:2104.04986,2021.
[11]ZHU Z,ZHANG D,LI L,et al.Knowledge-guided multi-granularity GCN for ABSA[J].Information Processing & Management,2023,60(2):103223.
[12]VEYSEH A P B,NOURI N,DERNONCOURT F,et al.Introducing Syntactic Structures into Target Opinion Word Extraction with Deep Learning[C]//Proceedings of the 2020 Confe-rence on Empirical Methods in Natural Language Processing(EMNLP).2020:8947-8956.
[13]JIANG J,WANG A,AIZAWA A.Attention-based relationalgraph convolutional network for target-oriented opinion words extraction[C]//Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics:Main Volume.2021:1986-1997.
[14]DAI Y,WANG P,ZHU X.Reasoning over multiplex heterogeneous graph for Target-oriented Opinion Words Extraction[J].Knowledge-Based Systems,2022,236:107723.
[15]DAI C,SHEN B,YAN F.A Joint Model for Target-OrientedOpinion Words Extraction and Sentiment Classification[J].Journal of Information & Knowledge Management,2023:2250097.
[16]PHAN M H,OGUNBONA P O.Modelling context and syntactical features for aspect-based sentiment analysis[C]//Procee-dings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:3211-3220.
[17]CHEN Z,QIAN T.Relation-aware collaborative learning forunified aspect-based sentiment analysis[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:3685-3694.
[18]ZHAO H,HUANG L,ZHANG R,et al.Spanmlt:A span-based multi-task learning framework for pair-wise aspect and opinion terms extraction[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:3239-3248.
[19]MAO Y,SHEN Y,YU C,et al.A joint training dual-mrc framework for aspect based sentiment analysis[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:13543-13551.
[20]KIRANGE D K,DESHMUKH R R,KIRANGE M D K.Aspect based sentiment analysis semeval-2014 task 4[J].Asian Journal of Computer Science and Information Technology(AJCSIT).2014,8(4):72-75.
[21]PONTIKI M,GALANIS D,PAPAGEORGIOU H,et al.Semeval-2015 task 12:Aspect based sentiment analysis[C]//Proceedings of the 9th International Workshop on Semantic Evaluation(SemEval 2015).2015:486-495.
[22]PONTIKI M,GALANIS D,PAPAGEORGIOU H,et al.Semeval-2016 task 5:Aspect based sentiment analysis[C]//ProWorkshop on Semantic Evaluation(SemEval-2016).Association for Computational Linguistics,2016:19-30.
[23]WU Z,YING C,ZHAO F,et al.Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction[C]//Findings of the Association for Computational Linguistics:EMNLP 2020.2020:2576-2585.
[24]CHE W,ZHAO Y,GUO H,et al.Sentence compression for aspect-based sentiment analysis[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2015,23(12):2111-2124.
[25]ZHAO Y,PAN H,DU C,et al.Principal curvature for infrared small target detection[J].Infrared Physics & Technology,2015,69:36-43.
[26]PENG H,MA Y,LI Y,et al.Learning multi-grained aspect target sequence for Chinese sentiment analysis[J].Knowledge-Based Systems,2018,148:167-176.
[27]ZHANG Y,QI P,MANNING C D.Graph Convolution overPruned Dependency Trees Improves Relation Extraction[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:2205-2215.
[28]FAN Z,WU Z,DAI X,et al.Target-oriented opinion words extraction with target-fused neural sequence labeling[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies,Volume 1(Long and Short Papers).2019:2509-2518.
[29]XU H,LIU B,SHU L,et al.Double Embeddings and CNN-based Sequence Labeling for Aspect Extraction[C]//Procee-dings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 2:Short Papers).2018:592-598.
[30]HE R,LEE W S,NG H T,et al.An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:504-515.
[31]XU L,LI H,LU W,et al.Position-Aware Tagging for Aspect Sentiment Triplet Extraction[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP).2020:2339-2349.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!