Computer Science ›› 2021, Vol. 48 ›› Issue (12): 319-323. doi: 10.11896/jsjkx.201100105

• Artificial Intelligence •

Aspect Extraction of Case Microblog Based on Double Embedded Convolutional Neural Network

WANG Xiao-han, TAN Chen-chen, XIANG Yan, YU Zheng-tao

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
    Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China
  • Received: 2020-11-13 Revised: 2021-04-16 Published: 2021-12-15 Online: 2021-11-26
  • Corresponding author: XIANG Yan (50691012@qq.com)
  • Author contact: 1097942784@qq.com
  • About author: WANG Xiao-han, born in 1995, master. Her main research interests include natural language processing and emotion analysis.
    XIANG Yan, born in 1979, Ph.D. Her main research interests include natural language processing, text mining and emotion analysis.
  • Supported by:
    National Key Research and Development Program of China (2018YFC0830105, 2018YFC0830101, 2018YFC0830100), General Projects of Basic Research in Yunnan Province (202001AT070047, 202001AT070046), National Natural Science Foundation of China (61762056, 61972186) and Special Project of New and High-tech Industry in Yunnan Province (201606).

Abstract: Aspect extraction from case-related microblogs is a domain-specific task. The aspect words are expressed in diverse ways and carry meanings that differ from those in the general domain, so word embeddings from the general domain alone cannot represent them well. This paper therefore proposes an aspect extraction method for case-related microblogs that uses both domain word embeddings and general word embeddings. First, pre-training is performed on case-related microblog text to obtain an embedding layer that carries the characteristics of the case domain. Second, the microblog comments are fed into the two embedding layers, and the resulting representations of the aspect words from the two domains are concatenated. Then, a convolutional layer extracts the case-related features. Finally, a classifier labels the sequence to extract the aspect words of the case-related microblogs. Experimental results show that the F1 value of the proposed method reaches 72.36% and 71.02% on the #Chongqing bus falling into the river# and #Mercedes Benz female driver rights protection# datasets, respectively, an improvement over the existing baseline models, which verifies the influence of word embeddings from different domains on aspect extraction from case-related microblogs.

Key words: Aspect extraction, Convolutional neural network, Double embedding, Microblog
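
The pipeline described in the abstract (two embedding layers, concatenation, a convolutional layer, and a classifier that labels the sequence) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the single convolutional layer, the kernel width, the ReLU activation, the softmax classifier, the B/I/O label set, all dimensions, and the random weights are assumptions chosen only to make the data flow concrete.

```python
import numpy as np

def double_embedding_cnn_tagger(token_ids, general_emb, domain_emb,
                                conv_w, conv_b, out_w, out_b):
    """Forward pass of a toy double-embedding CNN sequence tagger.

    token_ids:   (T,) int array of vocabulary indices, T >= 1
    general_emb: (V, Dg) general-domain embedding table
    domain_emb:  (V, Dd) case-domain embedding table
    conv_w:      (K, Dg + Dd, H) 1-D convolution kernel, K odd
    conv_b:      (H,) convolution bias
    out_w:       (H, C) per-token classifier weights
    out_b:       (C,) classifier bias
    Returns (T,) predicted label indices and (T, C) label probabilities.
    """
    # Look up every token in both tables and concatenate the two vectors.
    x = np.concatenate([general_emb[token_ids], domain_emb[token_ids]], axis=1)

    # Zero-pad along the time axis so the convolution keeps length T.
    k = conv_w.shape[0]
    pad = k // 2
    x_pad = np.pad(x, ((pad, pad), (0, 0)))

    # Slide a width-k window over the sequence; each window yields H features.
    n_tokens = len(token_ids)
    h = np.stack([
        np.tensordot(x_pad[t:t + k], conv_w, axes=([0, 1], [0, 1]))
        for t in range(n_tokens)
    ]) + conv_b
    h = np.maximum(h, 0.0)  # ReLU (assumed activation)

    # Per-token softmax classifier over the label set.
    logits = h @ out_w + out_b
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)
    return probs.argmax(axis=1), probs


# Hypothetical sizes and randomly initialised parameters, for illustration only.
rng = np.random.default_rng(0)
V, Dg, Dd, K, H = 50, 8, 4, 3, 16
LABELS = ["B", "I", "O"]  # assumed tagging scheme

general_emb = rng.normal(size=(V, Dg))
domain_emb = rng.normal(size=(V, Dd))
conv_w = rng.normal(size=(K, Dg + Dd, H)) * 0.1
conv_b = np.zeros(H)
out_w = rng.normal(size=(H, len(LABELS))) * 0.1
out_b = np.zeros(len(LABELS))

tokens = np.array([3, 17, 42, 8, 25])
pred, probs = double_embedding_cnn_tagger(
    tokens, general_emb, domain_emb, conv_w, conv_b, out_w, out_b)
print([LABELS[i] for i in pred])
```

With untrained random weights the printed tags are meaningless; the sketch only shows how each token receives one vector from each embedding table before the convolution and the per-token classification.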

CLC Number:

  • TP311