计算机科学 ›› 2018, Vol. 45 ›› Issue (9): 243-247.doi: 10.11896/j.issn.1002-137X.2018.09.040

• 人工智能 • 上一篇    下一篇

一种基于迁移学习及多表征的微博立场分析方法

周艳芳, 周刚, 鹿忠磊   

  1. 数学工程与先进计算国家重点实验室 郑州450001
  • 收稿日期:2017-08-10 出版日期:2018-09-20 发布日期:2018-10-10
  • 通讯作者: 周 刚(1974-),男,博士,副教授,硕士生导师,主要研究方向为在线社会网络分析、海量信息处理,E-mail:gzhougzhou@126.com
  • 作者简介:周艳芳(1993-),女,硕士生,主要研究方向为自然语言处理、情感分析,E-mail:lzl_xd6j@163.com;鹿忠磊(1988-),男,硕士生,主要研究方向为自然语言处理。
  • 基金资助:
    本文受数学工程与先进计算国家重点实验室开放基金资助项目(2015A11)资助。

Approach of Stance Detection in Micro-blog Based on Transfer Learning and Multi-representation

ZHOU Yan-fang, ZHOU Gang, LU Zhong-lei   

  1. State Key Laboratory of Mathematical Engineering and Advanced Computing,Zhengzhou 450001,China
  • Received:2017-08-10 Online:2018-09-20 Published:2018-10-10

摘要: 立场分析旨在发现用户对特定目标对象所持的观点态度。针对现有方法往往难以克服标注数据匮乏及微博文本中大量未登录词等导致的分词误差的问题,提出了基于迁移学习及字、词特征混合的立场分析方法。首先,将字、词特征输入深度神经网络,级联两者隐藏层输出,复现由分词错误引起的缺失语义信息;然后,利用与立场相关话题的辅助数据训练话题分类模型(父模型),得到更为有效的句子特征表示;接着,以父模型参数初始化立场分析模型(子模型),从辅助数据(话题分类数据)迁移知识能加强句子的语义表示能力;最后,使用有标注数据微调子模型参数并训练分类器。在NLPCC-2016任务4的语料上进行实验,F1值达72.2%,优于参赛团队的最佳成绩。实验结果表明,该方法可提高立场分类性能,同时缓解分词误差带来的影响。

关键词: 立场分析, 迁移学习, 深度学习, 微博

Abstract: Stance detection aims to identify users’ opinion towards a particular target.Aiming at the problem that exi-sting methods are often difficult to overcome the lack of labeled data and the error caused by word segmentation of Chinese text,this paper presented a transfer learning method and a hybrid model of character-level and word-level features.Firstly,character-level and the word-level features are inputted to deep neural network and the outputs of both are concatenated to reproduce the missing semantic information caused by word segmentation.Then,a topic classification model(parent model) is trained with a large external micro-blog data to obtain the effective sentence feature representation.Next,some of parent model’s parametersare used to initialize stance detection model and the knowledge transferred from auxiliary data can be used to enhance semantic representation ability of sentences.Finally,the labeled data are used to fine tune the child model andtrain classifiers.Experiment on NLPCC-2016 Task 4 proves that F1 value of proposed method achieves 72.2%,which is better than the best one of participating teams.The results show that this approach can improve the stance detection performance and alleviate the influence caused by word segmentation.

Key words: Deep learning, Micro-blog, Stance detection, Transfer learning

中图分类号: 

  • TP391
[1]THOMAS M,PANG B,LEE L.Get out the vote:determining support or opposition from congressional floor-debate transcripts[C]∥Conference on Empirical Methods in Natural Language Processing.Association for Computational Linguistics,2006:327-335.
[2]SOMASUNDARAN S,WIEBE J.Recognizing Stances in Online Debates[C]∥Joint Conference of the 47th Annual Metting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP.2009:116-124.
[3]MURAKAMI A,RAYMOND R.Support or oppose?:classif-ying positions in online debates from reply activities and opinion expressions[C]∥International Conference on Computational Linguistics:Posters.Association for Computational Linguistics,2010:869-875.
[4]WALKER M A,ANAND P,ABBOTT R,et al.Stance classification using dialogic properties of persuasion[C]∥Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2013:592-596.
[5]RAJADESINGAN A,LIU H.Identifying Users with Opposing Opinions in Twitter Debates[M]∥Social Computing,Behavio-ral-Cultural Modeling and Prediction.Springer International Publishing,2016:153-160.
[6]MIKOLOV T,LE Q V,SUTSKEVER I.Exploiting Similarities among Languages for Machine Translation[J].arXiv preprint arXiv:1309.4168.2013.
[7]PAN S J,YANG Q.A survey on transfer learning[J].IEEE Transactions on Data Engineering,2010,22(10):1345-1359.
[8]SCHÖLKOPF B,PLATT J,HOFMANN T.Analysis of Representations for Domain Adaptation[C]∥International Conference on Neural Information Processing Systems.MIT Press,2006:137-144.
[9]ERHAN D,BENGIO Y,COURVILLE A C,et al.Why Does Unsupervised Pre-training Help Deep Learning?[J].Journal of Machine Learning Research,2010,11(3):625-660.
[10]GRAVES A.Supervised Sequence Labelling with Recurrent
Neural Networks[OL].http://mediatum.ub.tum.de/doc/673554/file.pdf.
[11]BENGIO Y,SIMARD P,FRASCONI P.Learning long-term dependencies with gradient descent is difficult[J].IEEE Transactions on Neural Networks,2002,5(2):157-166.
[12]HOCHREITER S,SCHMIDHUBER J.Long Short-Term Me-mory[J].Neural Computation,1997,9(8):1735.
[13]SCHUSTER M,PALIWAL K K.Bidirectional re-current neural networks[J].IEEE Transactions on Signal Processing,1997,45(11):2673-2681.
[14]GRAVES A,JAITLY N,MOHAMED A R.Hybrid speech re-cognition with Deep Bidirectional LSTM[C]∥Automatic Speech Recognition and Understanding.IEEE,2014:273-278.
[15]BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate[C]∥3rd International Conference on Learning Representation.2015.
[16]YOSINSKI J,CLUNE J,BENGIO Y,et al.How transferable are features in deep neural networks?[J].EprintArxiv,2014,27:3320-3328.
[17]KIROS R,ZHU Y,SALAKHUTDINOV R,et al.Skip-Thought Vectors[OL].http://www.cs.toronto.edu/~zemel/documents/skipThought.pdf.
[18]DAI A M,LE Q V.Semi-supervised Sequence Learning[C]∥International Conference on Neural Internation Processing Systems.2015:3079-3087.
[19]HILL F,CHO K,KORHONEN A.Learning Distributed Representations of Sentences from Unlabelled Data[C]∥Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:1367-1377.
[20]WESTON J,CHOPRA S,ADAMS K.#TagSpace:Semantic
Embeddings from Hashtags[C]∥Conference on Empirical Methods in Natural Language Processing.2014:1822-1827.
[21]YANG W,SONG J J,TANG J Q.A Study on the Classification Approach for Chinese MicroBlog Subjective and Objective Sentences[J].Journal of Chongqing Institute of Technology,2013,27(1):51-56.(in Chinses)
杨武,宋静静,唐继强.中文微博情感分析中主客观句分类方法[J].重庆理工大学学报(自然科学),2013,27(1):51-56.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[3] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 方义秋, 张震坤, 葛君伟.
基于自注意力机制和迁移学习的跨领域推荐算法
Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning
计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[10] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[11] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[12] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[13] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[14] 谢柏林, 黎琦, 邝建.
基于隐半马尔可夫模型的微博流行信息检测方法
Microblog Popular Information Detection Based on Hidden Semi-Markov Model
计算机科学, 2022, 49(6A): 291-296. https://doi.org/10.11896/jsjkx.210800011
[15] 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰.
基于多源迁移学习的大坝裂缝检测
Dam Crack Detection Based on Multi-source Transfer Learning
计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!