计算机科学 ›› 2018, Vol. 45 ›› Issue (9): 243-247.doi: 10.11896/j.issn.1002-137X.2018.09.040
周艳芳, 周刚, 鹿忠磊
ZHOU Yan-fang, ZHOU Gang, LU Zhong-lei
摘要: 立场分析旨在发现用户对特定目标对象所持的观点态度。针对现有方法往往难以克服标注数据匮乏及微博文本中大量未登录词等导致的分词误差的问题,提出了基于迁移学习及字、词特征混合的立场分析方法。首先,将字、词特征输入深度神经网络,级联两者隐藏层输出,复现由分词错误引起的缺失语义信息;然后,利用与立场相关话题的辅助数据训练话题分类模型(父模型),得到更为有效的句子特征表示;接着,以父模型参数初始化立场分析模型(子模型),从辅助数据(话题分类数据)迁移知识能加强句子的语义表示能力;最后,使用有标注数据微调子模型参数并训练分类器。在NLPCC-2016任务4的语料上进行实验,F1值达72.2%,优于参赛团队的最佳成绩。实验结果表明,该方法可提高立场分类性能,同时缓解分词误差带来的影响。
中图分类号:
[1]THOMAS M,PANG B,LEE L.Get out the vote:determining support or opposition from congressional floor-debate transcripts[C]∥Conference on Empirical Methods in Natural Language Processing.Association for Computational Linguistics,2006:327-335. [2]SOMASUNDARAN S,WIEBE J.Recognizing Stances in Online Debates[C]∥Joint Conference of the 47th Annual Metting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP.2009:116-124. [3]MURAKAMI A,RAYMOND R.Support or oppose?:classif-ying positions in online debates from reply activities and opinion expressions[C]∥International Conference on Computational Linguistics:Posters.Association for Computational Linguistics,2010:869-875. [4]WALKER M A,ANAND P,ABBOTT R,et al.Stance classification using dialogic properties of persuasion[C]∥Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2013:592-596. [5]RAJADESINGAN A,LIU H.Identifying Users with Opposing Opinions in Twitter Debates[M]∥Social Computing,Behavio-ral-Cultural Modeling and Prediction.Springer International Publishing,2016:153-160. [6]MIKOLOV T,LE Q V,SUTSKEVER I.Exploiting Similarities among Languages for Machine Translation[J].arXiv preprint arXiv:1309.4168.2013. [7]PAN S J,YANG Q.A survey on transfer learning[J].IEEE Transactions on Data Engineering,2010,22(10):1345-1359. [8]SCHÖLKOPF B,PLATT J,HOFMANN T.Analysis of Representations for Domain Adaptation[C]∥International Conference on Neural Information Processing Systems.MIT Press,2006:137-144. [9]ERHAN D,BENGIO Y,COURVILLE A C,et al.Why Does Unsupervised Pre-training Help Deep Learning?[J].Journal of Machine Learning Research,2010,11(3):625-660. [10]GRAVES A.Supervised Sequence Labelling with Recurrent Neural Networks[OL].http://mediatum.ub.tum.de/doc/673554/file.pdf. [11]BENGIO Y,SIMARD P,FRASCONI P.Learning long-term dependencies with gradient descent is difficult[J].IEEE Transactions on Neural Networks,2002,5(2):157-166. [12]HOCHREITER S,SCHMIDHUBER J.Long Short-Term Me-mory[J].Neural Computation,1997,9(8):1735. [13]SCHUSTER M,PALIWAL K K.Bidirectional re-current neural networks[J].IEEE Transactions on Signal Processing,1997,45(11):2673-2681. [14]GRAVES A,JAITLY N,MOHAMED A R.Hybrid speech re-cognition with Deep Bidirectional LSTM[C]∥Automatic Speech Recognition and Understanding.IEEE,2014:273-278. [15]BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate[C]∥3rd International Conference on Learning Representation.2015. [16]YOSINSKI J,CLUNE J,BENGIO Y,et al.How transferable are features in deep neural networks?[J].EprintArxiv,2014,27:3320-3328. [17]KIROS R,ZHU Y,SALAKHUTDINOV R,et al.Skip-Thought Vectors[OL].http://www.cs.toronto.edu/~zemel/documents/skipThought.pdf. [18]DAI A M,LE Q V.Semi-supervised Sequence Learning[C]∥International Conference on Neural Internation Processing Systems.2015:3079-3087. [19]HILL F,CHO K,KORHONEN A.Learning Distributed Representations of Sentences from Unlabelled Data[C]∥Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:1367-1377. [20]WESTON J,CHOPRA S,ADAMS K.#TagSpace:Semantic Embeddings from Hashtags[C]∥Conference on Empirical Methods in Natural Language Processing.2014:1822-1827. [21]YANG W,SONG J J,TANG J Q.A Study on the Classification Approach for Chinese MicroBlog Subjective and Objective Sentences[J].Journal of Chongqing Institute of Technology,2013,27(1):51-56.(in Chinses) 杨武,宋静静,唐继强.中文微博情感分析中主客观句分类方法[J].重庆理工大学学报(自然科学),2013,27(1):51-56. |
[1] | 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺. 时序知识图谱表示学习 Temporal Knowledge Graph Representation Learning 计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204 |
[2] | 饶志双, 贾真, 张凡, 李天瑞. 基于Key-Value关联记忆网络的知识图谱问答方法 Key-Value Relational Memory Networks for Question Answering over Knowledge Graph 计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277 |
[3] | 汤凌韬, 王迪, 张鲁飞, 刘盛云. 基于安全多方计算和差分隐私的联邦学习方案 Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy 计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108 |
[4] | 王剑, 彭雨琦, 赵宇斐, 杨健. 基于深度学习的社交网络舆情信息抽取方法综述 Survey of Social Network Public Opinion Information Extraction Based on Deep Learning 计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099 |
[5] | 郝志荣, 陈龙, 黄嘉成. 面向文本分类的类别区分式通用对抗攻击方法 Class Discriminative Universal Adversarial Attack for Text Classification 计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077 |
[6] | 姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046 |
[7] | 方义秋, 张震坤, 葛君伟. 基于自注意力机制和迁移学习的跨领域推荐算法 Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning 计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011 |
[8] | 孙奇, 吉根林, 张杰. 基于非局部注意力生成对抗网络的视频异常事件检测方法 Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection 计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061 |
[9] | 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木. 中文预训练模型研究进展 Advances in Chinese Pre-training Models 计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018 |
[10] | 周慧, 施皓晨, 屠要峰, 黄圣君. 基于主动采样的深度鲁棒神经网络学习 Robust Deep Neural Network Learning Based on Active Sampling 计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044 |
[11] | 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫. 小样本雷达辐射源识别的深度学习方法综述 Survey of Deep Learning for Radar Emitter Identification Based on Small Sample 计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138 |
[12] | 胡艳羽, 赵龙, 董祥军. 一种用于癌症分类的两阶段深度特征选择提取算法 Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification 计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092 |
[13] | 程成, 降爱莲. 基于多路径特征提取的实时语义分割方法 Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction 计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157 |
[14] | 谢柏林, 黎琦, 邝建. 基于隐半马尔可夫模型的微博流行信息检测方法 Microblog Popular Information Detection Based on Hidden Semi-Markov Model 计算机科学, 2022, 49(6A): 291-296. https://doi.org/10.11896/jsjkx.210800011 |
[15] | 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰. 基于多源迁移学习的大坝裂缝检测 Dam Crack Detection Based on Multi-source Transfer Learning 计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124 |
|