Selective Expression Approach Based on Event Trigger for Event Coreference Resolution on Twitter

doi:10.11896/j.issn.1002-137X.2018.12.020

Abstract

With the spread of social media, identifying coreference relations between event mentions in short texts has become a pressing problem. Traditional approaches to event coreference resolution rely on rich semantic features derived from NLP tools and knowledge bases, which limits their portability across domains and propagates errors from those upstream tools. To overcome these limitations, this paper proposes a selective expression approach that uses the event trigger to selectively express sentence semantics when judging whether two event mentions in short texts corefer. First, a bidirectional long short-term memory network (Bi-LSTM) extracts semantic features at the sentence level and at the event-mention level. Second, a trigger-based selection gate is applied to the sentence-level features so that they are selectively expressed, producing latent semantic features. Third, two auxiliary features are designed: the number of overlapping words between triggers and the time interval. Finally, all of these features are fused and fed into a classifier that predicts the coreference relation. To evaluate the approach, a new dataset, EventCoreOnTweets (ECT), was annotated from Twitter data. Experimental results show that, compared with two baseline models, the selective expression model significantly improves coreference resolution performance on short texts.

Key words: Short text, Neural networks, Event coreference resolution, Bidirectional long short-term memory
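
The two auxiliary features and the trigger-based gate named in the abstract can be illustrated with a minimal sketch. The abstract does not give the gate equation, the tokenization, or the unit of the time interval, so everything below is an assumption made for illustration: the function names are hypothetical, the gate is taken to be a per-dimension sigmoid of the sentence state and the trigger state, the overlap is counted over lower-cased whitespace tokens, and the interval is measured in hours. Plain Python lists stand in for the Bi-LSTM hidden states.

```python
import math
from datetime import datetime


def trigger_overlap(trigger_a, trigger_b):
    """Count distinct lower-cased words shared by two trigger phrases.

    Whitespace tokenization is an assumption of this sketch.
    """
    words_a = set(trigger_a.lower().split())
    words_b = set(trigger_b.lower().split())
    return len(words_a & words_b)


def time_interval_hours(posted_a, posted_b):
    """Absolute gap between two posting times, in hours (assumed unit)."""
    return abs((posted_a - posted_b).total_seconds()) / 3600.0


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def selective_gate(sentence_states, trigger_state, w_h, w_t, bias):
    """Scale each sentence-level state by a trigger-conditioned gate.

    Assumed form, per dimension d:
        g[d]  = sigmoid(w_h[d] * h[d] + w_t[d] * t[d] + bias[d])
        h'[d] = g[d] * h[d]
    Diagonal weights keep the sketch short; a trained model would
    more likely use full weight matrices.
    """
    gated = []
    for h in sentence_states:
        gate = [
            sigmoid(wh * hd + wt * td + b)
            for wh, hd, wt, td, b in zip(w_h, h, w_t, trigger_state, bias)
        ]
        gated.append([g * hd for g, hd in zip(gate, h)])
    return gated


# Usage with toy values (stand-ins for real tweets and Bi-LSTM outputs).
overlap = trigger_overlap("shot dead", "shot")
gap = time_interval_hours(datetime(2018, 1, 1, 12, 0), datetime(2018, 1, 1, 9, 30))
latent = selective_gate(
    [[2.0, -4.0]], [1.0, 1.0], [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]
)
# Concatenate the gated states with the two auxiliary features,
# as a classifier input vector would be assembled.
features = latent[0] + [float(overlap), gap]
```

With all gate weights set to zero the gate is 0.5 in every dimension, so the sentence state is simply halved; learned weights would instead let the trigger decide which dimensions pass through.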

CLC number: TP391

WEI Ping, CHAO Wen-han, LUO Zhun-chen, LI Zhou-jun. Selective Expression Approach Based on Event Trigger for Event Coreference Resolution on Twitter[J]. Computer Science, 2018, 45(12): 130-136. https://doi.org/10.11896/j.issn.1002-137X.2018.12.020

