Computer Science, 2019, Vol. 46, Issue (8): 277-281. doi: 10.11896/j.issn.1002-137X.2019.08.046

• Artificial Intelligence •

Employing Multi-attention Mechanism to Resolve Event Coreference

FANG Jie, LI Pei-feng, ZHU Qiao-ming   

  1. (School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China)
    (Province Key Laboratory of Computer Information Processing Technology of Jiangsu, Suzhou, Jiangsu 215006, China)
  • Received: 2018-07-29 Online: 2019-08-15 Published: 2019-08-15

Abstract: Event coreference resolution is a significant subtask of information extraction and plays an important role in information fusion, QA systems, and reading comprehension. This paper introduces a multi-attention-based CNN, called CorefNet, to resolve document-level event coreference. CorefNet uses a deep CNN to extract event features and a multi-attention mechanism to capture the important ones. Compared with most previous studies, which use probability-based or graph-based models, the proposed model uses only a few features. Compared with current mainstream neural network models, this method can extract deep event features and significantly improves the performance of event coreference resolution. Experimental results on the ACE2005 corpus show that the model achieves state-of-the-art results.

Key words: Event coreference, Document-level, Attention mechanism, Deep CNN

CLC Number: 

  • TP391
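
The abstract describes CorefNet only at a high level: a CNN extracts event features and a multi-attention mechanism weights the important ones. The sketch below illustrates that general pattern in plain Python and is not the authors' implementation. The function names (`conv_features`, `attention_pool`, `multi_attention`), the use of scaled dot-product scoring, and the reading of "multi-attention" as several independent query vectors whose pooled outputs are concatenated are all assumptions made for illustration.

```python
import math


def conv_features(seq, filters):
    """Slide each filter over a sequence of word vectors (valid convolution + ReLU).

    seq:     list of T vectors, each of dimension d
    filters: list of filters, each a list of k vectors of dimension d
    Returns one feature vector per window position; component j comes from filter j.
    """
    k = len(filters[0])
    feats = []
    for t in range(len(seq) - k + 1):
        window = seq[t:t + k]
        row = []
        for kernel in filters:
            s = sum(x * w
                    for vec, wvec in zip(window, kernel)
                    for x, w in zip(vec, wvec))
            row.append(max(0.0, s))  # ReLU
        feats.append(row)
    return feats


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, query):
    """Scaled dot-product attention (assumed scoring function).

    Returns the attention-weighted sum of the feature vectors and the weights.
    """
    scale = math.sqrt(len(query))
    scores = [sum(f * q for f, q in zip(feat, query)) / scale
              for feat in features]
    weights = softmax(scores)
    dim = len(features[0])
    pooled = [sum(w * feat[i] for w, feat in zip(weights, features))
              for i in range(dim)]
    return pooled, weights


def multi_attention(features, queries):
    """Hypothetical 'multi-attention': pool once per query and concatenate."""
    pooled = []
    for q in queries:
        vec, _ = attention_pool(features, q)
        pooled.extend(vec)
    return pooled


# Toy usage: 4 word vectors of dimension 2, two width-2 filters, two queries.
seq = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
filters = [[[1.0, 0.0], [0.0, 1.0]], [[-1.0, 0.0], [0.0, -1.0]]]
feats = conv_features(seq, filters)
event_repr = multi_attention(feats, [[1.0, 0.0], [0.0, 0.0]])
```

In a pairwise coreference setting, a representation like `event_repr` would be computed for each of two event mentions and the pair passed to a classifier; that step is omitted here because the abstract does not specify it.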