Computer Science ›› 2021, Vol. 48 ›› Issue (5): 202-208.doi: 10.11896/jsjkx.200800038

Special Issue: Natural Language Processing

• Artificial Intelligence • Previous Articles     Next Articles

Chinese Event Detection with Hierarchical and Multi-granularity Semantic Fusion

DING Ling, XIANG Yang   

  1. School of Electronics and Information Engineering,Tongji University,Shanghai 201804,China
  • Received:2020-08-10 Revised:2020-09-15 Online:2021-05-15 Published:2021-05-09
  • About author:DING Ling,born in 1995,doctoral student,is a member of China Compu-ter Federation.Her main research in-terests include natural language proces-sing,information extraction and event extraction.(dling@tongji.edu.cn)
    XIANG Yang,born in 1962,Ph.D,professor,is a member of China Computer Federation.His main research interests include machine learning,data mining and natural language processing.
  • Supported by:
    National Basic Research Program of China(2019YFB1704402).

Abstract: Event detection is an important task in information extraction field,which aims to identify trigger words in raw text and then classify them into correct event types.Neural network based methods usually regard event detection as a word-wise classification task,which suffers from the mismatch problem between words and triggers when applied to Chinese.Besides,due to the multiple word senses of a trigger word,the same trigger word in different sentences causes the ambiguity problem.To address the two problems in Chinese event detection,we propose a Chinese event detection model with hierarchical and multi-granularity semantic fusion.First,we adopt a character-based sequence labelling method to solve the mismatch problem,in which we devise a Character-Word Fusion Gate to capture the semantic information of words in different segmentation ways.Then we device a Character-Sentence Fusion Gate to learn a character-word-sentence hybrid representation of sequence,which takes the semantic information of the entire sentence into condition and solves the ambiguity problem.Finally,in order to balance the influence the label “O” and the other labels,a loss function with bias is applied to train our model.The experimental results on the widely used ACE2005 dataset show that our approach outperforms at least 3.9%,1.4% and 2.9% than other Chinese event detection models under the metrics of accuracy (Precision,P),recall (Recall,R) and F1.

Key words: Bidirectional long short-term memory model, Chinese event detection, Con-volutional neural network, Information extraction, Multi-granularity semantic fusion, Pre-trained language model

CLC Number: 

  • TP182
[1]AHN D.The stages of event extraction[C]//Proceedings of the Workshop on Annotating and Reasoning about Time and Events.2006:1-8.
[2]YANG H,CHEN Y,LIU K,et al.DCFEE:A document-levelChinese financial event extraction system based on automatically labeled training data[C]//Proceedings of ACL 2018,System Demonstrations.2018:50-55.
[3]PATWARDHAN S,RILOFF E.A unified model of phrasal and sentential evidence for information extraction[C]//Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing.2009:151-160.
[4]LIAO S,GRISHMAN R.Using document level cross-event inference to improve event extraction[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics.2010:789-797.
[5]HONG Y,ZHANG J,MA B,et al.Using cross-entity inference to improve event extraction[C]//Proceedings of the 49th An-nual Meeting of the Association for Computational Linguistics:Human Language Technologies.2011:1127-1136.
[6]HUANG R,RILOFF E.Modeling textual cohesion for event extraction[C]//Twenty-Sixth AAAI Conference on Artificial Intelligence.2012:1-7.
[7]CHEN Y,XU L,LIU K,et al.Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Na-tural Language Processing.2015:167-176.
[8]CHEN Y,YANG H,LIU K,et al.Collective event detection via a hierarchical and bias tagging networks with gated multi-level attention mechanisms[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:1267-1276.
[9]LIU X,LUO Z,HUANG H.Jointly multiple events extraction via attention-based graph information aggregation[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:1247-1256.
[10]NGUYEN T H,CHO K,GRISHMAN R.Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:300-309.
[11]NGUYEN T H,GRISHMAN R.Modeling skip-grams for event detection with convolutional neural networks[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing.2016:886-891.
[12]SHA L,QIAN F,CHANG B,et al.Jointly extracting event triggers and arguments by dependency-bridge rnn and tensor-based argument interaction[C]//Thirty-Second AAAI Conference on Artificial Intelligence.2018:5916-5923.
[13]LIN H,LU Y,HAN X,et al.Nugget proposal networks for Chi-nese event detection[C]//Proceedings of the 56st Annual Mee-ting of the Association for Computational Linguistics (volume 1:Long papers).2018:1565-1574.
[14]CHEN Z,JI H.Language specific issue and feature exploration in Chinese event extraction[C]//Proceedings of Human Language Technologies:The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics.2009:209-212.
[15]QIN B,ZHAO Y,DING X,et al.Event type recognition based on trigger expansion[J].Tsinghua Science and Technology,2010,15(3):251-258.
[16]LI P,ZHOU G.Employing morphological structures and se-memes for Chinese event extraction[C]//Proceedings of COLING 2012.2012:1619-1634.
[17]GRISHMAN R,WESTBROOK D,MEYERS A.Nyu's english ace 2005 system description[C]//Proccedings of the ACE 2005 Evaluation Workshop.2005.
[18]LI Y,BONTCHEVA K,CUNNINGHAM H.Using unevenmargins SVM and perceptron for information extraction[C]//Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005).2005:72-79.
[19]LI Q,JI H,HUANG L.Joint event extraction via structured prediction with global features[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics.2013:73-82.
[20]LU W,ROTH D.Automatic event extraction with structured preference modeling[C]//Proceedings of the 50th Annual Mee-ting of the Association for Computational Linguistics.2012:835-844.
[21]YANG B,MITCHELL T.Joint extraction of events and entities within a document context [C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:289-299.
[22]NGUYEN T H,GRISHMAN R.Event detection and domainadaptation with convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computatio-nal Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2:Short Papers).2015:365-371.
[23]ZHAO Y,JIN X,WANG Y,et al.Document embedding en-hanced event detection with hierarchical and supervised attention[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics.2018:414-419.
[24]LIN H,LU Y,HAN X,et al.Cost-sensitive regularization for label confusion-aware event detection[J].arXiv:1906.06003,2019.
[25]XIA Y,LIU Y.Chinese Event Extraction Using Deep NeuralNetwork with Word Embedding[J].arXiv:1610.00842,2016.
[26]ZENG Y,YANG H,FENG Y,et al.A convolution BiLSTMneural network model for Chinese event extraction[M].Cham:Springer,2016:275-287.
[27]DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018.
[28]CHEN C,NG V.Joint modeling for chinese event extractionwith rich linguistic features[C]//Proceedings of COLING 2012.2012:529-544.
[1] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[2] LI Xiao-wei, SHU Hui, GUANG Yan, ZHAI Yi, YANG Zi-ji. Survey of the Application of Natural Language Processing for Resume Analysis [J]. Computer Science, 2022, 49(6A): 66-73.
[3] WU Fan, ZHU Pei-pei, WANG Zhong-qing, LI Pei-feng, ZHU Qiao-ming. Chinese Event Detection with Joint Representation of Characters and Words [J]. Computer Science, 2021, 48(4): 249-253.
[4] ZOU Ao, HAO Wen-ning, JIN Da-wei, CHEN Gang, TIAN Yuan. Study on Text Retrieval Based on Pre-training and Deep Hash [J]. Computer Science, 2021, 48(11): 300-306.
[5] LU Jia-wen, YAN Li. Mapping Method from Object-relational Database to RDF(S) [J]. Computer Science, 2021, 48(10): 145-151.
[6] ZUO Xian-yu, ZHANG Zhe, SU Yue-han, LIU Yang, GE Qiang, TIAN Jun-feng. Extraction Algorithm of NDVI Based on GPU Multi-stream Parallel Model [J]. Computer Science, 2020, 47(4): 25-29.
[7] SHI He, YANG Qun, LIU Shao-han, LI Wei. Study on Information Extraction of Power Grid Fault Emergency Pre-plans Based on Deep Learning [J]. Computer Science, 2020, 47(11A): 52-56.
[8] GAO Li-zheng, ZHOU Gang, LUO Jun-yong, LAN Ming-jing. Survey on Meta-event Extraction [J]. Computer Science, 2019, 46(8): 9-15.
[9] YUAN Ren-jin, CHEN Gang. Research on News Recommendation Methods Considering Geographical Location of News [J]. Computer Science, 2018, 45(11A): 462-467.
[10] LI Ying, HAO Xiao-yan and WANG Yong. N-ary Chinese Open Entity-relation Extraction [J]. Computer Science, 2017, 44(Z6): 80-83.
[11] Munina YUSUFU and Gulina YUSUFU. Repetitive Pattern Recognition Algorithms and Applications in Web Information Extraction and Clustering Analysis [J]. Computer Science, 2017, 44(Z11): 39-45.
[12] YANG Jun-wen, WANG Hai, PENG Xin and ZHAO Wen-yun. Web Resource Recommendation Based on Analysis of Developer’s Behavior [J]. Computer Science, 2017, 44(7): 147-150.
[13] BO Shu-kui and JING Yong-ju. One-class Information Extraction from Remote Sensing Imagery Based on Nearest Neighbor Rule [J]. Computer Science, 2016, 43(Z6): 217-218.
[14] WANG Ya-pu, WANG Zhi-jian and YE Feng. Research of Improved Tree Path Model in Web Page Clustering [J]. Computer Science, 2015, 42(5): 109-113.
[15] ZHANG Yu-xiang, GAO Xu-yang, WANG Ting, ZHANG Le-fei and DU Bo. Background Self-learning Framework for Bio Information Extraction from Hyperspectral Images [J]. Computer Science, 2015, 42(4): 292-296.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!