Computer Science ›› 2023, Vol. 50 ›› Issue (5): 255-261.doi: 10.11896/jsjkx.220300154

• Artificial Intelligence • Previous Articles     Next Articles

Document-level Event Extraction Based on Multi-granularity Entity Heterogeneous Graph

ZHANG Hu, ZHANG Guangjun   

  1. School of Computer and Information Technology,Shanxi University,Taiyuan 030006,China
  • Received:2022-03-16 Revised:2022-10-15 Online:2023-05-15 Published:2023-05-06
  • About author:ZHANG Hu,born in 1979,Ph.D,professor,is a member of China Computer Federation.His main research interests include natural language processing and representation learning.
  • Supported by:
    National Natural Science Foundation of China(62176145) and National Key Research and Development Program of China(2020AAA0106100).

Abstract: Document-level event extraction is an event extraction task for long texts with multiple sentences.Existing document-level event extraction studies generally divide event extraction into three sub-tasks:candidate entity extraction,event detection and argument recognition,and usually train them with joint learning.However,most of the existing document-level event extraction methods extract candidate entities sentence-by-sentence without considering the cross-sentence contextual information,which obviously reduces the accuracy of entity extraction and argument recognition.Furthermore,it affects the final event extraction results.Based on this,this paper proposes a document-level event extraction method based on multi-granularity entity heteroge- neous graphs.This method uses two independent encoders,Transformer and RoBerta,for sentence-level and paragraph-level entity extraction respectively.Meanwhile,this paper proposes a multi-granularity entity fusion strategy to select entities that are more likely to be event arguments from the set of sentence entities and paragraph entities,and further constructs a heterogeneous graph incorporating multi-granularity entities.Finally,we use graph convolutional network to obtain document-aware entity and sentence representations for multi-label classification of event types and event arguments to achieve event detection and arguments recognition.Experiments on ChFinAnn and Duee-fin datasets show that the proposed method improves about 1.3% and 3.9% by F1 compared with previous methods,which proves its effectiveness.

Key words: Document-level event extraction, Event extraction, Heterogeneous graph, Entity extraction, Multi-granularity

CLC Number: 

  • TP391
[1]ZHENG S,CAO W,XU W,et al.Doc2EDAG:An end-to-end document-level framework for Chinese financial event extraction[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJC-NLP).2019.
[2]CHEN Y,XU L,LIN K,et al.Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computa-tional Linguistics and the 7th International Joint Conference on Natural Language Processing(Volume 1:Long Papers).2015:167-176.
[3]NGUYEN T H,CHO K,GRISH MAN R.Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:300-309.
[4]LIU X,LUO Z,HUANG H.Jointly multiple events extraction via attention-based graph information aggregation[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018.
[5]YANG S,FENG D,QIAN L,et al.Exploring pre-trained lan-guage models for event extraction and generation[C]//Procee-dings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:5284-5294.
[6]DU X,CARDIE C.Event extraction by answering(almost) na-tural questions[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing.2020.
[7]LIU J,CHEN Y,LIU K,et al.Event extraction as machinereading comprehension[C]//Proceedings of the 2020 Confe-rence on Empirical Methods in Natural Language Processing(EMNLP).2020:1641-1651.
[8]ZHOU Y,CHEN Y,ZHAO J,et al.What the role is vs.What plays the role:Semi-supervised Event Argument Extraction via Dual Question Answering[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:14638-14646.
[9]YANG H,CHEN Y,LIU K,et al.Dcfee:A document-level Chinese financial event extraction system based on automatically labeled training data[C]//Proceedings of the 56th Annual Mee-ting of the Association for Computational Linguistics.ACL 2018.
[10]XU R,LIU T,LI L,et al.Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics.ACL 2021.
[11]YANG H,SUI D,CHEN Y,et al.Document-level Event Extraction via Parallel Prediction Networks[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing(Volume 1:Long Papers).2021:6298-6308.
[12]HUANG Y,JIA W.Exploring Sentence Community for Document-Level Event Extraction[C]//Findings of the Association for Computational Linguistics:EMNLP 2021.2021:340-351.
[13]DODDINGTON G R,MITCHELL A,PRZYBOCKI M A,et al.The automatic content extraction(ace) program-tasks,data,and evaluation[C]//Language Resources and Evaluation Confe-rence.2004,2(1):837-840.
[14]LI X,LI F,PAN L,et al.DuEE:a large-scale dataset for Chinese event extraction in real-world scenarios[C]//CCF International Conference on Natural Language Processing and Chinese Computing.Cham:Springer,2020:534-545.
[15]LI X.DuEE-fin:A Document-level Event Extraction Dataset in the Financial Domain Releasedby BAIDU[OL].https://aistudio.baidu.com/aistudio/competition/detail/46.
[16]VASWANI A,SHAZEERN,PARMAR N,et al.Attention is all you need[C]//Advances in Neural Information Processing Systems.2017:5998-6008.
[17]LIU Y,OTT M,GOYAL N,et al.Roberta:A robustly opti-mized bert pretraining approach[J].arXiv:1907.11692,2019.
[1] WEI Tao, LI Zhihua, WANG Changjie, CHENG Shunhang. Cybersecurity Threat Intelligence Mining Algorithm for Open Source Heterogeneous Data [J]. Computer Science, 2023, 50(6): 330-337.
[2] LIU Songyue, WANG Huan. Leaf Classification and Ranking Method Based on Multi-granularity Feature Fusion [J]. Computer Science, 2023, 50(3): 216-222.
[3] LIU Pan, GUO Yanming, LEI Jun, LAO Mingrui, LI Guohui. Study on Chinese Named Entity Extraction Rules Based on Boundary Location and Correction [J]. Computer Science, 2023, 50(3): 276-281.
[4] LIU Luping, ZHOU Xin, CHEN Junjun, He Xiaohai, QING Linbo, WANG Meiling. Event Extraction Method Based on Conversational Machine Reading Comprehension Model [J]. Computer Science, 2023, 50(2): 275-284.
[5] PU Jinyao, BU Lingmei, LU Yongmei, YE Ziming, CHEN Li, YU Zhonghua. Utilizing Heterogeneous Graph Neural Network to Extract Emotion-Cause Pairs Effectively [J]. Computer Science, 2023, 50(1): 205-212.
[6] SONG Jie, LIANG Mei-yu, XUE Zhe, DU Jun-ping, KOU Fei-fei. Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level [J]. Computer Science, 2022, 49(9): 64-69.
[7] QIN Qi-qi, ZHANG Yue-qin, WANG Run-ze, ZHANG Ze-hua. Hierarchical Granulation Recommendation Method Based on Knowledge Graph [J]. Computer Science, 2022, 49(8): 64-69.
[8] ZHANG Yuan, KANG Le, GONG Zhao-hui, ZHANG Zhi-hong. Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM [J]. Computer Science, 2022, 49(7): 31-39.
[9] YANG Fei-fei, SHEN Si-yu, SHEN De-rong, NIE Tie-zheng, KOU Yue. Method on Multi-granularity Data Provenance for Data Fusion [J]. Computer Science, 2022, 49(5): 120-128.
[10] MIU Feng, WANG Ping, LI Tai-yong. Implicit Causality Extraction Method Based on Event Action Direction [J]. Computer Science, 2022, 49(3): 276-280.
[11] CHI Yu-ning, GUO Yun-fei, WANG Ya-wen, HU Hong-chao. Software Diversity Evaluation Method Based on Multi-granularity Features [J]. Computer Science, 2022, 49(12): 118-124.
[12] ZHU Yi-na, CAO Yang, ZHONG Jing-yue, ZHENG Yong-zhi. Survey on Event Extraction Technology [J]. Computer Science, 2022, 49(12): 264-273.
[13] MIAO Lan-xin, LEI Yu, ZENG Peng-peng, LI Xiao-yu, SONG Jing-kuan. Granularity-aware and Semantic Aggregation Based Image-Text Retrieval Network [J]. Computer Science, 2022, 49(11): 134-140.
[14] WANG Dong, ZHOU Da-ke, HUANG You-da , YANG Xin. Multi-scale Multi-granularity Feature for Pedestrian Re-identification [J]. Computer Science, 2021, 48(7): 238-244.
[15] XU Jin. Construction and Application of Knowledge Graph for Industrial Assembly [J]. Computer Science, 2021, 48(6A): 285-288.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!