计算机科学 ›› 2022, Vol. 49 ›› Issue (12): 305-311.doi: 10.11896/jsjkx.211100264
朱广丽, 许鑫, 张顺香, 吴厚月, 黄菊
ZHU Guang-li, XU Xin, ZHANG Shun-xiang, WU Hou-yue, HUANG Ju
摘要: 因果关系抽取是一种从文本中抽取因果实体对的自然语言处理技术,被广泛应用于金融、医疗等领域。传统的因果关系抽取技术需要人工选取文本特征进行因果匹配或使用神经网络多次提取特征,导致模型结构较为复杂,抽取效率不高。针对这一问题,提出一种基于位置的因果关系抽取网络(Position-based Causal Extraction Network,PosNet),以期提高因果关系的抽取效率。首先,预处理文本,构建多粒度文本特征作为网络的输入;然后,将文本特征传入位置预测网络,使用经典的浅层卷积神经网络预测因果实体的开始位置和结束位置;最后,通过组装算法按起始位置组装因果实体,抽取出全部因果实体对。实验结果证明PosNet可以提升因果关系抽取的效率。
中图分类号:
[1]WANG Z J,WANG S,LI X Q,et al.Review of Event Causality Extraction Based on Deep Learning[J].Journal of Computer Applications,2021,41(5):1247-1255. [2]ITTOO A,BOUMA G.Extracting explicit and implicit causal relations from sparse,domain-specific texts[C]//International Conference on Application of Natural Language to Information Systems.Berlin:Springer,2011:52-63. [3]LUO Z,SHA Y,ZHU K Q,et al.Commonsense causal reaso-ning between short texts[C]//Proceedings of the Fifteenth International Conference on Principles of Knowledge Representation and Reasoning.Palo Alto,CA:AAAI Press,2016:421-430. [4]ZHAO S,LIU T,ZHAO S,et al.Event causality extractionbased on connectives analysis[J].Neurocomputing,2016,173(3):1943-1950. [5]SEOL J W,YI W,CHOI J,et al.Causality patterns and machine learning for the extraction of problem-action relations in discharge summaries[J].International Journal of Medical Informatics,2017,98:1-12. [6]LEE D G,SHIN H.Disease causality extraction based on lexical semantics and document-clause frequency from biomedical literature[J].BMC Medical Informatics and Decision Making,2017,17(1):1-9. [7]LEE S,SEO S,OH B,et al.Cross-sentence N-ary Relation Extraction using Entity Link and Discourse Relation[C]//Procee-dings of the 29th ACM International Conference on Information &Knowledge Management.New York:ACM,2020:705-714. [8]FRATTINI J,JUNKER M,UNTERKALMSTEINER M,et al.Automatic extraction of cause-effect-relations from requirements artifacts[C]//Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering.NJ:IEEE,2020:561-572. [9]HEINDORF S,SCHOLTEN Y,WACHSMUTH H,et al.Causenet:Towards a causality graph extracted from the web[C]//Proceedings of the 29th ACM International Conference on Information & Knowledge Management.New York:ACM,2020:3023-3030. [10]KRUENGKRAI C,TORISAWA K,HASHIMOTO C,et al.Improving event causality recognition with multiple background knowledge sources using multi-column convolutional neural networks[C]//Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence.Palo Alto,CA:AAAI Press,2017:3466-3473. [11]ZHENG S,HAO Y,LU D,et al.Joint entity and relation extraction based on a hybrid neural network[J].Neurocomputing,2017,257:59-66. [12]ZHENG S,WANG F,BAO H,et al.Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme[C]//Procee-dings of the 55th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).PA:ACL,2017:1227-1236. [13]ZENG X,ZENG D,HE S,et al.Extracting relational facts by an end-to-end neural model with copy mechanism[C]//Proceedings of the 56th Annual Meeting of the Association for Computa-tional Linguistics(Volume 1:Long Papers).PA:ACL,2018:506-514. [14]DASGUPTA T,SAHA R,DEY L,et al.Automatic extraction of causal relations from text using linguistically informed deep neural networks[C]//Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue.PA:ACL,2018:306-316. [15]LI P,MAO K.Knowledge-oriented convolutional neural net-work for causal relation extraction from natural language texts[J].Expert Systems with Applications,2019,115:512-523. [16]LI Z,LI Q,ZOU X,et al.Causality extraction based on self-attentive bilstm-crf with transferred embeddings[J].Neurocomputing,2021,423:207-219. [17]SAHU S K,THOMAS D,CHIU B,et al.Relation extraction with self-determined graph convolutional network[C]//Proceedings of the 29th ACM International Conference on Information & Knowledge Management.New York:ACM,2020:2205-2208. [18]ZHAO K,JI D,HE F,et al.Document-level event causalityidentification via graph inference mechanism[J].Information Sciences,2021,561:115-129. [19]CAO Y,CHEN D,XU Z,et al.Nested relation extraction with iterative neural network[J].Frontiers of Computer Science,2021,15(3):1-14. [20]JIAO F,LI H,DOBOLI A.Modeling and extraction of causal information in analog circuits[J].IEEE Transactions on Compu-ter-Aided Design of Integrated Circuits and Systems,2017,37(10):1915-1928. [21]KIM H,JOUNG J,KIM K.Semi-automatic extraction of technological causality from patents[J].Computers & Industrial Engineering,2018,115:532-542. [22]MAISONNAVE M,DELBIANCO F,TOHMÉ F,et al.Asses-sing Causality Structures learned from Digital Text Media[C]//Proceedings of the ACM Symposium on Document Engineering 2020.New York:ACM,2020:1-4. [23]NASAR Z,JAFFRY S W,MALIK M K.Named Entity Recognition and Relation Extraction:State-of-the-Art[J].ACM Computing Surveys(CSUR),2021,54(1):1-39. [24]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[J].arXiv:1301.3781,2013. [25]KENTON J D M W C,TOUTANOVA L K.BERT:Pre-trai-ning of Deep Bidirectional Transformers for Language Understanding[C]//Proceedings of NAACL-HLT.PA:ACL,2019:4171-4186. [26]LAN Z,CHEN M,GOODMAN S,et al.Albert:A lite bert for self-supervised learning of language representations[J].arXiv:1909.11942,2019. [27]SHAW P,USZKOREIT J,VASWANI A.Self-Attention with Relative Position Representations[C]//Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.PA:ACL,2018:464-468. |
[1] | 周芳泉, 成卫青. 基于全局增强图神经网络的序列推荐 Sequence Recommendation Based on Global Enhanced Graph Neural Network 计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085 |
[2] | 金方焱, 王秀利. 融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取 Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM 计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190 |
[3] | 杨岚岚, 王文琪, 王福田. 基于高秩特征和位置注意力的RGBT目标跟踪 RGBT Object Tracking Based on High Rank Feature and Position Attention 计算机科学, 2022, 49(12): 236-243. https://doi.org/10.11896/jsjkx.220600037 |
[4] | 余晗青, 杨贞, 殷志坚. 基于区域激活策略的Tiny YOLOv3目标检测算法 Tiny YOLOv3 Target Detection Algorithm Based on Region Activation Strategy 计算机科学, 2021, 48(6A): 118-121. https://doi.org/10.11896/jsjkx.200700122 |
[5] | 陈庆超, 王韬, 尹世庄, 冯文博. 多级字典存储的未知文本协议候选关键词链式合并方法 Chain Merging Method for Unknown Text Protocol Candidate Keyword Stored in Multi-levelDictionary 计算机科学, 2020, 47(12): 332-335. https://doi.org/10.11896/jsjkx.190900116 |
[6] | 纪明轩, 宋玉蓉. 一种基于对数位置表示和自注意力的机器翻译新模型 New Machine Translation Model Based on Logarithmic Position Representation and Self-attention 计算机科学, 2020, 47(11A): 86-91. https://doi.org/10.11896/jsjkx.200200003 |
[7] | 陈湘涛,肖碧文. 基于位置信息的显露序列模式挖掘研究 Emerging Sequences Pattern Mining Based on Location Information 计算机科学, 2017, 44(7): 175-179. https://doi.org/10.11896/j.issn.1002-137X.2017.07.031 |
[8] | 王青芸,程春玲. 基于位置信息的移动SNS数据动态划分复制算法 Mobile SNS Data Dynamic Partitioning and Replication Algorithm Based on Location Information 计算机科学, 2017, 44(3): 220-225. https://doi.org/10.11896/j.issn.1002-137X.2017.03.046 |
[9] | 庞松超,罗长远,韩东东,庞涵滢. 一种新的航空自组网混合路由算法 Aeronautical Ad hoc Network Hybrid Routing Algorithm 计算机科学, 2016, 43(5): 56-61. https://doi.org/10.11896/j.issn.1002-137X.2016.05.010 |
[10] | 席瑞,李玉军,侯孟书. 室内定位方法综述 Survey on Indoor Localization 计算机科学, 2016, 43(4): 1-6. https://doi.org/10.11896/j.issn.1002-137X.2016.04.001 |
[11] | 李响,孙华志. 一种新型的防范历史攻击的k-匿名算法 New k-anonymization Algorithm for Preventing Historical Attacks 计算机科学, 2015, 42(8): 194-197. |
[12] | . 基于位置信息的自适应Ad Hoc路由协议 计算机科学, 2007, 34(5): 20-24. |
[13] | 孙君顶 张喜民 崔江涛 周利华. 一种新的基于颜色和空间特征的图像检索方法 计算机科学, 2005, 32(6): 158-160. |
[14] | 王庆辉 孙俊锁 王光兴. 一种基于位置信息的MANET网络多路径路由方法 计算机科学, 2005, 32(5): 27-30. |
|