计算机科学 ›› 2023, Vol. 50 ›› Issue (3): 42-48.doi: 10.11896/jsjkx.220600239

• 知识图谱赋能的知识工程:理论、技术与系统专题 • 上一篇    下一篇

BGPNRE:一种基于BERT的全局指针网络实体关系联合抽取方法

邓亮1,2,3, 齐攀虎4, 刘振龙4, 李敬鑫4, 唐积强5   

  1. 1 中国科学院大学 北京 100049
    2 中国科学院沈阳计算技术研究所 沈阳 110168
    3 国家知识产权专利局 北京 100083
    4 中国科学院计算技术研究所 北京 100190
    5 国家计算机网络应急技术处理协调中心 北京 100029
  • 收稿日期:2022-06-27 修回日期:2022-12-15 出版日期:2023-03-15 发布日期:2023-03-15
  • 通讯作者: 齐攀虎(qipanhu@163.com)
  • 作者简介:(deng_sipo@msn.com)

BGPNRE:A BERT-based Global Pointer Network for Named Entity-Relation Joint Extraction Method

DENG Liang1,2,3, QI Panhu4, LIU Zhenlong4, LI Jingxin4, TANG Jiqiang5   

  1. 1 University of Chinese Academy of Sciences,Beijing 100049,China
    2 Shenyang Institute of Computing Technology,Chinese Academy of Sciences,Shenyang 110168,China
    3 China National Intellectual Property Administration,Beijing 100083,China
    4 Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
    5 National Computer Network Emergency Response Technical Team/Coordination Center of China,Beijing 100029,China
  • Received:2022-06-27 Revised:2022-12-15 Online:2023-03-15 Published:2023-03-15
  • About author:DENG Liang,born in 1980,postgra-duate.His main research interests include deep learning and knowledge graph.
    QI Panhu,born in 1990,postgraduate.His main research interests include deep learning andnatural language Processing.

摘要: 实体-关系联合抽取指从非结构化文本中联合抽取出实体-关系三元组,是信息抽取和知识图谱构建的一项关键任务。文中提出了一种新的基于全局指针网络实体关系联合抽取方法BGPNRE(BERT-based Global Pointer Network for Named Entity-Relation Joint Extraction),首先通过潜在关系预测模块预测文本中蕴含的关系,过滤掉不可能存在的关系,将实体抽取限制在预测的关系子集中;其次通过使用基于关系的全局指针网络,获取所有主客体实体的位置;最后通过全局指针网络通信模块,将主客体位置高效率地解码对齐成一个实体关系三元组。该方法避免了传统管道式方法存在的错误传播问题,同时也解决了关系冗余、实体重叠、Span提取泛化不足等问题。实验结果表明,所提方法在多关系和重叠实体抽取上表现卓越,并且在NYT和WebNLG公共数据集上达到了最先进的水平。

关键词: 实体-关系联合抽取, BGPNRE, 全局指针网络, BERT

Abstract: Named entity-relation joint extraction refers to extracting entity-relation triples from unstructured text.It's an important task for information extraction and knowledge graph construction.This paper proposes a new method--BERT-based global pointer network for named entity-relation joint extraction(BGPNRE).Firstly,the potential relation prediction module is used to predict the relations contained in the text,filters out the impossible relations,and limits the predicted relation subset for entity recognition.Then a relation-specific global pointer-net is used to obtain the location of all subject and object entities.Finally,a global pointer network correspondence component is designed to align the subject and object position into named entity-relation triples.This method avoids error propagation frompipeline model,and also solves the the redundancy of relation prediction,entity overlapping,and poor generalization of span-based extraction.Extensive experiments show that our model achieves state-of-the-art performance on NYT and WebNLG public benchmarks with higher performance gain on multi relations and entities overlapping.

Key words: Named entity-relation joint extraction, BGPNRE, Global pointer network, BERT

中图分类号: 

  • TP391
[1]SANG E,MEULDER F.Language-independent named entityrecognition[C]//Proceedings of the Seventh Conference onNa-tural Language Learning at HLT-NAACL 2003.2003:142-147.
[2]RATINOV L,ROTH D.Design challenges and misconceptions in named entity recognition[C]//Proceedings of the Thirteenth Conference on Computational Natural Language Learning.2009:147-155.
[3]ZELNKO D,AONE C,RICHARDELLA A.Kernel methods for relation extraction[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing-Volume 10.2002:77-78.
[4]RAZVAN C,RAYMOND J.A shortest path dependency kernel for relation extraction[C]//Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing.2005:724-731.
[5]PAWAR S,GIRISH K,BHATTACHARYYA P.Relation ex-traction:A survey[J/OL].https://doi.org/10.48550/arXiv.1712.05191.
[6]WANG Z,WEN R,CHEN X,et al.Finding in influential instances for distantly supervisedrelation extraction[J/OL].https://doi.org/10.48550/arXiv.1712.05191.
[7]YU X,LAM W.Jointly identifying entities and extracting relations in encyclopedia text via agraphical model approach[C]//The 28th International Conference on Computational Linguistics.2010:1399-1407.
[8]LI Q,JI H.Incremental joint extraction of entity mentions and relations[C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).2014:402-412.
[9]MIWA M,SASAKI Y.Modeling joint entity and relation extraction with tablerepresentation[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing.2014:1858-1869.
[10]REN X,WU Z,HE W,et al.Cotype:Joint extraction of typed entities and relations with knowledge bases[C]//Proceedings of the 26th International Conference on World Wide Web.2017:1015-1024.
[11]ZHENG S,WANG F,BAO H,et al.Joint extraction of entities and relations based on a noveltatagging scheme[C]//Procee-dings of the 55th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).2017:1227-1236.
[12]BEKOULIS G,DElEU J,DEMEESTER T,et al.Joint entityrecognition and relation extraction as a multi-head selection problem[J].Expert System with Applications,2018,114:34-45.
[13]NAYAK T,TOU H.Effective modeling of encoder-decoder architecture for joint entity and relation extraction[C]//Procee-dings of the AAAI Conference on Artifificial Intelligence.2020:8528-8535.
[14]WEI Z,SU J,WANG Y,et al.A novel cascade binary tagging framework for relational tripleextraction[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:1476-1488.
[15]YUAN Y,ZHOU X,PAN S,et al.A relation-specifific attention network for joint entity andrelation extraction[C]//Inter-national Joint Confernce on Artifificial Intelligence.2020:4054-4060.
[16]WANG Y,YU B,ZHANG Y,et al.TPLinker:Single-stage joint extraction of entities and relations through token pair linking[C]//Proceedings of the 28th International Conference on Computational Linguistics.2020:1572-1582.
[17]SUI D,CHEN Y,LIU K,et al.Joint Entity and Relation Extraction with Set Prediction Networks [J/OL].https://doi.org/ 10.48550/arXiv.2011.01675.
[18]ZENG X,HE S,ZENG D,et al.PRGC:Potential Relation andGlobal Correspondence Based Joint Relational Triple Extraction[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics.2021:6225-6235.
[19]SHANG Y,HUANG H,MAO X.OneRel:Joint Entity and Relation Extraction with One Module in One Step[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2022:11285-11293.
[20]VINYALS O,FORTUNATO M,JAITLY N.Pointer networks[C]//Advances in Neural Information Processing Systems.2015:2674-2682.
[21]DEVLIN J,CHANG M,LEE K.Bert:Pre-training of deep bidirectional transformers forlanguage understanding[C]//Annual Conference of the North American Chapter of the Association for Computational Linguistics.2019:4171-4186.
[22]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Advances in Neural Information Processing Systems.2017:6000-6010.
[23]SU J,LU Y,PAN S,et al.RoFormer:Enhanced Transformer with Rotary Position Embedding[J/OL].https://doi.org/10.48550/arXiv.2104.09864.
[24]LIN M,CHEN Q,YAN S.Network in network[C]//2nd International Conference on Learning Representations.2014:14-16.
[25]SUN Y,CHENG C,ZHANG Y,et al.Circle Loss:A Unified Perspective of Pair Similarity Optimization[J/OL].https://doi.org/ 10.48550/arXiv.2002.10857.
[26]YU B,ZHANG Z,SU J,et al.Joint extraction of entities and relations based on a novel decomposition strategy[C]//24th European Conference on Artifificial Intelligence-ECAI 2020.2019.
[27]RIEDEL S,YAO L,MCCALLUM A.Modeling relations andtheir mentions without labeled text [C]//Proceedings of Joint European Conference on Machine Learning and Knowledge Discovery in Databases.2010.
[28]GARDENT C,SHIMORINA A,NARAYAN S,et al.Creating training corpora for NLG microplanners[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).2017:179-188.
[29]DIEDERIK P,BA J.Adam:A method for stochastic optimization[C]//3rd International Conference on Learning Representations.2015.
[30]LOSHCHILOV I,HUTTER F.Fixing weight decay regularization in Adam [J/OL].https://doi.org/10.48550/arXiv.1711.05101.
[31]ZENG X,ZENG D,HE S,et al.Extracting relational facts by an end-to-end neural model withCopy Mechanism[C]//Procee-dings of the 56th Annual Meeting of the Association for Computational(Volume1:Long Papers).2018:506-514.
[32]FU T,LI P,MA W.GraphRel:Modeling text as Relationalgraphs for joint entity and relation extraction[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:1409-1418.
[33]ZENG X,HE S,ZENG D,et al.Learning the extraction order of multiple relational facts in a sentence with reinforcement learning[C]//Proceedings of the 56th Annual Meeting of the Association for Computational(Volume1:Long Papers).2018:506-514.
[1] 刘哲, 殷成凤, 李天瑞.
基于BERT和多特征融合嵌入的中文拼写检查
Chinese Spelling Check Based on BERT and Multi-feature Fusion Embedding
计算机科学, 2023, 50(3): 282-290. https://doi.org/10.11896/jsjkx.220100104
[2] 曹金娟, 钱忠, 李培峰.
基于联合模型的端到端事件可信度识别
End-to-End Event Factuality Identification with Joint Model
计算机科学, 2023, 50(2): 292-299. https://doi.org/10.11896/jsjkx.211200108
[3] 于家畦, 康晓东, 白程程, 刘汉卿.
一种新的中文电子病历文本检索模型
New Text Retrieval Model of Chinese Electronic Medical Records
计算机科学, 2022, 49(6A): 32-38. https://doi.org/10.11896/jsjkx.210400198
[4] 康雁, 吴志伟, 寇勇奇, 张兰, 谢思宇, 李浩.
融合Bert和图卷积的深度集成学习软件需求分类
Deep Integrated Learning Software Requirement Classification Fusing Bert and Graph Convolution
计算机科学, 2022, 49(6A): 150-158. https://doi.org/10.11896/jsjkx.210500065
[5] 余本功, 张子薇, 王惠灵.
一种融合多层次情感和主题信息的TS-AC-EWM在线商品排序方法
TS-AC-EWM Online Product Ranking Method Based on Multi-level Emotion and Topic Information
计算机科学, 2022, 49(6A): 165-171. https://doi.org/10.11896/jsjkx.210400238
[6] 郭雨欣, 陈秀宏.
融合BERT词嵌入表示和主题信息增强的自动摘要模型
Automatic Summarization Model Combining BERT Word Embedding Representation and Topic Information Enhancement
计算机科学, 2022, 49(6): 313-318. https://doi.org/10.11896/jsjkx.210400101
[7] 韦入铭, 陈若愚, 李晗, 刘旭红.
基于深度学习与文本计量的技术趋势分析
Analysis of Technology Trends Based on Deep Learning and Text Measurement
计算机科学, 2022, 49(11A): 211100119-6. https://doi.org/10.11896/jsjkx.211100119
[8] 陈孜卓, 林夕, 王中卿.
基于论据边界识别的立场分类研究
Stance Detection Based on Argument Boundary Recognition
计算机科学, 2022, 49(11A): 210800180-5. https://doi.org/10.11896/jsjkx.210800180
[9] 朱若尘, 杨长春, 张登辉.
EGOS-DST:对话现象感知和模式引导的一步对话状态追踪算法
EGOS-DST:Efficient Schema-guided Approach to One-step Dialogue State Tracking for Diverse Expressions
计算机科学, 2022, 49(11A): 210900246-7. https://doi.org/10.11896/jsjkx.210900246
[10] 程思伟, 葛唯益, 王羽, 徐建.
BGCN:基于BERT和图卷积网络的触发词检测
BGCN:Trigger Detection Based on BERT and Graph Convolution Network
计算机科学, 2021, 48(7): 292-298. https://doi.org/10.11896/jsjkx.200500133
[11] 董哲, 邵若琦, 陈玉梁, 翟维枫.
基于BERT和对抗训练的食品领域命名实体识别
Named Entity Recognition in Food Field Based on BERT and Adversarial Training
计算机科学, 2021, 48(5): 247-253. https://doi.org/10.11896/jsjkx.200800181
[12] 陈德, 宋华珠, 张娟, 周泓林.
融合BERT和记忆网络的实体识别
Entity Recognition Fusing BERT and Memory Networks
计算机科学, 2021, 48(10): 91-97. https://doi.org/10.11896/jsjkx.200900015
[13] 杜琳, 曹东, 林树元, 瞿溢谦, 叶辉.
基于BERT与Bi-LSTM融合注意力机制的中医病历文本的提取与自动分类
Extraction and Automatic Classification of TCM Medical Records Based on Attention Mechanism of BERT and Bi-LSTM
计算机科学, 2020, 47(11A): 416-420. https://doi.org/10.11896/jsjkx.200200020
[14] 王子牛, 姜猛, 高建瓴, 陈娅先.
基于BERT的中文命名实体识别方法
Chinese Named Entity Recognition Method Based on BERT
计算机科学, 2019, 46(11A): 138-142.
[15] 章慧,陈宏明.
融合SUSAN算法和Robert算法的图像边缘检测滤波处理技术
Hybrid SUSAN Algorithm and Robert Algorithm for Image Edge Detection Filtering Technique
计算机科学, 2013, 40(3): 302-304.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!