计算机科学 ›› 2020, Vol. 47 ›› Issue (6A): 40-44.doi: 10.11896/JsJkx.190700042

• 人工智能 • 上一篇    下一篇

结合百科知识与句子语义特征的关系抽取方法

吕亿林, 田宏韬, 高建伟, 万怀宇   

  1. 北京交通大学计算机与信息技术学院 北京 100044
  • 发布日期:2020-07-07
  • 通讯作者: 万怀宇(hywan@bJtu.edu.cn)
  • 作者简介:yilinlv@bJtu.edu.cn
  • 基金资助:
    国家重点研发计划项目(2018YFC0830200)

Relation Extraction Method Combining Encyclopedia Knowledge and Sentence Semantic Features

YU Yi-lin, TIAN Hong-tao, GAO Jian-wei and WAN Huai-yu   

  1. School of Computer and Information Technology,BeiJing Jiaotong University,BeiJing 100044,China
  • Published:2020-07-07
  • About author:LYU Yi-lin, born in 1997, undergra-duate student.His main research interest is information extraction.
    WAN Huai-yu, born in 1981, Ph.D, associate professor, Ph.D supervisor, is a member of China Computer Federation.His main research interests include social network mining, text mining, user behavior analysis, and traffic data mining.
  • Supported by:
    This work was supported by the National Key R&D Program of China (2018YFC0830200).

摘要: 关系抽取是信息抽取领域中重要的研究任务之一,其典型的应用场景包括知识图谱、问答系统、机器翻译等。目前已经有大量的研究工作将深度学习应用于关系抽取任务中,基于深度神经网络的关系抽取方法在很多场景中的表现都优于传统关系抽取方法。然而,目前基于深度神经网络的方法大多仅依赖于语料本身,缺乏外部知识的引入。针对这个问题,提出了一种结合百科知识与句子语义特征的神经网络关系抽取模型。该模型引入百科实体的背景描述信息作为外部知识,并通过注意力机制动态地从描述信息中提取实体特征,同时利用双向LSTM模型抽取句子中所包含的语义特征,最后结合实体特征和句子语义特征进行实体关系抽取。在人工标注数据集上的对比实验结果表明,文中所提模型的表现明显优于其他现有的关系抽取方法。

关键词: 长短期记忆网络, 实体关系抽取, 中文百科, 注意力机制

Abstract: Relation extraction is one of the important research topics in the field of information extraction.Its typical application scenarios include knowledge graphs,question answering systems,machine translation,etc.Recently,deep learning has been applied in a large amount of relation extraction researches,and deep neural networks based relationship extraction method performs much better than the traditional methods in many situations.However,most of the current deep neural network-based relation extraction methods Just rely on the corpus itself and lack the introduction of external knowledge.To address this issue,this paper proposed a neural network model,which combined encyclopedia knowledge and semantic features of sentences for relation extraction.The model introduced the description information of entities in encyclopedia as external knowledge,and dynamically extracted entity features through attention mechanism.Meanwhile,it employed bidirectional LSTM networks to extract the semantic features contained in the sentence.Finally,the model combined the entity features and the sentence semantic features for relation extraction.A series of experiments were carried out based on a manually labeled dataset.Experimental results demonstrate that the proposed model is superior to other existing relationship extraction methods.

Key words: Attention mechanism, Chinese encyclopedia, Entity relation extraction, Long short-term memory

中图分类号: 

  • TP391
[1] ZENG D,LIU K,LAI S,et al.Relation Classification via Convolutional Deep Neural Network //Proceedings of the 25th International Conference on Computational Linguistics.Stroudsburg,PA:ACL,2014:2335-2344.
[2] SANTOS C N,XIANG B,ZHOU B.Classifying Relations by Ranking with Convolutional Neural Networks .arXiv:1504.06580,2015.
[3] NGUYEN T H,GRISHMAN R.Relation extraction:Perspective from convolutional neural networks //Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing.Stroudsburg,PA:ACL,2015:39-48.
[4] ZHANG D,WANG D.Relation Classification via Recurrent Neural Network .arXiv:1508.01006,2015.
[5] ZHANG S,ZHENG D,HU X,et al.Bidirectional long short-term memory networks for relation classification //Procee-dings of the 29th Pacific Asia Conference on Language,Information and Computation.Stroudsburg,PA:ACL,2015:73-78.
[6] ZHOU P,SHI W,TIAN J,et al.Attention-based bidirectional long short-term memory networks for relation classification //Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2:Short Papers).Stroudsburg,PA:ACL,2016:207-212.
[7] BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate .arXiv:1409.0473,2014.
[8] KAMBHATLA N.Combining lexical,syntactic,and semantic features with maximum entropy models for extracting relations //Proceedings of the ACL Interactive Poster and Demonstration Sessions.Stroudsburg,PA:ACL,2004:22.
[9] SUCHANEK F M,IFRIM G,WEIKUM G.Combining linguistic and statistical analysis to extract relations from web documents //Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.New York:ACM,2006:712-717.
[10] QIAN L,ZHOU G,KONG F,et al.Exploiting constituent dependencies for tree kernel-based semantic relation extraction //Proceedings of the 22nd International Conference on Computational Linguistics.Stroudsburg,PA:ACL,2008:697-704.
[11] BUNESCU R C,MOONEY R J.A shortest path dependency kernel for relation extraction //Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing.Stroudsburg,PA:ACL,2005:724-731.
[12] MOONEY R J,BUNESCU R C.Subsequence kernels for relation extraction //Proceedings of Advances in Neural Information Processing Systems.Stroudsburg,PA:ACL,2006:171-178.
[13] ZENG D,LIU K,CHEN Y,et al.Distant supervision for relation extraction via piecewise convolutional neural networks //Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.Stroudsburg,PA:ACL,2015:1753-1762.
[14] HOCHREITER S,SCHMIDHUBER J.Long short-term memory .Neural computation,1997,9(8):1735-1780.
[15] CHO K,VAN M B,GULCEHRE C,et al.Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation .arXiv:1406.1078,2014.
[16] XU Y,MOU L,LI G,et al.Classifying relations via long short term memory networks along shortest dependency path .arXiv:1508.03720,2015.
[17] MIKOLOV T,CHEN K,CORRADO G,et al.Efficient Estimation of Word Representations in Vector Space .arXiv:1301.3781,2013.
[18] CHE W,LI Z,LIU T.LTP:A Chinese Language Technology Platform //Proceedings of the 23rd International Conference on Computational Linguistics Demonstrations Volume.Stroudsburg,PA:ACL,2010:13-16.
[19] KINGMA D P,BA J.Adam:A method for stochastic optimization .arXiv:1412.6980,2014.
[1] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[2] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[5] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[6] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[7] 王馨彤, 王璇, 孙知信.
基于多尺度记忆残差网络的网络流量异常检测模型
Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network
计算机科学, 2022, 49(8): 314-322. https://doi.org/10.11896/jsjkx.220200011
[8] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[9] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[10] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[11] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[12] 金方焱, 王秀利.
融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取
Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM
计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190
[13] 熊罗庚, 郑尚, 邹海涛, 于化龙, 高尚.
融合双向门控循环单元和注意力机制的软件自承认技术债识别方法
Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism
计算机科学, 2022, 49(7): 212-219. https://doi.org/10.11896/jsjkx.210500075
[14] 彭双, 伍江江, 陈浩, 杜春, 李军.
基于注意力神经网络的对地观测卫星星上自主任务规划方法
Satellite Onboard Observation Task Planning Based on Attention Neural Network
计算机科学, 2022, 49(7): 242-247. https://doi.org/10.11896/jsjkx.210500093
[15] 赵冬梅, 吴亚星, 张红斌.
基于IPSO-BiLSTM的网络安全态势预测
Network Security Situation Prediction Based on IPSO-BiLSTM
计算机科学, 2022, 49(7): 357-362. https://doi.org/10.11896/jsjkx.210900103
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!