计算机科学 ›› 2020, Vol. 47 ›› Issue (6A): 40-44.doi: 10.11896/JsJkx.190700042
吕亿林, 田宏韬, 高建伟, 万怀宇
YU Yi-lin, TIAN Hong-tao, GAO Jian-wei and WAN Huai-yu
摘要: 关系抽取是信息抽取领域中重要的研究任务之一,其典型的应用场景包括知识图谱、问答系统、机器翻译等。目前已经有大量的研究工作将深度学习应用于关系抽取任务中,基于深度神经网络的关系抽取方法在很多场景中的表现都优于传统关系抽取方法。然而,目前基于深度神经网络的方法大多仅依赖于语料本身,缺乏外部知识的引入。针对这个问题,提出了一种结合百科知识与句子语义特征的神经网络关系抽取模型。该模型引入百科实体的背景描述信息作为外部知识,并通过注意力机制动态地从描述信息中提取实体特征,同时利用双向LSTM模型抽取句子中所包含的语义特征,最后结合实体特征和句子语义特征进行实体关系抽取。在人工标注数据集上的对比实验结果表明,文中所提模型的表现明显优于其他现有的关系抽取方法。
中图分类号:
[1] ZENG D,LIU K,LAI S,et al.Relation Classification via Convolutional Deep Neural Network //Proceedings of the 25th International Conference on Computational Linguistics.Stroudsburg,PA:ACL,2014:2335-2344. [2] SANTOS C N,XIANG B,ZHOU B.Classifying Relations by Ranking with Convolutional Neural Networks .arXiv:1504.06580,2015. [3] NGUYEN T H,GRISHMAN R.Relation extraction:Perspective from convolutional neural networks //Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing.Stroudsburg,PA:ACL,2015:39-48. [4] ZHANG D,WANG D.Relation Classification via Recurrent Neural Network .arXiv:1508.01006,2015. [5] ZHANG S,ZHENG D,HU X,et al.Bidirectional long short-term memory networks for relation classification //Procee-dings of the 29th Pacific Asia Conference on Language,Information and Computation.Stroudsburg,PA:ACL,2015:73-78. [6] ZHOU P,SHI W,TIAN J,et al.Attention-based bidirectional long short-term memory networks for relation classification //Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2:Short Papers).Stroudsburg,PA:ACL,2016:207-212. [7] BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate .arXiv:1409.0473,2014. [8] KAMBHATLA N.Combining lexical,syntactic,and semantic features with maximum entropy models for extracting relations //Proceedings of the ACL Interactive Poster and Demonstration Sessions.Stroudsburg,PA:ACL,2004:22. [9] SUCHANEK F M,IFRIM G,WEIKUM G.Combining linguistic and statistical analysis to extract relations from web documents //Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.New York:ACM,2006:712-717. [10] QIAN L,ZHOU G,KONG F,et al.Exploiting constituent dependencies for tree kernel-based semantic relation extraction //Proceedings of the 22nd International Conference on Computational Linguistics.Stroudsburg,PA:ACL,2008:697-704. [11] BUNESCU R C,MOONEY R J.A shortest path dependency kernel for relation extraction //Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing.Stroudsburg,PA:ACL,2005:724-731. [12] MOONEY R J,BUNESCU R C.Subsequence kernels for relation extraction //Proceedings of Advances in Neural Information Processing Systems.Stroudsburg,PA:ACL,2006:171-178. [13] ZENG D,LIU K,CHEN Y,et al.Distant supervision for relation extraction via piecewise convolutional neural networks //Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.Stroudsburg,PA:ACL,2015:1753-1762. [14] HOCHREITER S,SCHMIDHUBER J.Long short-term memory .Neural computation,1997,9(8):1735-1780. [15] CHO K,VAN M B,GULCEHRE C,et al.Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation .arXiv:1406.1078,2014. [16] XU Y,MOU L,LI G,et al.Classifying relations via long short term memory networks along shortest dependency path .arXiv:1508.03720,2015. [17] MIKOLOV T,CHEN K,CORRADO G,et al.Efficient Estimation of Word Representations in Vector Space .arXiv:1301.3781,2013. [18] CHE W,LI Z,LIU T.LTP:A Chinese Language Technology Platform //Proceedings of the 23rd International Conference on Computational Linguistics Demonstrations Volume.Stroudsburg,PA:ACL,2010:13-16. [19] KINGMA D P,BA J.Adam:A method for stochastic optimization .arXiv:1412.6980,2014. |
[1] | 周芳泉, 成卫青. 基于全局增强图神经网络的序列推荐 Sequence Recommendation Based on Global Enhanced Graph Neural Network 计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085 |
[2] | 戴禹, 许林峰. 基于文本行匹配的跨图文本阅读方法 Cross-image Text Reading Method Based on Text Line Matching 计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032 |
[3] | 周乐员, 张剑华, 袁甜甜, 陈胜勇. 多层注意力机制融合的序列到序列中国连续手语识别和翻译 Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion 计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026 |
[4] | 熊丽琴, 曹雷, 赖俊, 陈希亮. 基于值分解的多智能体深度强化学习综述 Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization 计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112 |
[5] | 饶志双, 贾真, 张凡, 李天瑞. 基于Key-Value关联记忆网络的知识图谱问答方法 Key-Value Relational Memory Networks for Question Answering over Knowledge Graph 计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277 |
[6] | 汪鸣, 彭舰, 黄飞虎. 基于多时间尺度时空图网络的交通流量预测模型 Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction 计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188 |
[7] | 王馨彤, 王璇, 孙知信. 基于多尺度记忆残差网络的网络流量异常检测模型 Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network 计算机科学, 2022, 49(8): 314-322. https://doi.org/10.11896/jsjkx.220200011 |
[8] | 姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046 |
[9] | 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥. 基于注意力机制的医学影像深度哈希检索算法 Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism 计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153 |
[10] | 孙奇, 吉根林, 张杰. 基于非局部注意力生成对抗网络的视频异常事件检测方法 Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection 计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061 |
[11] | 闫佳丹, 贾彩燕. 基于双图神经网络信息融合的文本分类方法 Text Classification Method Based on Information Fusion of Dual-graph Neural Network 计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042 |
[12] | 金方焱, 王秀利. 融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取 Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM 计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190 |
[13] | 熊罗庚, 郑尚, 邹海涛, 于化龙, 高尚. 融合双向门控循环单元和注意力机制的软件自承认技术债识别方法 Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism 计算机科学, 2022, 49(7): 212-219. https://doi.org/10.11896/jsjkx.210500075 |
[14] | 彭双, 伍江江, 陈浩, 杜春, 李军. 基于注意力神经网络的对地观测卫星星上自主任务规划方法 Satellite Onboard Observation Task Planning Based on Attention Neural Network 计算机科学, 2022, 49(7): 242-247. https://doi.org/10.11896/jsjkx.210500093 |
[15] | 赵冬梅, 吴亚星, 张红斌. 基于IPSO-BiLSTM的网络安全态势预测 Network Security Situation Prediction Based on IPSO-BiLSTM 计算机科学, 2022, 49(7): 357-362. https://doi.org/10.11896/jsjkx.210900103 |
|