计算机科学 ›› 2009, Vol. 36 ›› Issue (8): 208-211.

• 人工智能 • 上一篇    下一篇

复杂中文文本的实体关系抽取研究

王苑,徐德智,陈建二   

  1. (中南大学信息科学与工程学院 长沙 410083)
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本课题受国家自然科学基金重点项目(60433020),湖南省自然科学基金(06JJ50142),湖南省国土资源厅科技计划项目(200718)资助。

Entity Relation Extraction for Complex Chinese Text

WAND Yuan XU De-zhi CHEN Jian-er   

  • Online:2018-11-16 Published:2018-11-16

摘要: 实体关系抽取是信息抽取研究领域中的重要研究课题之一。针对已有方法在处理复杂文本上的不足,提出了复杂中文文本的实体关系抽取方法。结合中文文本的语法特征,提出了7条抽取关系特征序列的启发式规则,并采用语义序列核和KNN机器学习算法结合的方法来分类和标注关系的类型。通过对ACE:评测定义下的两个子类的实体关系抽取,关系抽取的平均F值达到了76%,明显高于传统的基于特征向量和最短依存路径核的方法。

关键词: 实体关系抽取,语法特征,启发式规则,语义序列核

Abstract: Entity Relation Extraction is one of the important research fields in Information Extraction. Aiming at the problem of inefficiency of existing approaches dealing with entity relation extraction, this paper presented a novel approach. I}his new approach proposes seven heuristic rules to extract relation feature sequence through combining with grammar feature of Chinese text, and applies the semantic sequence kernel function with KNN learning algorithm to fulfill the entity relation extraction task. Experiments arc carried out on two kinds of relation types defined in the ACE guidelines, results show that the new approach achieves an average F-score up to 76%,significantly higher than the traditional feature-based approaches and traditional shortest path for dependency kernel approaches.

Key words: Entity relation extraction, Grammar feature, Heuristic rule, Semantic sequence kernel

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!