计算机科学 ›› 2012, Vol. 39 ›› Issue (10): 278-281.

• 人工智能 • 上一篇    下一篇

基于规则的中文零指代项识别研究

秦凯伟,孔芳,李培峰,朱巧明   

  1. (苏州大学计算机科学与技术学院 苏州215006) (江苏省计算机信息处理技术重点实验室 苏州215006)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Rule-based Identification of Chinese zero Anaphora

  • Online:2018-11-16 Published:2018-11-16

摘要: 提出了一个基于规则的中文零指代项识别方法,即输入一个句法分析树,根据这个句法分析树得到当前词的最小IP子树,再依据得到的IP子树提出中文零指代识别的一些规则。所用的语料是Ontonotes。从实验结果可以看到,该方法在标准的句法分析树上F值能达到82.45%,在自动句法树上其也能达到66. 45%。从实验结果可以看出,该方法在中文零指代识别上具有很好的性能。

关键词: 自然语言处理,中文零指代,句法分析树,基于规则,Ontonotes3. 0

Abstract: A rule-based approach for Chinese zero anaphor detection was proposed. Given a parse tree, the smallest IP sub-tree covering the current predicate was captured. Based on this IP sub-tree, some rules were proposed for detecting whether a Chinese zero anaphor exists. I}his paper also systematically evaluated the rulcbased method on OntoNotescorpus. Using golden parse tree, our method achieves 82. 45 in F-measure. And the F-measure is 63. 84 using automatic parser. The experiment results show that our method is very effective on Chinese zero anaphor detection.

Key words: Natural language processing, Chinese zero anaphora, Parsing tree, Rulcbased, Ontonotc3. 0

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!