计算机科学 ›› 2010, Vol. 37 ›› Issue (9): 173-176.
• 数据库与数据挖掘 • 上一篇 下一篇
黄承慧,印鉴,侯昉
出版日期:
发布日期:
基金资助:
HUANG Cheng-hui,YIN Jian,HOU Fang
Online:
Published:
摘要: 在文本检索领域,当前广泛应用的方法或者是考察检索词项与被检索文本的词频信息,或者是考察检索词项与被检索文本的语义相似性。这些方法忽略了检索词项与被检索文本的结构信息,检索结果有一定的局限性。通过分析检索词项与被检索文本句子结构的主谓宾信息,进而考察主谓宾结构中词汇的语义相似性,最终实现对文本的语义检索。实验表明,该方法能够有效提高检索的查准率。
关键词: 文本信息检索,语义相似度,语法结构信息,检索算法
Abstract: In text retrieve area, popular methods either considered word frequency or semantic information between retrieve terms and text corpus. These methods ignore the semantic structure information of retrieve terms and text corpus,and then the good result limits to some domains. This paper analyzed the subject verb-object structure information of text, and then computed the similarity of the words where the words lie in the subject-verb-object structure, and final1y implemented semantic information retrieving of texts. The experiment shows that the approach could improve the precision effectively.
Key words: Text information retrieve, Semantic similarity, Syntax structure information, Retroeve algorithm
黄承慧,印鉴,侯昉. 一种基于主谓宾结构的文本检索算法[J]. 计算机科学, 2010, 37(9): 173-176. https://doi.org/
HUANG Cheng-hui,YIN Jian,HOU Fang. Improved Text Retrieve Algorithm Based on Subject-verb-object Structure[J]. Computer Science, 2010, 37(9): 173-176. https://doi.org/
0 / / 推荐
导出引用管理器 EndNote|Reference Manager|ProCite|BibTeX|RefWorks
链接本文: https://www.jsjkx.com/CN/
https://www.jsjkx.com/CN/Y2010/V37/I9/173
Cited