一种基于主谓宾结构的文本检索算法

计算机科学 ›› 2010, Vol. 37 ›› Issue (9): 173-176.

一种基于主谓宾结构的文本检索算法

黄承慧,印鉴,侯昉

(中山大学信息科学与技术学院广州510275);(广东金融学院计算机系广州510520)

出版日期:2018-12-01 发布日期:2018-12-01
基金资助:
本文受国家自然科学基金(60573097,60773198,60703111),(广东省自然科学基金(05200302,06104916），广州市科技计划项目（2007Z3-D3071}，高等学校博十学科点专项科研基金(20050558017)，新世纪优秀人才支持计划(NCET-06-0727)资助。

Improved Text Retrieve Algorithm Based on Subject-verb-object Structure

HUANG Cheng-hui,YIN Jian,HOU Fang

Online:2018-12-01 Published:2018-12-01

摘要/Abstract

摘要： 在文本检索领域，当前广泛应用的方法或者是考察检索词项与被检索文本的词频信息，或者是考察检索词项与被检索文本的语义相似性。这些方法忽略了检索词项与被检索文本的结构信息，检索结果有一定的局限性。通过分析检索词项与被检索文本句子结构的主谓宾信息，进而考察主谓宾结构中词汇的语义相似性，最终实现对文本的语义检索。实验表明，该方法能够有效提高检索的查准率。

关键词: 文本信息检索，语义相似度，语法结构信息，检索算法

Abstract: In text retrieve area, popular methods either considered word frequency or semantic information between retrieve terms and text corpus. These methods ignore the semantic structure information of retrieve terms and text corpus,and then the good result limits to some domains. This paper analyzed the subject verb-object structure information of text, and then computed the similarity of the words where the words lie in the subject-verb-object structure, and final1y implemented semantic information retrieving of texts. The experiment shows that the approach could improve the precision effectively.

Key words: Text information retrieve, Semantic similarity, Syntax structure information, Retroeve algorithm

黄承慧,印鉴,侯昉. 一种基于主谓宾结构的文本检索算法[J]. 计算机科学, 2010, 37(9): 173-176. https://doi.org/

HUANG Cheng-hui,YIN Jian,HOU Fang. Improved Text Retrieve Algorithm Based on Subject-verb-object Structure[J]. Computer Science, 2010, 37(9): 173-176. https://doi.org/

参考文献

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed