计算机科学 ›› 2010, Vol. 37 ›› Issue (9): 222-224.

• 人工智能 • 上一篇    下一篇

一种基于本体相似度计算的文本聚类算法研究

王刚,钟国祥   

  1. (安康学院电子与信息工程系 安康725000);(重庆教育学院科技处 重庆400067)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受陕西省教育厅项目(09JK317),急于本体的服务研究(AYQDZR200916) ,智能信息处理技术关键问题及应用研究(2008akxy005)资助。

Study on Text Clustering Algorithm Based on Similarity Measurement of Ontology

WANG Gang,ZHONG Guo-xiang   

  • Online:2018-12-01 Published:2018-12-01

摘要: 为了改善文本聚类的质量,得到满意的聚类结果,针对文本聚类缺少涉及概念的内涵及概念间的联系,提出了一种基于本体相似度计算的文本聚类算法TC130(Text Clustering 13ascd on Ontology)。该算法把文档用本体来刻画,以便描述概念的内涵及概念间的联系。设计和改进了文本相似度计算算法,应用本体的语义相似度来度量文档间相近程度,设计了具体的根据相似度进行文本聚类的算法。实验证明,该方法从聚类的准确性和聚类的关联度方面改善了聚类质量。

关键词: 本体,相似度,文本聚类,语义

Abstract: To improve the quality of text clustering and get the satisfactory clustering results,we proposed a text clustering based on similarity of ontology. I3y organizing text as ontology, we were easy to represent the meanings and relalions of concepts. We designed and improved the measurement of similarity and measured the text similarity by similarity of text ontology, we designed the algorithm of text clustering based on similarity. Experiments show that our method can avoid using the term isolation and high-dimensional, and can improve the clustering quality in correction degree and association degree.

Key words: Ontology, Similarity, Text clustering, Semantic

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!