Computer Science ›› 2010, Vol. 37 ›› Issue (11): 217-222.

• Artificial Intelligence •

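Split-Merge Based Clustering Algorithm Oriented to Structure Stability of Clusters

LEI Xiao-feng (雷小锋), HE Tao (何涛), LI Kui-ru (李奎儒), XIE Kun-qing (谢昆青), DING Shi-fei (丁世飞)

  1. (School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116); (National Key Laboratory of Vision and Hearing, School of Information Science and Technology, Peking University, Beijing 100871)
  • Online: 2018-12-01 Published: 2018-12-01
  • Supported by:
    This work was supported by the National High Technology Research and Development Program of China (863 Program, 2006AA12Z217) and the Science and Technology Foundation of China University of Mining and Technology (No. OD080313).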

Abstract: Clustering seeks the best partition of unlabeled observations under the assumption that the data have some group structure. Given a structure hypothesis and a prior, most existing clustering algorithms simply derive an optimal or near-optimal result by iteratively optimizing the fit to the sample distribution, which is referred to here as algorithm validity. In fact, clustering validity depends on three factors: structure validity, algorithm validity, and prior validity. Based on this observation, this paper proposes a clustering structure hypothesis built on a variant of the Gaussian mixture model, together with a measure and a method for judging the stability of a clustering structure. Assuming the algorithm itself is valid, the stability of the clustering structure is improved by splitting and merging individual clusters, which yields the final clustering result. The SMClus clustering algorithm was designed and implemented on this basis, and clustering experiments on simulated and real data illustrate the effectiveness of the method.

Key words: Clustering algorithm, Variant mixture model, Structure stability, Split-merge
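
The split-and-merge idea summarized in the abstract can be illustrated with a minimal sketch. The abstract does not define the SMClus stability measure, so everything below is an assumption made for illustration only: 1-D data, a split test based on the reduction in within-cluster sum of squared errors, a merge test based on the distance between cluster means, and the hypothetical names `two_means`, `sse`, `split_merge`, `split_gain`, and `merge_gap`. It is not the authors' algorithm.

```python
from statistics import mean


def two_means(points, iters=20):
    """Split a 1-D cluster in two with a tiny 2-means, seeded at min and max."""
    c0, c1 = min(points), max(points)
    left, right = list(points), []
    for _ in range(iters):
        left = [p for p in points if abs(p - c0) <= abs(p - c1)]
        right = [p for p in points if abs(p - c0) > abs(p - c1)]
        if not left or not right:
            break
        c0, c1 = mean(left), mean(right)
    return left, right


def sse(points):
    """Within-cluster sum of squared errors around the mean."""
    m = mean(points)
    return sum((p - m) ** 2 for p in points)


def split_merge(points, split_gain=0.7, merge_gap=1.0, max_rounds=10):
    """Alternate split and merge passes until nothing changes.

    Stand-in criteria (assumptions, not the paper's stability measure):
    split a cluster when 2-means removes at least `split_gain` of its SSE
    and the two halves are at least `merge_gap` apart; merge neighbouring
    clusters whose means are closer than `merge_gap`.
    """
    clusters = [sorted(points)]
    for _ in range(max_rounds):
        changed = False
        # Split pass: try to divide each cluster into two parts.
        after_split = []
        for c in clusters:
            if len(c) >= 4:
                a, b = two_means(c)
                if (a and b
                        and sse(a) + sse(b) < (1 - split_gain) * sse(c)
                        and abs(mean(b) - mean(a)) >= merge_gap):
                    after_split += [a, b]
                    changed = True
                    continue
            after_split.append(c)
        clusters = sorted(after_split, key=mean)
        # Merge pass: join adjacent clusters with nearly equal means.
        merged = [clusters[0]]
        for c in clusters[1:]:
            if mean(c) - mean(merged[-1]) < merge_gap:
                merged[-1] = merged[-1] + c
                changed = True
            else:
                merged.append(c)
        clusters = merged
        if not changed:
            break
    return clusters


points = [1.0, 1.2, 0.8, 1.1, 5.0, 5.2, 4.9, 5.1, 9.0, 9.1, 8.9, 9.2]
for cluster in split_merge(points):
    print(cluster)
```

On this toy input the loop starts from a single cluster and ends with three groups of four points each. The extra gap condition in the split test keeps a split from being undone by the following merge pass, and `max_rounds` bounds the loop in any case.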
