计算机科学 ›› 2015, Vol. 42 ›› Issue (3): 195-200.doi: 10.11896/j.issn.1002-137X.2015.03.040

• 人工智能 • 上一篇    下一篇

一种近邻传播的层次优化算法

倪志伟,荆婷婷,倪丽萍   

  1. 合肥工业大学管理学院 合肥230009 过程优化与智能决策教育部重点实验室 合肥230009,合肥工业大学管理学院 合肥230009 过程优化与智能决策教育部重点实验室 合肥230009,合肥工业大学管理学院 合肥230009 过程优化与智能决策教育部重点实验室 合肥230009
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金(71271071,71301041),国家“863”云制造主题项目(2011AA040501)资助

Affinity Propagation Hierarchical Optimization Algorithm

NI Zhi-wei, JING Ting-ting and NI Li-ping   

  • Online:2018-11-14 Published:2018-11-14

摘要: 近邻传播算法是一种新的聚类算法,在许多领域有较好的应用。近邻传播算法倾向于生成多于真实数目的类,且先验值P对该算法结果优劣有很大影响。故提出了一种有效的近邻传播的层次优化算法——CAP算法。CAP算法利用CURE算法对近邻传播算法的结果进行优化,是一种半监督的聚类算法。在5个UCI数据集上进行了实验验证,结果显示该算法均取得比近邻传播算法更好的聚类结果质量且使得生成的类的个数更接近真实类个数;同时 与K-means、Spectral、CURE算法进行比较,结果表明CAP算法能取得更优的结果。

关键词: 近邻传播算法,CURE算法,层次优化,先验值

Abstract: Affinity propagation (AP) clustering algorithm is a new clustering algorithm,and it is used in many fields well.Affinity propagation clustering algorithm tends to generate more classes than the real data sets.P has a great influence on the result.So this paper proposed an effective affinity propagation clustering’s hierarchical optimization algorithm called as CAP.CAP algorithm uses the CURE algorithm to optimize the result of AP algorithm,and CAP is a semi-supervised clustering algorithm.The result of experiment on five UCI data sets shows that CAP algorithm achieves higher quality than AP algorithm and the number of classes is much closer to the real number.At the same time,CAP also achieves much better clustering result than K-means,Spectral and CURE.

Key words: Affinity propagation algorithm,CURE algorithm,Hierarchical optimization,Priori value

[1] Jain A K.Data clustering:50 years beyond K-means[J].Pattern Recognition Letters,2010,31(8):651-666
[2] Shang Fan-hua,Jiao L C,Shi Jia-rong,et al.Fast affinity propagation clustering:A multilevel approach[J].Pattern Recognition,2012,5(2012):474-486
[3] Vlasblom J,Ahoshana J W.Markov clustering versus affinitypropagation for thpartitioning of protein interaction graphs[J].BMC Bioinformatics,2009(10):99-113
[4] Paccanaro A,Casbon J A,Saqi M A S.Spectral clustering ofprotein sequences[J].Nucleic Acids Research,2006,4(5):1571-1580
[5] Wang Kai-jun,Zhang Jun-ying,Li Dan,et al.Adaptive affinity propagation clustering[J].Acta Automatica Sinica,2007,3(12):1242-1246
[6] Karen K.Affinity program slashes computing times.2007-10-25.http://www.news.utoronto.ca/bin6/070215-2952.asp
[7] Bodenhofer U,Kothmeier A,Hochreiter S.APCluster:an Rpackage for affinity propagation clustering[J].Bioinformatics,2011,27(17):2463-2464
[8] Guha S,Rastogi R,Shim K.CURE:an efficient clustering algorithm for large databases[J].ACM SIGMOD Record,ACM,1998,27(2):73-84
[9] Ertoz L,Steinbach M,Kumar V.A new shared nearest neighborclustering algorithm and its applications[C]∥Workshop on Clustering High Dimensional Data and its Applications at 2nd SIAM International Conference on Data Mining.2002:105-115
[10] J Fray B J,Dueck D.Clustering by Passing Messages Between Data Points[J].Science,2007,315(5814):972-976
[11] Hsu D,Kakade S M,Zhang Tong.A spectral algorithm forlearning hidden Markov models[J].Journal of Computer and System Sciences,2012,78(5):1460-1480
[12] Powers D M W.Evaluation:from precision,recall and F-measure to ROC,informedness,markedness & correlation[J].Journal of Machine Learning Technologies,2011,2(1):37-63
[13] Rawashdeh M,Ralescu A.Crisp and fuzzy cluster validity:ge-neralized intra-inter silhouette index[C]∥2012Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS).IEEE,2012:1-6
[14] Guan R,Shi X,Marchese M,et al.Text clustering with seeds affinity propagation[J].IEEE Transactions on Knowledge and Data Engineering,2011,23(4):627-637
[15] Kazantseva A,Szpakowicz S.Linear text segmentation using affinity propagation[C]∥Proceedings of the Conference on Empirical Methods in Natural Language Processing.Association for Computational Linguistics,2011:284-293
[16] Patidar A K,Agrawal J,Mishra N.Analysis of Different Similarity Measure Functions and their Impacts on Shared Nearest Neighbor Clustering Approach[J].International Journal of Computer Applications,2012,40(16):1-5
[17] Lee S S,Lin J C.An accelerated K-means clustering algorithm using selection and erasure rules[J].Journal of Zhejiang University SCIENCE C,2012,13(10):761-768
[18] 鲁伟明,杜晨阳,魏宝刚,等.基于MapReduce的分布式近邻传播聚类算法[J].计算机研究与发展,2012,9(8):1762-1772
[19] 唐东明,朱清新,杨凡,等.一种有效的蛋白质序列聚类分析方法[J].软件学报,2011,2(8):1872-1837
[20] 沈洁,赵雷,杨季文,等.一种基于划分的层次聚类算法[J].计算机工程与应用,2007,43(31):175-177
[21] 王开军,李健,张军英,等.半监督的仿射传播聚类[J].计算机工程,2007,33(23):197-198
[22] 邢艳,周勇.基于互近邻一致性的近邻传播算法[J].计算机应用研究,2012,9(7):2524-2526

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!