Computer Science ›› 2015, Vol. 42 ›› Issue (6): 41-45.doi: 10.11896/j.issn.1002-137X.2015.06.009

Previous Articles     Next Articles

Semi-supervised Fuzzy Clustering Ensemble Approach with Data Correlation

FENG Chen-fei, YANG Yan, WANG Hong-jun, XU Ying-ge and WANG Tao   

  • Online:2018-11-14 Published:2018-11-14

Abstract: Semi-supervised clustering ensemble has emerged as a powerful machine learning paradigm that provides improved precision,robustness and stability by taking advantage of prior information,while most of them only consider the given pairwise constraints and do not consider the neighbors around the data points constrained in the ensemble step.In this paper,a semi-supervised fuzzy clustering ensemble with data correlation(SFCEDC)was proposed to overcome this defect.Firstly,an ensemble information matrix is built by primarily exploiting the results of semi-supervised fuzzy clustering and a similarity matrix is constructed by aggregating much information of the ensemble information matrix.And then this matrix is modified by using the given constraints and the neighbors around the data points constrained.Finally,a graph partitioning algorithm is employed to get the final clustering results.Experimental results on UCI datasets demonstrate that the proposed approach can improve clustering performance effectively.

Key words: Semi-supervised clustering ensemble,Fuzzy clustering,Pairwise constraints,Neighbors points

[1] Han J,Kamber M,Pei J.Data Mining Concepts and Techniques [M].Morgan Kaufmann Press,2012
[2] Wolpert D H,Macready W G.No free lunch theorems for search [R].Technical Report SFI-TR-9502010.Santa Fe Institute,1995
[3] Topchy A,Jain A K,Puneh W.Clustering ensembles:models of consensus and weak partition [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(12):1866-1881
[4] Strehl A,Ghosh J.Cluster ensembles:a knowledge reuse framework for combining multiple partitions [J].Journal of Machine Learning Research,2003,3(3):583-617
[5] 罗会兰,危辉.基于数学形态学的聚类集成算法[J].计算机科学,2010,37(8):214-218 Luo Hui-lan,Wei Hui.Clustering Enseble Algorithm Based on Mathematical Morphology[J].Computer Science,2010,37(8):214-218
[6] Zhou Zhi-hua.Ensemble Methods:Foundations and Algorithms [M].CRC Press,2012
[7] Iam-on N,Boongone T,Garrett S,et al.Link-based cluster ensemble approach for categorical data clustering [J].IEEE Transactions on Knowledge and Data Engineering,2012,24:413-425
[8] Naldi M C,Carvalho A C P L F,Campello R J G B.Cluster ensemble selection based on relative validity indexes [J].Data Mining and Knowledge Discovery,2013,27(2):259-289
[9] Abdala D D,Jiang X.An evidence accumulation approach to constrained clustering combination[C]//Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition.Leipzig,Germany,2009:361-371
[10] 王红军,李志蜀,戚建淮,等.基于贝叶斯网络的半监督聚类集成模型[J].软件学报,2010,21(11):2814-2825 Wang Hong-jun,Li Zhi-shu,Qi Jian-huai,et al.Semi-supervised Cluster Ensemble Model Based on Bayesian Network[J].Journal of Software,2010,21(11):2814-2825
[11] Iqbal A M,Moh’d A,Khan Z A.Semi-supervised clustering ensemble by voting [C]∥Proceeding of the International Confe-rence on Information and Communication System.Amman,Jordan,2009:1-5
[12] Yang Yan,Wang Hong-jun,Lin Chao,et al.Semi-supervisedClustering Ensemble Based on Multi-ant Colonies Algorithm [C]∥Rough Sets and Knowledge Technology 7th International Conference.Chengdu,China,2012:302-309
[13] Wagstaff K,Cardie C,Rogers S.Constrained k-means clustering with background knowledge [C]//Proceedings of the 18th International Conference on Machine Learning.San Francisco,CA,USA,2001:577-584
[14] Klein D,Kamvar S D,Manning C.From instance-level con-straints to space-level constraints:marking the most of prior knowledge in data clustering [C]∥Proceedings of the 19th International Conference on Machine Learning.San Francisco,CA,USA,2002:307-314
[15] Pedrycz W,Waletzky J.Fuzzy Clustering with Partial Supervi-sion [J].IEEE Transactions on Systems,Man,and Cybernetics,1997,27(5):787-795
[16] Karypis G,Kumar V.Multilevel K-Way partitioning scheme for irregular graphs [J].Journal of Parallel Distributed Computing,1998,41(2):278-300
[17] Ng A,Jordan M,Weiss Y.On Spectral Clustering:Analysis and an Algorithm[C]∥Advances in Neural Information Processing Systems.2001,14:849-856
[18] Blake C L,Merz C J.UCI repository of machine learning data-bases [EB/OL].2012-05-01[2012-12-01].http://archive.ics.uci.edu/ml
[19] Modha D,Spangler W S.Feature weighting in k-means clustering [J].Machine Learning,2003,52(3):217-237

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!