计算机科学 ›› 2014, Vol. 41 ›› Issue (2): 240-244.
王丽娟,郝志峰,蔡瑞初,温雯
WANG Li-juan,HAO Zhi-feng,CAI Rui-chu and WEN Wen
摘要: 提出基于随机初始化、参数扰动和特征子集映射的多扰动的局部自适应软子空间聚类(LAC)融合算法(MLACE)。MLACE具有以下特点:(i)多扰动融合:从初始化、参数和特征子集等不同侧面,探测数据内部结构,使之相互融合,从而达到改善聚类正确性的目的;(ii)融合信息提升:根据LAC算法输出的子空间权重矩阵,定义数据属于每一类的概率,形成提升的融合信息;(iii)融合一致性函数改进:融合信息的形式由0/1二值信息转换成[0,1]实值信息,因此,一致性函数采用了性能较优的实数值融合算法Fast global K-means来进一步改善融合正确性。实验选取2个仿真数据库和5个UCI数据库测试MLACE的聚类正确性,实验结果表明,MLACE聚类正确性优于K-means、LAC、基于参数扰动LAC融合算法(P-MLACE)。
[1] Kriegel H P,Kroger P,Zimek A.Clustering High-Dimensional Data:A Survey on Subspace Clustering,Pattern-Based Clustering,and Correlation Clustering [J].ACM Transactions on Knowledge Discovery from Data,2009,3(1):1-58 [2] Parsons L,Haque E,Liu H.Subspace Clustering for High Dimensional Data:A Review [J].ACM SIGKDD Explorations New-sletter-Special issue on learning from imbalanaced datasets,2004,6(1):90-105 [3] Huang J Z.Automated variable weighting in K-means type clustering [J].IEEE Trans.on Pattern Analysis and Machine Intelligence,2005,27(5):657-668 [4] Gan G,Wu J.A convergence theorem for the fuzzy subspaceclustering (FSC) algorithm [J].Pattern Recognition,2008,41(6):1939-1947 [5] Domeniconi C.Locally adaptive metrics fore clustering high dimensional data [J].Data mining knowledge discovery,2007,14:63-97 [6] Jing L P.An entropy weighting K-means algorithm for subspace clustering of high dimensional sparse data [J].IEEE Trans.on Knowledge and Data Engineering,2007,19(8):1026-1041 |
No related articles found! |
|