计算机科学 ›› 2015, Vol. 42 ›› Issue (9): 268-271.doi: 10.11896/j.issn.1002-137X.2015.09.052

• 人工智能 • 上一篇    下一篇

广义洛伦兹内核函数在模糊C均值聚类中的应用研究

王建华,李晓峰,高巍巍   

  1. 哈尔滨师范大学 哈尔滨150025,黑龙江外国语学院信息科学系 哈尔滨150025,黑龙江外国语学院信息科学系 哈尔滨150025
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受黑龙江省智能教育与信息工程重点实验室开放基金项目(1155xnc107),黑龙江省教育厅科学技术研究项目(12543067)资助

Research on Generalized Lorenz Kernel Function in Fuzzy C Means Clustering

WANG Jian-hua, LI Xiao-feng and GAO Wei-wei   

  • Online:2018-11-14 Published:2018-11-14

摘要: 模糊C均值(FCM)算法是数据聚类分析的主要算法。但在嘈杂环境下,对于抽样大小不一的聚类,数目越多准确性越低,上述弊端可通过替代性FCM(AFCM)的高斯内核映射来解决。鉴于AFCM的不足,提出了针对模糊C均值聚类的广义洛伦兹内核函数。利用该算法对鸢尾数据库进行聚类,将其划分成山鸢尾、变色鸢尾和维吉尼亚鸢尾3类。实验结果表明,广义洛伦兹模糊C均值(GLFCM)可实现对离群聚类和大小不等的聚类数据的分类,其结果优于K均值、FCM、替代性C均值(AFCM)、Gustafson-Kessel(GK)和 Gath-Geva(GG)方法,收敛迭代次数比AFCM的更少,其分区索引(SC)效果也好于其他方法。

关键词: 广义洛伦兹隶属函数,K均值,替代性模糊C均值,聚类,离群聚类

Abstract: Fuzzy C means(FCM) algorithm is the main algorithm for data clustering analysis.But in a noisy environment,for the clusters of different sampling sizes,accuracy is low when the number of clusters is large.The above disadvantages can be sloved through the Gauss kernel mapping of alternative FCM(AFCM) .This paper proposed generalized Lorenz kernel function to the fuzzy C means clustering for the deficiency of AFCM. This algorithm was used to analyze the Iris database cluster,to classify the Iris database into three clusters of Iris setosa,Iris versicolour and Iris virginica.Experimental results show that the generalized lorentzian fuzzy C-means(GLFCM) can classify data of outliers and un-equal sized clusters.The GLFCM yields better cluster than K-means(KM),FCM,alternative fuzzy C-means(AFCM),Gustafson-Kessel(GK) and Gath-Geva(GG).It takes less iteration than that of AFCM to converge.Its partition index(SC) is better than the others.

Key words: Generalized lorentzian membership function,K-means,Alternative fuzzy C-means,Clustering,Outlier clustering

[1] Kaufman L,Rousseeuw P.Finding Groups in Data[M].Wiley Series in Probability and Statistic,2005:56-67
[2] Mirkin B.Clustering for Data Mining:A Data Recovery Approach[M].Chapman and Hall,2005:12-24
[3] Wang Xiang,Guo Rui,et al.A Novel Alternative WeightedFuzzy C-means Algorithm and Cluster Validity Analysis [C]∥IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application.2008:130-134
[4] Hammerly G,Elkan C.Alternatives to the k-mean algorithm that find better clusterings[C]∥Proceedings of the 11th InternationalConference on Information and Knowledge Management,2002:600-607
[5] 郭小芳,李锋,宋晓宁,等.基于连续域混合蚁群优化的核模糊C-均值聚类算法研究[J].模式识别与人工智能,2014,7(9):841-846 Guo Xiao-fang,Li Feng,Song Xiao-ning,et al.Kernelized Fuzzy C-means Clusterling Algorithm Based on Hybrid Ant Colony Optimization for Continuons Domains[J].Pattern Recognition and Artificial Intelligence,2014,7(9):841-846
[6] 李广原,杨炳儒,刘英华,等.基于模糊论的数据挖掘研究综述[J].计算机工程与设计,2011,2(12):4064-4067 Li Guang-yuan,Yang Bing-ru,Liu Ying-hua,et al.Survey of data mining based on fuzzy set theory[J].Computer Engineering and Design,2011,2(12):4064-4067
[7] 李丽丽,李明,刘希玉.基于粒子群模糊C-均值聚类的图像分割算法[J].计算机工程与应用,2009,5(31):158-160 Li Li-li,Li Ming,Liu Xi-yu.Image segmentation algorithm based on particle swarm optimization fuzzy C-means clustering [J].Computer Engineering and Applications,2009,5(31):158-160
[8] Liu X,Yang C.Performance research of Gaussian functionweighted fuzzy C-means algorithm[C]∥Proceedings of SPIE.2007
[9] Yang M S,Tsai H S.A Gaussian kernel-based fuzzy c-means algotihm with a spatial bias correction[J].Pattern Recognition Letters,2008,29(12):1713-1725
[10] Ramathilagam S,Huang Yueh-min.Extended Gaussian kernelversion of fuzzy c-means in the problem of data analyzing[J].Expert Systems with Applications:An International Journal,2011,38(4):3793-3805

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!