Computer Science ›› 2021, Vol. 48 ›› Issue (10): 197-203.doi: 10.11896/jsjkx.200900061

• Database & Big Data & Data Science • Previous Articles     Next Articles

Fuzzy Clustering Validity Index Combined with Multi-objective Optimization Algorithm and Its Application

CUI Guo-nan1, WANG Li-song1, KANG Jie-xiang2, GAO Zhong-jie2, WANG Hui2, YIN Wei2   

  1. 1 School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210000,China
    2 Software Department of China Aeronautical Radio Electronics Research Institute,Shanghai 200233,China
  • Received:2020-09-08 Revised:2021-03-22 Online:2021-10-15 Published:2021-10-18
  • About author:CUI Guo-nan,born in 1996,postgra-duate.His main research interests include multi-object optimization method and data mining.
    WANG Li-song,born in 1969,associate professor,is a member of China Computer Federation.His main research interests include avionics safety analysis,data management in distributed environments,formal methods and model-based safety analysis,and wireless sensor network.
  • Supported by:
    National Natural Science Foundation of China(J19K004).

Abstract: Fuzzy clustering method can analyze complex data sets more effectively.Because there are many kinds of fuzzy clustering algorithms and the clustering results will change with the number of input clusters,the results of fuzzy clustering algorithm are not accurate,so the number of fuzzy clustering k must be determined in order to obtain certain clustering results.At present,the existing research mainly uses a variety of fuzzy clustering effectiveness indexes to determine the optimal number of clusters k.However,fuzzy clustering indexes such as SSD,PBM will decrease monotonically with the increase of clustering number k,which makes it impossible to determine the optimal number of clusters k.Therefore,this paper proposes a fuzzy clustering validity index (OSACF) combined with a multi-objective optimization algorithm,which combines fuzzy clustering validity with a multi-objective optimization algorithm (MOEA) to solve the optimal number of clusters k problem.Different from using clustering validity index,OSACF establishes a bi-objective model between cluster number k and clustering validity index,and uses MOEA to optimize the bi-objective model to determine the optimal cluster number k,so as to avoid the influence of monotonous decreasing of clustering validity index.On the other hand,OSACF uses morphological similarity distance to replace the traditional Euclidean distance metric,which avoids the influence of cluster shape on the calculation of cluster k.The experimental results show that the optimal fuzzy cluster number k obtained by OSACF combined with MOEA is more accurate than the results obtained by the existing clustering effectiveness indicators.

Key words: Clustering validity index, Fuzzy clustering, Multi-objective optimization algorithm, Number of clusters k

CLC Number: 

  • TP302
[1]BEZDEK J C,EHRLICH R,FULL W.FCM:The fuzzy c-means clustering algorithm[J].Computers & Geosciences,1984,10(2/3):191-203.
[2]ZHANG P Z,ZHANG H Y.A Review of Features and Labels Dimensionality Reduction Methods of Multi Label Data[J].Journal of Chongqing Technology and Business University(Na-tural Science Edition),2020,37(5):23-29.
[3]WANG Z H,WANG S Y,DU H.Improved Fuzzy C-meansClustering Algorithm Based on Density-Sensitive Distance[J].Computer Engineering,2021,47(5):88-96,103.
[4]GAN G,MA C,WU J.Data clustering:theory,algorithms,and applications[M].Society for Industrial and Applied Mathema-tics,2020.
[5]MATHER P,TSO B.Classification methods for remotely sensed data[M].CRC Press,2016.
[6]CUI H,ZHANG K,FANG Y,et al.A clustering validity index based on pairing frequency[J].IEEE Access,2017,5:24884-24894.
[7]VAIDYA J,SHAFIQ B,BASU A,et al.Differentially privatenaive bayes classification[C]//2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).IEEE,2013,1:571-576.
[8]BEZDEK J C.Numerical taxonomy with fuzzy sets[J].Journal of Mathematical Biology,1974,1(1):57-71.
[9]BEZDEK J C.Cluster validity with fuzzy sets[J].Journal of Cybernetics,1973,3:58-73.
[10]XIE X L,BENI G.A validity measure for fuzzy clustering[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1991,13(8):841-847.
[11]FUKUYAMA Y.A new method of choosing the number ofclusters for the fuzzy c-mean method[C]//Proc.5th Fuzzy Syst.Symp.,1989.1989:247-250.
[12]ZAHID N,LIMOURI M,ESSAID A.A new cluster-validity for fuzzy clustering[J].Pattern Recognition,1999,32(7):1089-1097.
[13]PAKHIRA M K,BANDYOPADHYAY S,MAULIK U.Vali-dity index for crisp and fuzzy clusters[J].Pattern Recognition,2004,37(3):487-501.
[14]PAKHIRA M K,BANDYOPADHYAY S,MAULIK U.Astudy of some fuzzy cluster validity indices,genetic clustering and application to pixel classification[J].Fuzzy Sets and Systems,2005,155(2):191-214.
[15]WANG R,LAI S,WU G,et al.Multi-clustering via evolutionary multi-objective optimization[J].Information Sciences,2018,450:128-140.
[16]ZHANG Y,WANG W,ZHANG X,et al.A cluster validity index for fuzzy clustering[J].Information Sciences,2008,178(4):1205-1218.
[17]WU K L,YANG M S.A cluster validity index for fuzzy clustering[J].Pattern Recognition Letters,2005,26(9):1275-1291.
[18]MIRJALILI S,JANGIR P,SAREMI S.Multi-objective ant lion optimizer:a multi-objective optimization algorithm for solving engineering problems[J].Applied Intelligence,2017,46(1):79-95.
[19]YANG X S.Nature-inspired optimization algorithms[M].Academic Press,2020.
[20]GUO Y,WENG G.K-means++ clustering-based active contour model for fast image segmentation[J].Journal of Electronic Imaging,2018,27(6):063013.
[21]LI Z,YUAN J,ZHANG W.Fuzzy C-mean algorithm with morphology similarity distance[C]//2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.IEEE,2009,3:90-94.
[22]YANG S,LI K,LIANG Z,et al.A novel cluster validity index for fuzzy c-means algorithm[J].Soft Computing,2018,22(6):1921-1931.
[23]CAI X,MEI Z,FAN Z.A decomposition-based many-objective evolutionary algorithm with two types of adjustments for direction vectors[J].IEEE Transactions on Cybernetics,2017,48(8):2335-2348.
[24]WANG L,CUI G,ZHOU Q,et al.A multi-clustering method based on evolutionary multiobjective optimization with grid decomposition[J].Swarm and Evolutionary Computation,2020,55:100691.
[25]REZAEE B.A cluster validity index for fuzzy clustering[J].Fuzzy Sets and Systems,2010,161(23):3014-3025.
[26]DAVIES D L,BOULDIN D W.A cluster separation measure[J].IEEE transactions on Pattern Analysis and Machine Intelligence,1979(2):224-227.
[1] ZHANG Ya-di, SUN Yue, LIU Feng, ZHU Er-zhou. Study on Density Parameter and Center-Replacement Combined K-means and New Clustering Validity Index [J]. Computer Science, 2022, 49(1): 121-132.
[2] QIAO Ying-jing, GAO Bao-lu, SHI Rui-xue, LIU Xuan, WANG Zhao-hui. Improved FCM Brain MRI Image Segmentation Algorithm Based on Tamura Texture Feature [J]. Computer Science, 2021, 48(8): 111-117.
[3] SHAO Chao and MA Jin-Jia. Selective Clustering Ensemble Based on Xie-Beni Index [J]. Computer Science, 2020, 47(6A): 457-460.
[4] ZHANG Tian-zhu, ZOU Cheng-ming. Study on Image Classification of Capsule Network Using Fuzzy Clustering [J]. Computer Science, 2019, 46(12): 279-285.
[5] JI Jin-chao, ZHAO Xiao-wei, HE Fei, HU Ying-hui, BAI Tian and LI Zai-rong. Fuzzy Weighted Clustering Algorithm with Fuzzy Centroid for Mixed Data [J]. Computer Science, 2018, 45(2): 109-113.
[6] XING Rui-kang, LI Cheng-hai. Research on Intrusion Detection System Method Based on Intuitionistic Fuzzy Sets [J]. Computer Science, 2018, 45(11A): 344-348.
[7] ZHENG Qi-bin, DIAO Xing-chun and CAO Jian-jun. Fuzzy Clustering Algorithm for Incomplete Data Considering Missing Pattern [J]. Computer Science, 2017, 44(12): 58-63.
[8] GENG Zong-ke, WANG Chang-bin and ZHANG Zhen-guo. Fuzzy c-means and Adaptive PSO Based Fuzzy Clustering Algorithm [J]. Computer Science, 2016, 43(8): 267-272.
[9] CHEN Xiao-dong, SUN Li-juan, HAN Chong and GUO Jian. Detecting Concept Drift of Data Stream Based on Fuzzy Clustering [J]. Computer Science, 2016, 43(4): 219-223.
[10] LI Chun-xin and PENG Ren-can. Rapid Visualization Method Based on 3D Delaunay Triangulation [J]. Computer Science, 2015, 42(Z6): 236-237.
[11] LIU Meng-jiao and WU Cheng-mao. Research on Improved Local Fuzzy C-means Clustering Segmentation Algorithm [J]. Computer Science, 2015, 42(Z6): 190-194.
[12] WU Da-qing, ZHENG Jian-guo, ZHU Jia-jun and SUN Li. Dynamic Multi-objective Particle Swarm Optimization Algorithm Based on Human Social Behavior [J]. Computer Science, 2015, 42(8): 249-252.
[13] FENG Chen-fei, YANG Yan, WANG Hong-jun, XU Ying-ge and WANG Tao. Semi-supervised Fuzzy Clustering Ensemble Approach with Data Correlation [J]. Computer Science, 2015, 42(6): 41-45.
[14] GUO Hua-feng, ZHAO Jian-min and PAN Xiu-qiang. Adapting Suppressed Fuzzy C-regression Models Algorithm [J]. Computer Science, 2015, 42(2): 274-276.
[15] DONG Shi-long,CHEN Ning-jiang,TAN Ying,HE Zi-long and ZHU Li-rong. Optimization of Cluster Resource Fuzzy Clustering Partition Algorithm for Cloud Computing [J]. Computer Science, 2014, 41(9): 104-109.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!