Computer Science ›› 2026, Vol. 53 ›› Issue (6): 332-338.doi: 10.11896/jsjkx.250600056

• Database & Big Data & Data Science • Previous Articles     Next Articles

Fuzzy Three-way Clustering Based on Mean Shift

NI Yongting1, QIAN Jin1,2, YAN Shaowei1, WU Yueyang1   

  1. 1 School of Information and Software Engineering,East China Jiaotong University,Nanchang 330013,China
    2 School of Computer Science and Engineering,Jiangsu University of Science and Technology,Zhenjiang,Jiangsu 212003,China
  • Received:2025-06-09 Revised:2025-08-21 Online:2026-06-15 Published:2026-06-09
  • About author:NI Yongting,born in 2001,postgra-duate.Her main research interests include clustering analysis and granular computing.
    QIAN Jin,born in 1975,Ph.D,professor,is a member of CCF(No.06332S).His main research interests include big data mining,intelligent decision and privacy protection.
  • Supported by:
    National Natural Science Foundation of China(62466017,62066014) and Natural Science Foundation of Jiangxi Province,China(20232ACB202013).

Abstract: Mean Shift is a widely used density-based clustering method,but it is greatly affected by bandwidth selection,and the partitioning characteristics based on the number of visits may lead to low clustering accuracy.To address these issues,this paper proposes a fuzzy three-way clustering algorithm based on Mean Shift(FTWMS).By introducing a dynamic drift point selection strategy and a fuzzy membership-based division mechanism,the proposed method optimizes the clustering process of the tradi-tional Mean Shift algorithm.Firstly,the algorithm enhances the drift process by dynamically selecting drift points,which helps identify more reliable cluster centers.In addition,the fuzzy membership is calculated by integrating the distances from data points to different drift points,the visit frequencies of clusters,and the local densities of cluster.Based on this,each cluster is respectively divided into core and boundary regions,yielding the final three-way clustering structure.Finally,experiments are conducted on 6 synthetic datasets and 6 UCI datasets.The results show that the FTWMS algorithm outperforms traditional Mean Shift,K-means and three-way evidence theory-based density peak clustering(3W-PEDP) algorithms in clustering evaluation metrics,including ACC,NMI,and ARI.The algorithm also demonstrates superior ability in delineating cluster boundaries,offering better overall performance than the three comparison algorithms.

Key words: Data mining, Mean shift, Three-way clustering, Fuzzy membership, Local density

CLC Number: 

  • TP311
[1]ALI H H,KADHUM L E.K-means clustering algorithm applications in data mining and pattern recognition[J].International Journal of Science and Research,2017,6(8):1577-1584.
[2]ZHANG Q H,ZHOU J P,DAI Y Y,et al.Density Peaks Clustering Algorithm Based on Representative Points and Knearest Neighbors.[J] Ruan Jian Xue Bao/Journal of Software,2023,34(12):5629-5648
[3]IKOTUN A M,EZUGWU A E,ABUALIGAH L,et al.K-means clustering algorithms:A comprehensive review,variants analysis,and advances in the era of big data[J].Information Sciences,2023(622):178-210.
[4]BEZDEK J C,EHRLICH ROBERT,FULL W.FCM:The fuzzy c-means clustering algorithm[J].Computers & Geosciences,1984,10(2/3):191-203.
[5]KARYPIS G,HAN E H,KUMAR V.Chameleon:Hierarchical clustering using dynamic modeling[J].Computer,1999,32(8):68-75.
[6]LI B J,WU H M.Deep embedded clustering based on dirichlet variational autoencoder[J].Journal of Chongqing Technology and Business University(Natural Science Edition),2025,42(3):52-62.
[7]ESTER M,KRIGEL H P,SANDER J,et al.A Density-BasedAlgorithm for Discovering Clusters in Large Spatial Databases with Noise[C]//Proceedings of the 2nd International Confe-rence on Knowledge Discovery and Data Mining.Association for Computing Machinery,1996:226-231.
[8]RODRIGUEZ A,LAIO A.Clustering by fast search and find of density peaks[J],Science,2014,344(6191):1492-1496.
[9]FUKUNAGA K,HOSTETLER L.The estimation of the gradient of a density function,with applications in pattern recognition[J].IEEE Transactions on Information Theory,1975,21(1):32-40.
[10]CHENG Y Z.Mean shift,mode seeking,and clustering[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1995,17(8):790-799.
[11]CARREIRA-PERPINAN M A.Gaussian mean-shift is an EMalgorithm[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(5):767-776.
[12]WEN L Y,PANG K.Adaptive mean shift clustering algorithm based on cover tree[J].Computer Engineering and Design,2024,45(2):452-458.
[13]QIAN J TANG D W,HONG C X.Research on multi-granularity hierarchical sequential three-branch decision model[J].Journal of Shandong University(Science Edition),2022,57(9):33-45.
[14]YU H.A Framework of Three-way Cluster Analysis [C]//International Joint Conference on Rough Sets.Springer Nature,2017:300-312.
[15]WANG P X,YAO Y Y.CE3:A three-way clustering methodbased on mathematical morphology[J].Knowledge-based Systems,2018(155):54-65.
[16]JU H R,LU Y,DING W P,et al.Three-way evidence theory-based density peak clustering with the principle of justifiable granularity[J].Applied Soft Computing,2024(152):111217.
[17]YU H,CHEN Y,LINGRAS P,et al.A three-way cluster en-semble approach for large-scale data[J].International Journal of Approximate Reasoning,2019(11):32-49.
[18]XIONG J,YU H.An adaptive three-way clustering algorithmfor mixed-type data[C] // Foundations of Intelligent Systems:24th International Symposium(ISMIS).Springer Nature,2018:379-388.
[19]JIANG C M,ZHAO S B.Multi-granularity three-branch clustering ensemble based on shadow set[J].Journal of Electronics,2021,49(8):1524-1532.
[20]DU M J,ZHAO J Q,SUN J R,et al.M3W:Multistep three-way clustering[J].IEEE Transactions on Neural Networks and Learning Systems,2022,35(4): 5627-5640.
[21]WANG P X,YANG X B,DING W P,et al.Three-way clustering:Foundations,survey and challenges[J].Applied Soft Computing,2024(151):111131.
[22]CHENG D D,LI Y,XIA S Y,et al.A fast granular-ball-based density peaks clustering algorithm for large-scale data[J].IEEE Transactions on Neural Networks and Learning Systems,2023(35):22.
[23]WANG Y Z,QIAN J X,HASSAN M,et al.Density peak clustering algorithms:A review on the decade 2014-2023[J].Expert Systems with Applications,2024(238):121860.
[24]WU M,SCHÖLKOPF B.A local learning approach for clustering[J].https://proceedings.neurips.cc/paper_files/paper/2006/file/366f0bc7bd1d4bf414073cabbadfdfcd-Paper.pdf.
[25]VINH N X,EPPS J,BAILEY J.Information Theoretic Measures for Clusterings Comparison:Variants,Properties,Normalization and Correction for Chance[J].Journal of Machine Lear-ning Research,2010(11):2837-2854.
[26]FRÄNTI P,REZAEI M,ZHAO Q P.Centroid index:Clusterlevel similarity measure[J].Pattern Recognition,2014,47(9):3034-3045.
[1] DONG Guojun, CHENG Can, FANG Qing, YOU Lan, WANG Wei, PENG Qingxi. How to Filter Collaborative Development Projects from Open-source Communities--Exploratory Study on GitHub [J]. Computer Science, 2026, 53(6): 396-407.
[2] ZOU Xiaoyang, JU Hengrong, CAO Jinxin, MA Xingru, HUANG Jiashuang, DING Weiping. Overlapping Community Detection with Graph Regularized Fuzzy Autoencoder [J]. Computer Science, 2026, 53(3): 207-213.
[3] XI Penghui, WU Xiazhen, JIANG Wencong, FANG Liangda, HE Chaobo, GUAN Quanlong. Review of Personalized Educational Resource Recommendations [J]. Computer Science, 2026, 53(2): 1-15.
[4] ZHAI Jie, CHEN Lexuan, PANG Zhiyu. Survey on Graph Neural Network-based Methods for Academic Performance Prediction [J]. Computer Science, 2026, 53(2): 16-30.
[5] HU Xin, DUAN Jiangli, HUANG Denan. Concept Cognition for Knowledge Graphs by Mining Double Granularity Concept Characteristics [J]. Computer Science, 2025, 52(6A): 240800047-6.
[6] GU Huijie, FANG Wenchong, ZHOU Zhifeng, ZHU Wen, MA Guang, LI Yingchen. CSO-LSTM Based Power Prediction Method for New Energy Generation [J]. Computer Science, 2025, 52(6A): 240600053-11.
[7] XIONG Keqin, RUAN Sijie, YANG Qianyu, XU Changwei , YUAN Hanning. Mobility Data-driven Location Type Inference Based on Crowd Voting [J]. Computer Science, 2025, 52(3): 169-179.
[8] WANG Tianyi, LIN Youfang, GONG Letian, CHEN Wei, GUO Shengnan, WAN Huaiyu. Check-in Trajectory and User Linking Based on Natural Language Augmentation [J]. Computer Science, 2025, 52(2): 99-106.
[9] LIU Yuting, GU Jingjing, ZHOU Qiang. Urban Flow Prediction Method Based on Structural Causal Model [J]. Computer Science, 2025, 52(10): 70-78.
[10] KONG Lingchao, LIU Guozhu. Review of Outlier Detection Algorithms [J]. Computer Science, 2024, 51(8): 20-33.
[11] DONG Wanqing, ZHAO Zirong, LIAO Huimin, XIAO Hui, ZHANG Xiaoliang. Research and Implementation of Urban Traffic Accident Risk Prediction in Dynamic Road Network [J]. Computer Science, 2024, 51(6A): 230500118-10.
[12] JIANG Yanjie, DONG Chunhao, LIU Hui. Nonsense Variable Names Detection Method Based on Lexical Features and Data Mining [J]. Computer Science, 2024, 51(6): 23-33.
[13] XING Cunyuan, ZHANG Jie, JIN Ying. Discipline Competition Evaluation Model Based on Multi-attribute Comprehensive Evaluation [J]. Computer Science, 2024, 51(5): 21-26.
[14] BAO Kainan, ZHANG Junbo, SONG Li, LI Tianrui. ST-WaveMLP:Spatio-Temporal Global-aware Network for Traffic Flow Prediction [J]. Computer Science, 2024, 51(5): 27-34.
[15] CHEN Xinyang, CHEN Hanze, ZHOU Jiasheng, HUANG Jiaqing, YU Jiashuo, ZHU Longlong, ZHANG Dong. IntervalSketch:Approximate Statistical Method for Interval Items in Data Stream [J]. Computer Science, 2024, 51(4): 4-10.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!