Computer Science ›› 2016, Vol. 43 ›› Issue (12): 213-217.doi: 10.11896/j.issn.1002-137X.2016.12.039

Previous Articles     Next Articles

Clustering Algorithm Based on Relative Density and k-nearest Neighbors over Manifolds

GU Ling-lan and PENG Li-min   

  • Online:2018-12-01 Published:2018-12-01

Abstract: For the problem that traditional Euclidean distance similarity measure cannot fully reflect the distribution characteristics of the complicated data structure,a clustering algorithm based on relative density and k-nearest neighbors over manifolds was proposed.The manifold distance which describes the global consistency and the k-nearest neighbors concept that shows local similarity and affinity were introduced.Based on above descriptions,firstly,the similarity between two objects is measured through the k-nearest neighbors similarity over manifolds.Secondly,the cluster under different densities is found by adapting the relative uniformity of the k-nearest neighbors.Lastly,the k-nearest neighbor pair constraint rule is designed to search the nearest neighbor chain which is composed of the k-nearest data points,in order to classify data objects and identify outliers.Experimental results show that compared with traditional k-means clustering algorithm and the improved k-means clustering algorithm by manifold distance, the algorithm can effectively deal with the clustering problem for complicated data structure and achieve better clustering effect on artificial data sets and UCI public data sets.

Key words: Manifold distance,k-nearest neighbors over manifolds,k-nearest neighbors similarity,Relative density

[1] Jin Jian-guo.Review of clustering method[J].Computer Scie-nce,2014,1(11A):288-293(in Chinese) 金建国.聚类方法综述[J].计算机科学,2014,41(11A):288-293
[2] Huang X,Su W.An improved K-means clustering algorithm[J].Journal of Networks,2014,9(1):161-167
[3] Chen Y,Bruzzone L,Sun F,et al.A fuzzy-statistics-based affinity propagation technique for clustering in multispectral images[J].IEEE Transactions on Geoscience and Remote Sensing,2010,48(6):2647-2659
[4] Ng A Y,Jordan M I,Weiss Y.On spectral clustering:Analysis and an algorithm[J].Advances in Neural Information Processing Systems,2002(2):849-856
[5] Chapelle O,Zien A.Semi-supervised classification by low density separation[C]∥Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics.Savannah,Barbados:Pascal Press,2005:57-64
[6] Gong Mao-guo,Jiao Li-cheng,Ma Wen-ping,et al.Unsupervised classification and recognition using an artificial immune system based on manifold distance[J].Acta Automatica Sinica,2008,34(3):367-375(in Chinese) 公茂果,焦李成,马文萍,等.基于流形距离的人工免疫无监督分类与识别算法[J].自动化学报,2008,34(3):367-375
[7] Li Yang-yang,Shi Hong-zhu,Jiao Li-cheng,et al.Quantum-inspired evolutionary clustering algorithm based on manifold distance[J].Acta Automatic Sinica,2011,9(10):2343-2347(in Chinese) 李阳阳,石洪竺,焦李成,等.基于流形距离的量子进化聚类算法[J].电子学报,2011,39(10):2343-2347
[8] Yang Rui-rui,Niu Jian-qiang,Meng Hong-fei.Pavement crackextraction using iterative clustering algorithm based on manifold distance[J].Computer Engineering,2011,7(12):212-214(in Chinese) 杨瑞瑞,牛建强,孟红飞.基于流形距离的迭代聚类算法路面裂缝提取[J].计算机工程,2011,37(12):212-214
[9] Wang Li-min,Gao Xue-dong,Gong Yu,et al.Community structure detection algorithm based on relative density[J].Computer Engineering,2009,35(1):117-119(in Chinese) 王立敏,高学东,宫雨,等.基于相对密度的社团结构探测算法[J].计算机工程,2009,35(1):117-119
[10] Li Wen-jie,Li Wen-ming.Design and simulation of positioning method based on k-nearest neighbors algorithm[J].Computer Simulation,2009,26(4):194-196(in Chinese) 李文杰,李文明.基于k-近邻算法的定位方法设计和仿真[J].计算机仿真,2009,26(4):194-196
[11] Breunig M M,Kriegel H P,Ng R T,et al.LOF:identifying density-based local outliers[J].Acm Sigmod Record,2000,29(2):93-104
[12] Jin W,Tung A K H,Han J,et al.Ranking outliers using symmetric neighborhood relationship[M]//Advances in Knowledge Discovery and Data Mining.Springer Berlin Heidelberg,2006:577-593
[13] Tang J,Chen Z,Fu W C,et al.Enhancing effectiveness of outlier detections for low density patterns[C]∥Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining.Springer-Verlag,2002:535-548
[14] 严蔚敏,吴伟民.数据结构(C语言版)(第三版)[M].北京:清华大学出版社,2009:274-277,289
[15] Witten I H,Frank E,Hall M A.Data mining:practical machine learning tools and techniques(3rd Ed)[M].San Fransisco:Morgan Kaufmann Publishers,2011:175
[16] Li Yan-bo,Song Qiong,Guo Xin-chen.Artifical immune clustering semi-supervised algorithm based on manifold distance[J].Computer Science,2012,39(11):204-207(in Chinese) 李岩波,宋琼,郭新辰.基于流形距离的人工免疫半监督聚类算法[J].计算机科学,2012,9(11):204-207
[17] Garcia S,Derrac J,Cano J R,et al.Prototype selection for nearest neighbor classification:Taxonomy and empirical study[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,4(3):417-435

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!