计算机科学 ›› 2016, Vol. 43 ›› Issue (Z11): 368-372.doi: 10.11896/j.issn.1002-137X.2016.11A.085

• 信息安全 • 上一篇    下一篇

基于差分隐私保护的KDCK-medoids动态聚类算法

马银方,张琳   

  1. 南京邮电大学计算机学院 南京210003,南京邮电大学计算机学院 南京210003
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(61402241,61572260,61373017,61572261,61472192),江苏省科技支撑计划(BE2015702)资助

KDCK-medoids Dynamic Clustering Algorithm Based on Differential Privacy

MA Yin-fang and ZHANG Lin   

  • Online:2018-12-01 Published:2018-12-01

摘要: K-medoids算法对初始中心点敏感,不能有效地对动态数据进行聚类,且需要对相关的隐私数据进行保护。针对这些问题,提出了基于差分隐私保护的KDCK-medoids动态聚类算法。该算法在采用差分隐私保护技术的基础上将KD-树优化选取出的k个聚类中心和增量数据相结合建立新的KD-树,然后采用近邻搜索策略将增量数据分配到与其相应的聚类簇中,从而完成最终的动态聚类。通过实验分别对小数据集和多维的大数据集的聚类准确率及运行时间进行了分析,同时也对采用差分隐私保护技术的KDCK-medoids算法在不同数据集上的有效性进行了评估。实验结果表明,基于差分隐私保护的KDCK-medoids动态聚类算法能够在实现隐私保护的同时快速高效地处理增量数据的动态聚类问题。

关键词: KD-树,K-medoids聚类算法,差分隐私,动态聚类

Abstract: The traditional K-medoids clustering algorithm is sensitive to the initial center points,can’t effectively deal with dynamic data clustering,and needs privacy protection for private data.Therefore,this paper proposed the KDCK-medoids dynamic clustering algorithm.It establishes a new KD-tree using kth rectangular units optimally selected by KD-tree and incremental data based on differential privacy protection technologies,and then distributes the incremental data into the corresponding clusters by using the neighbor search strategy,and then completes the dynamic clustering.Through experiments on small data sets and multi-dimensional large data sets,clustering accuracy and running time are analyzed.And the effectiveness of the algorithm is evaluated.The experimental results indicate that the KDCK-medoids dynamic clustering based on differential privacy protection can realize privacy protection meanwhile quickly and efficiently process the dynamic clustering of incremental data problem.

Key words: KD-tree,K-medoids clustering algorithm,Differential privacy,Dynamic clustering

[1] 夏宁霞,苏一丹,覃希.一种高效的K-medoids聚类算法[J].计算机应用研究,2010,7(12):4517-4519
[2] Sabzi A,Farjami Y,ZiHayat M.An improved fuzzy k-medoids clustering algorithm with optimized number of clusters[C]∥Proceedings of the 11th International Conference on Hybrid Intelligent Systems.IEEE,2011:206-210
[3] 孟颖,罗可,刘建华,等.一种基于差分演化的K-medoids聚类算法[J].计算机应用研究,2012,9(5):1651-1653
[4] Zhu Y T,Wang F Z,Shan X H,et al.K-medoids clusteringbased on MapReduce and optimal search of medoids[C]∥Proceedings of the 9th International Conference on Computer Science and Education.IEEE,2014:573-577
[5] 谢娟英,高瑞.方差优化初始中心的K-medoids聚类算法[J].计算机科学与探索,2015,9(8):973-984

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!