Computer Science ›› 2015, Vol. 42 ›› Issue (Z11): 396-399.

Previous Articles     Next Articles

Improved DBSCAN Algorithm Based on MapReduce

LAI Li-ping, NIE Rui-hua, WANG Jiang-ping and HUANG Jia-hong   

  • Online:2018-11-14 Published:2018-11-14

Abstract: Aimed at solving DBSCAN’s problems of the Eps parameters and the efficiency of processing of massive data sets,the article put forward a new algorithm called OPDBSCAN.It uses overlapping partitions to get a local Eps for reducing the effect of global Eps,then uses MapReduce to cluster in parallel to improve the efficiency.At last,the experiment shows that the OPDBSCAN can cluster faster and better.

Key words: OPDBSCAN,MapReduce,Eps,K-dist,Overlap-partion

[1] 李爱国,厍向阳.数据挖掘原理、算法及应用[M].西安:西安电子科技大学出版社,2012:29-35
[2] Ekan Y J,Pallickara S.MapReduce for data intensive scientific analysis[C]∥eScience 2008:Proceedings of the Fourth IEEE,International Conference on eScience.Piscataway:IEEE Press,2008:277-284
[3] Dean J,Ghemawat S.MapReduce:Simplified data processing on large clusters [J].Communications of the ACM,2008,1(1):107-113
[4] Ester M,et al.A density based algorithm fordiscovering clusters in large spatial databases with noise[C]∥Proc of 2nd Inter Conf Knowledge Discovering in Databases and Data Mining (KDD-96).Portland:AAAI Press,1996
[5] 侯荣涛,朱斌.基于DBSCA聚类算法的闪电临近预报模型[J].计算机应用,2012,2(3):847-851
[6] 李莉平,沈俊媛.基于数据挖掘的DBSCAN算法及其应用[J].科技创业月刊,2009(8):134-135
[7] 黄毅磊.DBSCAN算法及在城市网格化管理中的应用[D].上海:上海交通大学,2010
[8] 夏鲁宁,荆继武.SA-DBSCAN:一种自适应基于密度聚类算法[J].中国科学院大学学报,2009(4):530-538
[9] 陈刚,刘秉权,吴岩.一种基于高斯分布的自适应DBSCAN算法[J].微电子学与计算机,2013,30(3):27-30
[10] 周水庚,周傲英,金文,等.FDBSCAN:一种快速 DBSCAN算法[J].软件学报,2000,1(6):735-744
[11] Ankerst M,Breunig M,Kriegel H-P,et al.Optics:Orderingpoints to Identify the Clustering Structure[C]∥Alex D,Christos F,Shahram G,eds.Proc ACM SIGMOD Int Conf on Mana-gement of Data.Philadephia:ACM Press,1999:49-60
[12] 周水庚,周傲英.基于数据分区的DBSCAN算法[J].计算机研究与发展,2000,37(10):1153-1159
[13] 孙凌燕.基于密度的聚类算法研究[D].太原:中北大学,2009
[14] 熊忠阳,吴林敏,张玉芳.针对非均匀数据集的DBSCAN过滤式改进算法[J].计算机应用研究,2009,6(10):3721-3723

No related articles found!
Full text



[1] . [J]. Computer Science, 2018, 1(1): 1 .
[2] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75, 88 .
[3] XIA Qing-xun and ZHUANG Yi. Remote Attestation Mechanism Based on Locality Principle[J]. Computer Science, 2018, 45(4): 148 -151, 162 .
[4] LI Bai-shen, LI Ling-zhi, SUN Yong and ZHU Yan-qin. Intranet Defense Algorithm Based on Pseudo Boosting Decision Tree[J]. Computer Science, 2018, 45(4): 157 -162 .
[5] WANG Huan, ZHANG Yun-feng and ZHANG Yan. Rapid Decision Method for Repairing Sequence Based on CFDs[J]. Computer Science, 2018, 45(3): 311 -316 .
[6] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[7] ZHANG Jia-nan and XIAO Ming-yu. Approximation Algorithm for Weighted Mixed Domination Problem[J]. Computer Science, 2018, 45(4): 83 -88 .
[8] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[9] LIU Qin. Study on Data Quality Based on Constraint in Computer Forensics[J]. Computer Science, 2018, 45(4): 169 -172 .
[10] ZHONG Fei and YANG Bin. License Plate Detection Based on Principal Component Analysis Network[J]. Computer Science, 2018, 45(3): 268 -273 .