计算机科学 ›› 2019, Vol. 46 ›› Issue (11): 247-250.doi: 10.11896/jsjkx.190800042

• 人工智能 • 上一篇    下一篇

基于云计算平台的仿生优化聚类数据挖掘算法

申燕萍1, 顾苏杭2, 郑丽霞3   

  1. (常州工业职业技术学院信息工程学院 江苏 常州213164)1
    (江南大学数字媒体学院 江苏 无锡214122)2
    (东南大学微电子学院 南京210096)3
  • 收稿日期:2019-08-09 出版日期:2019-11-15 发布日期:2019-11-14
  • 通讯作者: 申燕萍(1979-),硕士,讲师,CCF会员,主要研究方向为软件技术、数据挖掘,E-mail:59682819@qq.com
  • 作者简介:顾苏杭(1989-),博士,工程师,主要研究方向为机器学习、人工智能;郑丽霞(1979-),博士,讲师,主要研究方向为微电子。
  • 基金资助:
    本文受国家自然科学基金(青年基金)(61805036),国家自然科学基金面上项目( 61376029)资助。

Bionic Optimized Clustering Data Mining Algorithm Based on Cloud Computing Platform

SHEN Yan-ping1, GU Su-hang2, ZHENG Li-xia3   

  1. (School of Information Engineering,Changzhou Institute of Industry Technology,Changzhou,Jiangsu 213164,China)1
    (School of Digital Media,Jiangnan University,Wuxi,Jiangsu 214122,China)2
    (Microelectronics College,Southeast University,Nanjing 210096,China)3
  • Received:2019-08-09 Online:2019-11-15 Published:2019-11-14

摘要: 为了提高云计算平台数据挖掘的有效性以及数据聚类的性能,采用仿生优化算法与相似聚类相结合的方法来实现云计算平台数据聚类。在相似聚类的优化函数求解过程中,采用狼群优化算法,以头狼的位置来确定聚类中心点,从而实现类别中心点的优化与更新。文中分别采用PBM和DB聚类效果评价方法来对聚类效果进行检验,在满足预设评价标准的情况下,不断进行狼群优化和相似聚类计算,直到达到聚类指标要求为止。经过实验证明,相比一般聚类算法,狼群优化的聚类算法对数据量大且数据维度高的云计算平台数据聚类效果更好,收敛速度更快。

关键词: 仿生优化, 聚类, 狼群算法, 评价指标, 云计算平台

Abstract: In order to improve the validity of cloud computing platform data mining and the performance of data clustering,this paper combined bionic optimization algorithm with similar clustering to achieve cloud computing platform data clustering.In the process of solving the optimization function of similar clustering,wolf swarm optimization algorithm is used to locate the head wolf position to determine the cluster centers,so as to optimize and update the category centers.PBM and DB clustering effect evaluation methods were used to test the clustering effect,and wolf swarm optimization and similar clustering calculation were carried out continuously until the requirements of clustering index are met.Experiments results show that,compared with general clustering algorithms,wolf swarm optimization clustering algorithm has better clustering effect and faster convergence speed for cloud computing platform with large data volume and high data dimension.

Key words: Bionic optimization, Cloud computing platform, Clustering, Evaluation index, Wolf swarm algorithm

中图分类号: 

  • TP311
[1]CAI Q Q,CUI H G,TANG H.Big data mining analysis method based on cloud computing[C]∥Green Energy and Sustainable Development I:Proceedings of the International Conference on Green Energy and Sustainable Development (GESD 2017).2017.
[2]ZHAO Y P,XU S C.Data Hierarchical Clustering Based on Cloud Computing and Nonnegative Matrix Decomposition [J].Modern Electronic Technology,2018,41(5):56-60.(in Chinese)
赵艳萍,徐胜超.基于云计算与非负矩阵分解的数据分级聚类[J].现代电子技术,2018,41(5):56-60.
[3]XIE F W.Optimal analysis of path mining for discontinuous data in cloud computing environment[J].Computer Simulation,2018,35(6):438-441.(in Chinese)
谢辅雯.云计算环境下非连续数据路径挖掘优化分析[J].计算机仿真,2018,35(6):438-441.
[4]KARIMI M B,ISAZADEH A,RAHMANI A M.QoS-awareservice composition in cloud computing using data mining techniques and genetic algorithm[J].Journal of Supercomputing,2017,73(4):1-29.
[5]CAO S,AN J C.Otsu image segmentation method based on improved wolf swarm optimization algorithm [J].Microelectronics and Computer,2017,34(10):16-21.(in Chinese)
曹爽,安建成.改进狼群优化算法的Otsu图像分割法[J].微电子学与计算机,2017,34(10):16-21.
[6]GUPTA S,DEEP K.Random walk grey wolf optimizer for constrained engineering optimization problems[J].Computational Intelligence,2018,34(4):189-197.
[7]FARIS H,ALJARAH I,AL-BETAR M A,et al.Grey wolf optimizer:a review of recent variants and applications[J].Neural Computing & Applications,2018,30(2):413-435.
[8]WANG T,WANG Y,MENG L P.An improved wolf swarmsearch algorithm and its application in clustering [J].Computer Applications and Software,2016,33(12):257-263.
[9]WANG F Z.Automatic segmentation of FCM image based on wolf swarm optimization [J].Control Engineering,2018(9):1727-1732.
[10]LIU X J,YUAN J B,FENG F P.K-means clustering algorithm for data distribution in cloud computing environment [J].Minicomputer system,2017,38(4):712-715.
[11]CHEN G L,YAN F L,PAN J H.Incremental Density Fast Clustering Algorithms for High Complexity Dynamic Data in Cloud Computing Environment [J].Computer Science,2018,45(2):287-290.
[12]FENG L W,CHANG D X,DENG Y,et al.The latest clustering performance evaluation index with the farthest score [J].Journal of Intelligent Systems,2017,12(1):67-74.
[13]KARUNAMBIGAI M G,AKRAM M,SIVASANKAR S,et al.Clustering Algorithm for Intuitionistic Fuzzy Graphs[J].International Journal of Uncertainty,Fuzziness and Knowledge-Based Systems,2017,25(3):367-383.
[14]HAI M,ZHANG Y.Performance Comparison of Clustering Algorithms on Hai Mo,Zhang You.Spark Platform [J].Computer Science,2017,44(s1):414-418.
[15]GAO G S.Overview of unsupervised clustering methods for entity analysis [J].Computer Engineering and Applications,2018(7):11-19.
[16]CHEN X Q,ZHOU L J,LIU Y Z.Overview of Clustering Algorithms [J].Integrated Technology,2017,6(3):41-49.
[1] 鲁晨阳, 邓苏, 马武彬, 吴亚辉, 周浩浩.
基于分层抽样优化的面向异构客户端的联邦学习
Federated Learning Based on Stratified Sampling Optimization for Heterogeneous Clients
计算机科学, 2022, 49(9): 183-193. https://doi.org/10.11896/jsjkx.220500263
[2] 柴慧敏, 张勇, 方敏.
基于特征相似度聚类的空中目标分群方法
Aerial Target Grouping Method Based on Feature Similarity Clustering
计算机科学, 2022, 49(9): 70-75. https://doi.org/10.11896/jsjkx.210800203
[3] 鲁晨阳, 邓苏, 马武彬, 吴亚辉, 周浩浩.
基于DBSCAN聚类的集群联邦学习方法
Clustered Federated Learning Methods Based on DBSCAN Clustering
计算机科学, 2022, 49(6A): 232-237. https://doi.org/10.11896/jsjkx.211100059
[4] 郁舒昊, 周辉, 叶春杨, 王太正.
SDFA:基于多特征融合的船舶轨迹聚类方法研究
SDFA:Study on Ship Trajectory Clustering Method Based on Multi-feature Fusion
计算机科学, 2022, 49(6A): 256-260. https://doi.org/10.11896/jsjkx.211100253
[5] 毛森林, 夏镇, 耿新宇, 陈剑辉, 蒋宏霞.
基于密度敏感距离和模糊划分的改进FCM算法
FCM Algorithm Based on Density Sensitive Distance and Fuzzy Partition
计算机科学, 2022, 49(6A): 285-290. https://doi.org/10.11896/jsjkx.210700042
[6] 陈景年.
一种适于多分类问题的支持向量机加速方法
Acceleration of SVM for Multi-class Classification
计算机科学, 2022, 49(6A): 297-300. https://doi.org/10.11896/jsjkx.210400149
[7] 刘丽, 李仁发.
医疗CPS协作网络控制策略优化
Control Strategy Optimization of Medical CPS Cooperative Network
计算机科学, 2022, 49(6A): 39-43. https://doi.org/10.11896/jsjkx.210300230
[8] 陈佳舟, 赵熠波, 徐阳辉, 马骥, 金灵枫, 秦绪佳.
三维城市场景中的小物体检测
Small Object Detection in 3D Urban Scenes
计算机科学, 2022, 49(6): 238-244. https://doi.org/10.11896/jsjkx.210400174
[9] 邢云冰, 龙广玉, 胡春雨, 忽丽莎.
基于SVM的类别增量人体活动识别方法
Human Activity Recognition Method Based on Class Increment SVM
计算机科学, 2022, 49(5): 78-83. https://doi.org/10.11896/jsjkx.210400024
[10] 朱哲清, 耿海军, 钱宇华.
面向化学结构的线段聚类算法
Line-Segment Clustering Algorithm for Chemical Structure
计算机科学, 2022, 49(5): 113-119. https://doi.org/10.11896/jsjkx.210700131
[11] 张宇姣, 黄锐, 张福泉, 隋栋, 张虎.
基于菌群优化的近邻传播聚类算法研究
Study on Affinity Propagation Clustering Algorithm Based on Bacterial Flora Optimization
计算机科学, 2022, 49(5): 165-169. https://doi.org/10.11896/jsjkx.210800218
[12] 左园林, 龚月姣, 陈伟能.
成本受限条件下的社交网络影响最大化方法
Budget-aware Influence Maximization in Social Networks
计算机科学, 2022, 49(4): 100-109. https://doi.org/10.11896/jsjkx.210300228
[13] 韩洁, 陈俊芬, 李艳, 湛泽聪.
基于自注意力的自监督深度聚类算法
Self-supervised Deep Clustering Algorithm Based on Self-attention
计算机科学, 2022, 49(3): 134-143. https://doi.org/10.11896/jsjkx.210100001
[14] 杨旭华, 王磊, 叶蕾, 张端, 周艳波, 龙海霞.
基于节点相似性和网络嵌入的复杂网络社区发现算法
Complex Network Community Detection Algorithm Based on Node Similarity and Network Embedding
计算机科学, 2022, 49(3): 121-128. https://doi.org/10.11896/jsjkx.210200009
[15] 蒲实, 赵卫东.
一种面向动态科研网络的社区检测算法
Community Detection Algorithm for Dynamic Academic Network
计算机科学, 2022, 49(1): 89-94. https://doi.org/10.11896/jsjkx.210100023
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!