计算机科学 ›› 2016, Vol. 43 ›› Issue (4): 197-201.doi: 10.11896/j.issn.1002-137X.2016.04.040

• 人工智能 • 上一篇    下一篇

引入时频聚集交叉项干扰抑制的大数据聚类算法

胡先兵,赵国庆   

  1. 国家计算机网络应急技术处理协调中心 北京100029,北京交通大学电子信息工程学院 北京100044
  • 出版日期:2018-12-01 发布日期:2018-12-01

Large Data Clustering Algorithm Introducing Time and Frequency Clustering Interference Suppression

HU Xian-bing and ZHAO Guo-qing   

  • Online:2018-12-01 Published:2018-12-01

摘要: 面向大数据集管理的数据聚类方法研究在模式识别、故障诊断和数据挖掘等领域具有重要的研究意义。传统的大数据聚类算法采用混合差分进化的粒子群算法,因数据信息流分量之间的交叉作用而出现的类间交叉项干扰影响了聚类分量的正确判断,聚类效果不好。提出了一种基于时频聚集交叉项干扰抑制的大数据聚类算法。在面向传播学视域下物联网大数据库中生成大数据聚类的信息特征向量,对任意两个分簇矢量进行近邻样本的隶属度训练,在时间滑动窗口模型中进行信息调度,采用高频分量抑制方法实现对时频聚集交叉项的干扰抑制,通过频域卷积相似度融合处理,采用粒子群优化算法进行聚类适应度计算,以实现数据聚类算法改进。仿真结果表明,采用该算法进行大数据聚类,具有较好的抗干扰性和自适应性,聚类准确度较高。

关键词: 大数据,聚类,粒子群,干扰抑制

Abstract: The data clustering method of large data sets has important research significance in pattern recognition,fault diagnosis,data mining and so on.Traditional data clustering algorithms use hybrid differential evolution particle swarm optimization algorithm,but because the cross term interference caused by interactions between components of data flow has an impact on the correct judgement,the clustering effect is not good.A large data clustering algorithm was proposed based on the time-frequency clustering algorithm.In the large database of Internet of things from the perspective of communication,big data clustering feature vector is generated.Arbitrary two cluster vectors are trained of membership grade about neighbor sample,and information is scheduled in time sliding window model.High frequency component suppression method is used to conduct interference suppression of time-frequency clustering interactions.By similarity fusion processing frequency domain convolution and using particle swarm optimization algorithm to do cluster fitness calculation,we improved the data clustering algorithm.The simulation results show that the algorithm has good anti-interference and self-adaptive performance,and it has high accuracy.

Key words: Large data,Clustering,Particle swarm,Interference suppression

[1] Wen Tian-zhu,Xu Ai-qiang,Cheng Gong.Multi-fault diagnosismethod based on improved ENN2 clustering algorithm[J].Control and Decision,2015,30(6):1021-1026(in Chinese) 文天柱,许爱强,程恭.基于改进ENN2聚类算法的多故障诊断方法[J].控制与决策,2015,30(6):1021-1026
[2] Xing Chang-zheng,Liu Jian.Evolutionary data stream clustering algorithm based on integration of affinity propagation and density[J].Journal of Computer Applications,2015,35(7):1927-1932(in Chinese) 邢长征,刘剑.基于近邻传播与密度相融合的进化数据流聚类算法[J].计算机应用,2015,35(7):1927-1932
[3] Tao Xin-min,Song Shao-yu,Cao Pan-dong,et al.A Spectral Clus-tering Algorithm Based on Manifold Distance Kernel[J].Information and Control,2012(3):307-313(in Chinese) 陶新民,宋少宇,曹盼东,等.一种基于流形距离核的谱聚类算法[J].信息与控制,2012(3):307-313
[4] Tian Gang,He Ke-qing,Wang Jian,et al.Domain-Oriented and Tag-Aided Web Service Clustering Method[J].Chinese Journal of Electronics,2015,43(7):1266-1274(in Chinese) 田刚,何克清,王健,等.面向领域标签辅助的服务聚类方法[J].电子学报,2015,43(7):1266-1274
[5] Wu Tao,Chen Li-fei,Guo Gong-de.High-dimensional data clustering algorithm with subspace optimization[J].Journal of Computer Applications,2014,34(8):2279-2284(in Chinese) 吴涛,陈黎飞,郭躬德.优化子空间的高维聚类算法[J].计算机应用,2014,34(8):2279-2284
[6] Xin Yu,Yang Jing,Tang Chu-heng,et al.An Overlapping Semantic Community Detection Algorithm Based on Local Semantic Cluster[J].Journal of Computer Research and Development,2015,52(7):1510-1521(in Chinese) 辛宇,杨静,汤楚蘅,等.基于局部语义聚类的语义重叠社区发现算法[J].计算机研究与发展,2015,52(7):1510-1521
[7] Liao Lü-chao,Jiang Xin-hua,Zou Fu-min,et al.A Spectral Clustering Method for Big Trajectory Data Mining with Latent Semantic Correlation[J].Chinese Journal of Electronics,2015,43(5):956-964
[8] Yu Xiao-dong,Lei Ying-jie,Yue Shao-hua,et al.Research onPSO-based intuitionistic fuzzy kernel clustering algorithm[J].Journal of Communication,2015(5):74-80(in Chinese) 余晓东,雷英杰,岳韶华,等.基于粒子群优化的直觉模糊核聚类算法研究[J].通信学报,2015(5):74-80
[9] Liang Hua-dong,Han Jiang-hong.Clustering and Sorting Radar Signals Based on Multi-wavelet Packets Characteristics of Bispectrum[J].Acta Photonica Sinica,2014,43(3):152-159(in Chinese) 梁华东,韩江洪.采用双谱多类小波包特征的雷达信号聚类分选[J].光子学报,2014,43(3):152-159
[10] Sun Yan-wei,Peng Zhi-ming,Li Jian-bo.Community detection algorithm based on particle swarm optimization and fuzzy clustering[J].Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition),2015,7(5):660-666(in Chinese) 孙延维,彭智明,李健波.基于粒子群优化与模糊聚类的社区发现算法[J].重庆邮电大学学报(自然科学版),2015,7(5):660-666

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!