计算机科学 ›› 2021, Vol. 48 ›› Issue (6A): 664-667.doi: 10.11896/jsjkx.200500129

• 交叉&应用 • 上一篇    下一篇

数据腐蚀对GHTSOM模型的优化

石健1, 莫俊2   

  1. 1 深圳市英维克信息技术有限公司 广东 深圳518000
    2 深圳市英维克软件技术有限公司 广东 深圳518000
  • 出版日期:2021-06-10 发布日期:2021-06-17
  • 通讯作者: 莫俊(mushroom231@163.com)
  • 作者简介:gemingbanxie@163.com

Optimization of GHTSOM Model by Data Corrosion

SHI Jian1, MO Jun2   

  1. 1 Shenzhen Envicool Information and Technology Co.,Ltd.,Shenzhen,Guangdong 518000,China
    2 Shenzhen Envicool Software Technology Co.,Ltd.,Shenzhen,Guangdong 518000,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:SHI Jian,born in 1988,master,intermediate engineer of thermal automation.His main research interests include artificial environment control algorithm design and data analysis.
    MO Jun,born in 1985,bachelor,intermediate engineer of automation.His main research interests include design and test of control scheme for artificial environment.

摘要: 聚类算法被广泛应用于模式识别、信息检索、图像处理,以及自然语言处理等领域,GCS和SOM是两种常用的基于神经网络思想的聚类方式,很多学者在它们的基础上提出了不同的改进算法,GHTSOM(Growing Hierarchical Tree SOM)便是其中之一,对于数据分类较为清晰的应用场景效果良好,但不适用于干扰数据或者噪声数据较多的应用场景。利用图像处理中的腐蚀算法对GHTSOM算法进行优化,即在调用GHTSOM过程之前,先用腐蚀算法对数据进行处理,去除掉不同类别的数据交界位置处的干扰数据或者噪声数据,使不同类别数据之间出现较为明显的界限。为使表达更加直观,采用二维数据进行处理分析,结果表明,优化后的GHTSOM模型可有效避免由于类间局部连接造成的无法分类的问题,以及由于神经元过多所造成的误分类问题。

关键词: GHTSOM, 聚类, 数据处理, 数据腐蚀, 优化

Abstract: Clustering algorithm is widely used in pattern recognition,information retrieval,image processing and natural language processing.Two common clustering methods based on neural network are GCS and SOM.Many scholars have proposed different improved algorithms based on them.GHTSOM(Growing Hierarchical Tree SOM) is one of them.GHTSOM works well for applications where there is a clear classification of data,but it is not suitable for applications where there is a lot of noise or disturbing data.The corrosion algorithm in image processing is used to optimize the GHTSOM algorithm,that is,before calling the GHTSOM process,the data is processed by the corrosion algorithm to remove the interference data or noise data at the junction of different classes of data,making a distinction between different categories of data more obvious.To make the presentation more intuitive,two-dimensional datas are used.The results show that the optimized Ghsom model can effectively avoid the unclassifiable problems caused by local connections between classes and the misclassified problems caused by too many neurons.

Key words: Clustering, Data corruption, Data processing, GHTSOM, Optimize

中图分类号: 

  • TP183
[1] HODGE V J,AUSTIN J.Hierarchical growing cell structures:TreeGCS [J].IEEE Transactions on Knowledge and Data Engineering,2001,2(13):207-218.
[2] BURZEVSKI V,MOHAN C K.Hierarchical growing cell structures[C]//Proceedings of International Conference on Neural Networks(ICNN'96).1966:1668-1663.
[3] WU S T,CHOW T W S.PRSOM:a new visualization method by hybridizing multidimensional scaling and self-organizing map [J].IEEE Transactions on Neural Networks,2005,6(16):1362-1380.
[4] 许洋.基于SOM神经网络在流形学习中的研究与应用 [J].南方农机2020,51(9):177-180.
[5] LI Y Q,LIN H X.Fault diagnosis method of train control RBC system based on KPCA-SOM network [J].Journal of Measurement Science and Instrumentation,2020,11(2):161-168.
[6] FORTI A,FORESTI G L.Growing Hierarchical Tree SOM:An Unsupervised neural network with dynamic topology [J].Neural Networks,2006(19):1568-1580.
[7] YING H,WANG L Q,ZHAO X A.Automatic roads extraction from high-resolution remote sensing images based on SOM[C]//Circuits and Systems Society.2010 Sixth International Conference on Natural Computation,2010:1194-1198.
[8] ZHANG Y B,ZHOU K.Study on automotive style recognition with the image erosion technology [C]//2011 International Conference on Consumer Electronics.Communications and Networks(CECNet),2011.
[9] ONKA M,HLAVAC V,BOYLE R.Image Processing,Analysis,and Machine Vision [M]//CL-Engineering.2007.
[10] KOHONEN T.Self-Organized Formation of Topologically Correct Feature Maps [J].Biological Cybernetics,1982,1(43):59-69.
[1] 柴慧敏, 张勇, 方敏.
基于特征相似度聚类的空中目标分群方法
Aerial Target Grouping Method Based on Feature Similarity Clustering
计算机科学, 2022, 49(9): 70-75. https://doi.org/10.11896/jsjkx.210800203
[2] 鲁晨阳, 邓苏, 马武彬, 吴亚辉, 周浩浩.
基于分层抽样优化的面向异构客户端的联邦学习
Federated Learning Based on Stratified Sampling Optimization for Heterogeneous Clients
计算机科学, 2022, 49(9): 183-193. https://doi.org/10.11896/jsjkx.220500263
[3] 王灿, 刘永坚, 解庆, 马艳春.
基于软标签和样本权重优化的Anchor Free目标检测算法
Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization
计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240
[4] 陈俊, 何庆, 李守玉.
基于自适应反馈调节因子的阿基米德优化算法
Archimedes Optimization Algorithm Based on Adaptive Feedback Adjustment Factor
计算机科学, 2022, 49(8): 237-246. https://doi.org/10.11896/jsjkx.210700150
[5] 李其烨, 邢红杰.
基于最大相关熵的KPCA异常检测方法
KPCA Based Novelty Detection Method Using Maximum Correntropy Criterion
计算机科学, 2022, 49(8): 267-272. https://doi.org/10.11896/jsjkx.210700175
[6] 王兵, 吴洪亮, 牛新征.
基于改进势场法的机器人路径规划
Robot Path Planning Based on Improved Potential Field Method
计算机科学, 2022, 49(7): 196-203. https://doi.org/10.11896/jsjkx.210500020
[7] 唐枫, 冯翔, 虞慧群.
基于自适应知识迁移与资源分配的多任务协同优化算法
Multi-task Cooperative Optimization Algorithm Based on Adaptive Knowledge Transfer andResource Allocation
计算机科学, 2022, 49(7): 254-262. https://doi.org/10.11896/jsjkx.210600184
[8] 张翀宇, 陈彦明, 李炜.
边缘计算中面向数据流的实时任务调度算法
Task Offloading Online Algorithm for Data Stream Edge Computing
计算机科学, 2022, 49(7): 263-270. https://doi.org/10.11896/jsjkx.210300195
[9] 赵冬梅, 吴亚星, 张红斌.
基于IPSO-BiLSTM的网络安全态势预测
Network Security Situation Prediction Based on IPSO-BiLSTM
计算机科学, 2022, 49(7): 357-362. https://doi.org/10.11896/jsjkx.210900103
[10] 陈钧吾, 余华山.
面向无尺度图的Δ-stepping算法改进策略
Strategies for Improving Δ-stepping Algorithm on Scale-free Graphs
计算机科学, 2022, 49(6A): 594-600. https://doi.org/10.11896/jsjkx.210400062
[11] 刘漳辉, 郑鸿强, 张建山, 陈哲毅.
多无人机使能移动边缘计算系统中的计算卸载与部署优化
Computation Offloading and Deployment Optimization in Multi-UAV-Enabled Mobile Edge Computing Systems
计算机科学, 2022, 49(6A): 619-627. https://doi.org/10.11896/jsjkx.210600165
[12] 范星泽, 禹梅.
改进灰狼算法的无线传感器网络覆盖优化
Coverage Optimization of WSN Based on Improved Grey Wolf Optimizer
计算机科学, 2022, 49(6A): 628-631. https://doi.org/10.11896/jsjkx.210500037
[13] 王显芳, 张亮, 张宁.
基于前景理论的微信健康信息质量三方博弈分析
Evolutionary Game Analysis of WeChat Health Information Quality Optimization Based on Prospect Theory
计算机科学, 2022, 49(6A): 694-704. https://doi.org/10.11896/jsjkx.210900186
[14] 鲁晨阳, 邓苏, 马武彬, 吴亚辉, 周浩浩.
基于DBSCAN聚类的集群联邦学习方法
Clustered Federated Learning Methods Based on DBSCAN Clustering
计算机科学, 2022, 49(6A): 232-237. https://doi.org/10.11896/jsjkx.211100059
[15] 郁舒昊, 周辉, 叶春杨, 王太正.
SDFA:基于多特征融合的船舶轨迹聚类方法研究
SDFA:Study on Ship Trajectory Clustering Method Based on Multi-feature Fusion
计算机科学, 2022, 49(6A): 256-260. https://doi.org/10.11896/jsjkx.211100253
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!