Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 231000069-9.doi: 10.11896/jsjkx.231000069

• Intelligent Computing • Previous Articles     Next Articles

Design and Application of Attention-enhanced Dynamic Self-organizing Modular Neural Network

ZHANG Zhaozhao1, PAN Haoran1, ZHU Yingqin2   

  1. 1 Institute of Computer Science and Technology,Xi'an University of Science and Technology,Xi'an 710054,China
    2 Departamento de Control Automático CINVESTAV-IPN(National Polytechnic Institute),Mexico City 07360,Mexico
  • Online:2024-11-16 Published:2024-11-13
  • About author:ZHANG Zhaozhao,born in 1973,asso-ciate professor,master's supervisor.His main research interests include intelligent information processing and optimization design of neural network structures.
    PAN Haoran,born in 1998,master's research student.His main research intere-sts include optimization design of neural network structures.
  • Supported by:
    China Scholarship Council(CSC)(202310120001).

Abstract: In response to the complexity and non-linear characteristics of chaotic time series,this paper proposes a novel neural network model specifically designed to address these challenges:the attention-enhanced dynamic self-organizing modular neural network(ADAMNN).Grounded in the divide-and-conquer philosophy,this model employs an attention mechanism to compute the similarity between different sub-networks and input data,facilitating an adaptive partitioning of sub-networks through hierarchical clustering.Subsequently,a dynamic growth mechanism,based on hierarchical clustering,adjusts the size of sub-network clusters.Ultimately,activated sub-network clusters are employed for online learning of input samples.Simultaneously,we introduce a novel attention-based sub-network weighted ensemble output method,integrating traditional ensemble output approaches.Ultimately,experiments were conducted on the Mackey-Glass time series,the rapidly varying MG time series,in the realm of nonli-near system identification,and using gas concentration datasets from coal mining operations.The ADAMNN model exhibited its proficiency in real-time updates of sub-network centers and the dynamic formation of sub-network clusters.Moreover,compared to dynamic self-organizing modular neural networks based on Euclidean space,ADAMNN exhibits an approximately 40% improvement in prediction accuracy.

Key words: Modular neural networks, Self-organization neural networks, Chaotic time series, Attention mechanism, Hierarchical clustering

CLC Number: 

  • TP183
[1]BALKISSOON S,FOX N,LUPO A,et al.Determining chaoticcharacteristics and forecasting tall tower wind speeds in Missouri using empirical dynamical modeling(EDM)[J].Renewable Energy,2021,170:1292-1307.
[2]CHENG W,WANG Y,PENG Z,et al.High-efficiency chaotic time series prediction based on time convolution neural network[J].Chaos,Solitons & Fractals,2021,152:111304.
[3]HAN M,ZHANG S,XU M,et al.Multivariate chaotic time se-ries online prediction based on improved kernel recursive least squares algorithm[J].IEEE Transactions on Cybernetics,2018,49(4):1160-1172.
[4]BECKER S,HINTON G E.Self-organizing neural network that discovers surfaces in random-dot stereograms[J].Nature,1992.355(6356):161-163.
[5]QIAO J F,AN R,HAN H G.Design of self-organizing RBF neural network based on relative contribution index[J].CAAI Transactions on Intelligent Systems,2018,13(2):159-167.
[6]EL-SOUSY F F M,ABUHASEL K A.Adaptive nonlinear disturbance observer using a double-loop self-organizing recurrent wavelet neural network for a two-axis motion control system[J].IEEE Transactions on Industry Applications,2017,54(1):764-786.
[7]LI W L,GAO H W,JI D X,et al.Optimization Method of Sea-bed Sediment Texture Feature Based on Genetic Algorithm[J].Computer Science,2016,43(Z6):130-133.
[8]JACOBS R A,JORDAN M I,NOWLAN S J,et al.Adaptive Mixtures of Local Experts[J].Neural Computation,1991,3(1):79-87.
[9]ZHANG Z Z,QIAO J F,YU W.Structure Design of Hierarchical Adaptive Modular Neural Network[J].Chinese Journal of Computers,2017,40(12):2827-2838.
[10]TEREKHOV A V,MONTONE G,O'REGAN J K.Knowledge transfer in deep block-modular neural networks[C]//Biomime-tic and Biohybrid Systems:4th International Conference,Living Machines 2015,Barcelona,Spain.Springer International Publishing,2015:268-279.
[11]WATANABE C,HIRAMATSU K,KASHINO K.Modular representation of layered neural networks[J].Neural Networks,2018,97:62-73.
[12]PHAN K T,MAUL T H,VU T T.A parallel circuit approach for improving the speed and generalization properties of neural networks[C]//2015 11th International Conference on Natural Computation(ICNC).IEEE,2015:1-7.
[13]ERREIRA M D F,CORRÊA D C,NONATO L G,et al.Designing architectures of convolutional neural networks to solve practical problems[J].Expert Systems with Applications,2018,94:205-217.
[14]ZHANG Z Z.Design of dynamic adaptive modular neural network structure.Control and Decision,2014,29(1),64-70.
[15]QIAO J,ZHANG Z,BO Y.An online self-adaptive modular neural network for time-varying systems[J].Neurocomputing,2014,125:7-16.
[16]GUO X,LI W,QIAO J.An improved online adaptive modular neural network.Control and Decision,2020,35(7),1597-1605.
[17]GUO X,LI W,QIAO J.A self-organizing modular neural network based on empirical mode decomposition with sliding window for time series prediction[J].Applied Soft Computing,2023:110559.
[18]ZHANG Y,YANG Q.A survey on multi-task learning[J].IEEE Transactions on Knowledge and Data Engineering,2021,34(12):5586-5609.
[19]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Advances in Neural Information Processing Systems,2017,30:6000-6010.
[20]PHAM H,LUONG M T,MANNING C D.2015.Effective Approaches to Attention-based Neural Machine Translation[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.Lisbon,Portugal:Association for Computational Linguistics.2015:1412-1421.
[21]MURTAGH F,LEGENDRE P.Ward's hierarchical agglomerative clustering method:which algorithms implement Ward's criterion?[J].Journal of classification,2014,31:274-295.
[22]BALTRUŠAITIS T,AHUJA C,MORENCY L P.Multimodal machine learning:A survey and taxonomy[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2018,41(2):423-443.
[23]MACKEY M C,GLASS L.Oscillation,and chaos in physiological control systems[J].Science,1977,197(4300):287-289.
[24]SU Y,YANG C,QIAO J.Self-organizing pipelined recurrentwavelet neural network for time series prediction[J].Expert Systems with Applications,2023,214:119215.
[25]LAI X W,ZHENG W B,WU Y Q,et al.Task Collaborative Process Network Model and Time Analysis of Mine Accident Emergency Rescue Digital Plan[J].Computer Science,2021,48(S1):596-602.
[26]WANG E Y,ZHANG G R,ZHANG C L,et al.Research progress and prospect on theory and technology for coal and gas outburst control and protection in China[J].Journal of China Coal Society,2022,47(1):297-322.
[1] LI Yunchen, ZHANG Rui, WANG Jiabao, LI Yang, WANG Ziqi, CHEN Yao. Re-parameterization Enhanced Dual-modal Realtime Object Detection Model [J]. Computer Science, 2024, 51(9): 162-172.
[2] HU Pengfei, WANG Youguo, ZHAI Qiqing, YAN Jun, BAI Quan. Night Vehicle Detection Algorithm Based on YOLOv5s and Bistable Stochastic Resonance [J]. Computer Science, 2024, 51(9): 173-181.
[3] LIU Qian, BAI Zhihao, CHENG Chunling, GUI Yaocheng. Image-Text Sentiment Classification Model Based on Multi-scale Cross-modal Feature Fusion [J]. Computer Science, 2024, 51(9): 258-264.
[4] LI Zhe, LIU Yiyang, WANG Ke, YANG Jie, LI Yafei, XU Mingliang. Real-time Prediction Model of Carrier Aircraft Landing Trajectory Based on Stagewise Autoencoders and Attention Mechanism [J]. Computer Science, 2024, 51(9): 273-282.
[5] LIU Qilong, LI Bicheng, HUANG Zhiyong. CCSD:Topic-oriented Sarcasm Detection [J]. Computer Science, 2024, 51(9): 310-318.
[6] YAO Yao, YANG Jibin, ZHANG Xiongwei, LI Yihao, SONG Gongkunkun. CLU-Net Speech Enhancement Network for Radio Communication [J]. Computer Science, 2024, 51(9): 338-345.
[7] LIU Sichun, WANG Xiaoping, PEI Xilong, LUO Hangyu. Scene Segmentation Model Based on Dual Learning [J]. Computer Science, 2024, 51(8): 133-142.
[8] ZHANG Rui, WANG Ziqi, LI Yang, WANG Jiabao, CHEN Yao. Task-aware Few-shot SAR Image Classification Method Based on Multi-scale Attention Mechanism [J]. Computer Science, 2024, 51(8): 160-167.
[9] WANG Qian, HE Lang, WANG Zhanqing, HUANG Kun. Road Extraction Algorithm for Remote Sensing Images Based on Improved DeepLabv3+ [J]. Computer Science, 2024, 51(8): 168-175.
[10] XIAO Xiao, BAI Zhengyao, LI Zekai, LIU Xuheng, DU Jiajin. Parallel Multi-scale with Attention Mechanism for Point Cloud Upsampling [J]. Computer Science, 2024, 51(8): 183-191.
[11] PU Bin, LIANG Zhengyou, SUN Yu. Monocular 3D Object Detection Based on Height-Depth Constraint and Edge Fusion [J]. Computer Science, 2024, 51(8): 192-199.
[12] ZHANG Junsan, CHENG Ming, SHEN Xiuxuan, LIU Yuxue, WANG Leiquan. Diversified Label Matrix Based Medical Image Report Generation [J]. Computer Science, 2024, 51(8): 200-208.
[13] WANG Chao, TANG Chao, WANG Wenjian, ZHANG Jing. Infrared Human Action Recognition Method Based on Multimodal Attention Network [J]. Computer Science, 2024, 51(8): 232-241.
[14] ZHANG Lu, DUAN Youxiang, LIU Juan, LU Yuxi. Chinese Geological Entity Relation Extraction Based on RoBERTa and Weighted Graph Convolutional Networks [J]. Computer Science, 2024, 51(8): 297-303.
[15] CHEN Shanshan, YAO Subin. Study on Recommendation Algorithms Based on Knowledge Graph and Neighbor PerceptionAttention Mechanism [J]. Computer Science, 2024, 51(8): 313-323.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!