Multi-branch RA Capsule Network and Its Application in Image Classification

doi:10.11896/jsjkx.210400087

Abstract

Abstract: The capsule network (CapsNet) is a deep neural network that represents image features as vectors and uses a dynamic routing algorithm to overcome two major problems of convolutional neural networks. First, convolutional neural networks cannot learn and express the part-whole relationships in an image. Second, their pooling operations cause serious loss of image feature information. However, CapsNet must learn all the features of an image, so when the image background is complex it extracts insufficient feature information, requires a large number of training parameters, and trains inefficiently. To address this, a lightweight image feature extractor, the RA module, is first designed to extract image feature information faster and more completely. Second, two lightweight branches of different depths are designed to improve the training efficiency of the network. Finally, a new compression function, hc-squash, is designed to ensure that the network acquires more useful information, and on this basis a multi-branch RA (ResNet Attention) capsule network is proposed. Experiments on four image classification datasets, MNIST, Fashion-MNIST, affNIST and CIFAR-10, confirm that the multi-branch RA capsule network outperforms CapsNet and MLCN on several performance metrics, and an improvement scheme is designed for the proposed network to achieve optimized classification performance.
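For readers unfamiliar with the baseline, the following is a minimal sketch of the standard squash function and routing-by-agreement loop from the original CapsNet (Sabour et al., 2017). It is not the hc-squash function or the multi-branch RA architecture proposed here, which the abstract does not specify. The function names `squash`, `softmax` and `dynamic_routing`, the `eps` stabilizer, and the nested-list layout of `u_hat` are assumptions made for illustration only.

```python
import math
from typing import List, Sequence


def squash(s: Sequence[float], eps: float = 1e-9) -> List[float]:
    """Standard CapsNet squash: keeps the direction of s, maps its length into [0, 1)."""
    sq_norm = sum(x * x for x in s)
    norm = math.sqrt(sq_norm)
    # v = (|s|^2 / (1 + |s|^2)) * (s / |s|); eps (an assumption) avoids division by zero.
    scale = sq_norm / (1.0 + sq_norm) / (norm + eps)
    return [scale * x for x in s]


def softmax(xs: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one list of logits."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_routing(u_hat: List[List[List[float]]], iterations: int = 3) -> List[List[float]]:
    """Routing by agreement.

    u_hat[i][j] is the prediction vector that input capsule i makes
    for output capsule j. Returns one output vector per output capsule.
    """
    n_in, n_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * n_out for _ in range(n_in)]  # routing logits, start uniform
    v: List[List[float]] = []
    for _ in range(iterations):
        c = [softmax(row) for row in b]  # coupling coefficients per input capsule
        v = []
        for j in range(n_out):
            # Weighted sum of predictions for output capsule j, then squash.
            s_j = [sum(c[i][j] * u_hat[i][j][d] for i in range(n_in)) for d in range(dim)]
            v.append(squash(s_j))
        for i in range(n_in):
            for j in range(n_out):
                # Agreement: dot product between the prediction and the current output.
                b[i][j] += sum(u_hat[i][j][d] * v[j][d] for d in range(dim))
    return v


def length(v: Sequence[float]) -> float:
    """Euclidean length of a capsule vector, read as an existence probability."""
    return math.sqrt(sum(x * x for x in v))
```

Because the squashed length is always below 1, it can be read as the probability that the entity a capsule represents is present; a replacement compression function such as hc-squash would take the place of `squash` in this loop.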

Key words: Attention mechanism, Capsule network, Deep learning, ResNet attention module, Squash function

CLC Number: TP391.41

WU Lin, SUN Jing-yu. Multi-branch RA Capsule Network and Its Application in Image Classification[J]. Computer Science, 2022, 49(6): 224-230.
