Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation

doi:10.11896/jsjkx.210900199

Abstract

Abstract: Pooling is an essential component of deep convolutional neural networks and one of the key factors in their success. In image recognition, however, conventional direct pooling discards feature information and degrades recognition accuracy. This paper proposes a dual-field feature fusion module based on the discrete wavelet transform to overcome this drawback. The module fuses features in both the spatial domain and the channel domain, and embeds the pooling operation between the spatial feature fusion module and the channel feature fusion module, which effectively suppresses the feature information loss caused by direct pooling. By replacing the existing pooling operation, the new module can be easily embedded into currently popular deep neural network architectures. Extensive experiments are conducted on the CIFAR-10, CIFAR-100 and Mini-ImageNet datasets using mainstream network architectures such as VGG, ResNet and DenseNet. The results show that, compared with the classical networks, popular networks with embedded attention mechanisms, and the latest wavelet-based models, the proposed method achieves higher classification accuracy.
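The abstract does not give the module's internals, so the following is only a minimal sketch of the general idea behind it: a single-level 2D Haar wavelet transform halves the spatial resolution like 2x2 pooling, but its four sub-bands together can be inverted exactly, whereas average pooling keeps only a scaled copy of the low-frequency band. The function names (`haar_dwt2`, `haar_idwt2`, `avg_pool2`), the choice of the Haar wavelet, and the sub-band labels are assumptions made for illustration and are not taken from the paper.

```python
def haar_dwt2(x):
    """Single-level orthonormal 2D Haar DWT of an even-sized 2D list.

    Returns four half-resolution sub-bands (ll, lh, hl, hh). The labels
    are illustrative; naming conventions differ between libraries.
    """
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, len(x), 2):
        r_ll, r_lh, r_hl, r_hh = [], [], [], []
        for j in range(0, len(x[0]), 2):
            a, b = x[i][j], x[i][j + 1]
            c, d = x[i + 1][j], x[i + 1][j + 1]
            r_ll.append((a + b + c + d) / 2)  # low-frequency approximation
            r_lh.append((a - b + c - d) / 2)  # difference between columns
            r_hl.append((a + b - c - d) / 2)  # difference between rows
            r_hh.append((a - b - c + d) / 2)  # diagonal difference
        ll.append(r_ll)
        lh.append(r_lh)
        hl.append(r_hl)
        hh.append(r_hh)
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the full-resolution 2D list."""
    rows, cols = len(ll), len(ll[0])
    out = [[0.0] * (2 * cols) for _ in range(2 * rows)]
    for i in range(rows):
        for j in range(cols):
            s, h, v, d = ll[i][j], lh[i][j], hl[i][j], hh[i][j]
            out[2 * i][2 * j] = (s + h + v + d) / 2
            out[2 * i][2 * j + 1] = (s - h + v - d) / 2
            out[2 * i + 1][2 * j] = (s + h - v - d) / 2
            out[2 * i + 1][2 * j + 1] = (s - h - v + d) / 2
    return out


def avg_pool2(x):
    """Plain 2x2 average pooling with stride 2, for comparison."""
    return [
        [(x[i][j] + x[i][j + 1] + x[i + 1][j] + x[i + 1][j + 1]) / 4
         for j in range(0, len(x[0]), 2)]
        for i in range(0, len(x), 2)
    ]


if __name__ == "__main__":
    checker = [[1, 3], [3, 1]]  # high-frequency pattern
    flat = [[2, 2], [2, 2]]     # constant patch with the same mean
    # Average pooling maps both patches to the same value ...
    print(avg_pool2(checker) == avg_pool2(flat))
    # ... while the Haar sub-bands still distinguish them and invert exactly.
    print(haar_dwt2(checker)[3], haar_dwt2(flat)[3])
    print(haar_idwt2(*haar_dwt2(checker)) == checker)
```

In this sketch, average pooling equals the `ll` band divided by 2, so the detail carried by the other three bands is exactly what direct pooling throws away. How the paper's module combines or weights such sub-bands in its spatial and channel fusion steps is not stated in the abstract and is not modeled here.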

Key words: Attention mechanisms, Deep convolutional neural networks, Discrete wavelet transform, Feature fusion, Pooling operation

CLC Number: TP391

SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.


