Computer Science ›› 2024, Vol. 51 ›› Issue (9): 155-161.doi: 10.11896/jsjkx.230900109

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Semantic-guided Neural Network Critical Data Routing Path

ZHU Fukun1, TENG Zhen2, SHAO Wenze1, GE Qi1, SUN Yubao3   

  1. 1 School of Communications and Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
    2 Bell Honors School,Nanjing University of Posts and Telecommunications,Nanjing 210042,China
    3 Engineering Research Center for Digital Forensics Ministry of Education,Nanjing University of Information Science and Technology,Nanjing 210044,China
  • Received:2023-09-19 Revised:2024-03-06 Online:2024-09-15 Published:2024-09-10
  • About author:ZHU Fukun,born in 1999,postgra-duate.His main research interests include interpretability and adversarial transferability of deep learning mo-dels.
    SHAO Wenze,born in 1981,Ph.D,professor.His main research interests include computational imaging,computer vision and machine learning.
  • Supported by:
    National Natural Science Foundation of China(61771250,61972213).

Abstract: In recent years,with the popularity of artificial intelligence in various fields,it has become an increasingly important topic to study the interpretable methods of neural networks and understand their running principles.As a subfield of neural network interpretability methods,the interpretability of network pathways garners increasing attention.This paper particularly focuses on the critical data routing path(CDRP),an interpretable method for network pathways.Firstly,the routing path visualization attribution of CDRP in the input domain is analyzed by use of the score-class activation map(Score-CAM) method,pointing out the inherent defects of the CDRP approach in terms of semantics.Then a channel semantic guided CDRP method termed as Score-CDRP is proposed,which improves the semantic consistency between the original deep neural network and its corresponding CDRP from the perspective of method mechanism.Lastly,experimental results demonstrate that the proposed Score-CDRP approach is more reasonable,effective and robust than CDRP in terms of visualization of the routing path heatmap as well as its corresponding prediction and localization accuracy.

Key words: Computer vision, Deep neural networks, Interpretability of neural networks, Feature visualization, Network pruning, Heatmap

CLC Number: 

  • TP183
[1]SUBAKAN C,RAVANELLI M,CORNELL S,et al.Attention is all you need in speech separation[C]//2021 IEEE International Conference on Acoustics,Speech and Signal Processing.(ICASSP 2021)IEEE,2021:21-25.
[2]ZHU X,LYU S,WANG X,et al.TPH-YOLOv5:ImprovedYOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:2778-2788.
[3]RICCARDO G,ANNA M,SALVATORE R,et al.A Survey Of Methods For Explaining Black Box Models[J].ACM Computing Surveys,2018,51(5):1-42.
[4]YOSINSKI J,CLUNE J,NGUYEN A,et al.Understandingneural networks through deep visualization[J].arXiv:1506.06579,2015.
[5]ERHAN D,BENGIO Y,COURVILLE A,et al.Visualizinghigher-layer features of a deep network[J].University of Montreal,2009,1341(3):1-13.
[6]ZHOU B,KHOSLA A,LAPEDRIZA A,et al.Learning deepfeatures for discriminative localization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:2921-2929.
[7]BACH S,BINDER A,MONTAVON G,et al.On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation[J].PloS One,2015,10(7):e0130140.
[8]SPRINGENBERG J T,DOSOVITSKIY A,BROX T,et al.Striving for simplicity:The all convolutional net[J].arXiv:1412.6806,2014.
[9]WANG Y,SU H,ZHANG B,et al.Interpret neural networks byidentifying critical data routing paths[C]//proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8906-8914.
[10]TIBSHIRANI R.Regression shrinkage and selection via the lasso[J].Journal of the Royal Statistical Society:Series B(Metho-dological),1996,58(1):267-288.
[11]HOEFLER T,ALISTARH D,BEN-NUN T,et al.Sparsity in deep learning:Pruning and growth for efficient inference and training in neural networks[J].The Journal of Machine Lear-ning Research,2021,22(1):10882-11005.
[12]HINTON G,VINYALS O,DEAN J.Distilling the Knowledgein a Neural Network[J].Computer Science,2015,14(7):38-39.
[13]SELVARAJU R R,COGSWELL M,DAS A,et al.Grad-cam:Visual explanations from deep networks via gradient-based localization[C]//Proceedings of the IEEE International Confe-rence on Computer Vision.2017:618-626.
[14]CHATTOPADHAY A,SARKAR A,HOWLADER P,et al.Grad-cam++:Generalized gradient-based visual explanations for deep convolutional networks[C]//2018 IEEE Winter Conference on Applications of Computer Vision.IEEE,2018:839-847.
[15]SATTARZADEH S,SUDHAKAR M,PLATANIOTIS K N,et al.Integrated grad-cam:Sensitivity-aware visual explanation of deep convolutional networks via integrated gradient-based scoring[C]//ICASSP 2021-2021 IEEE International Conference on Acoustics,Speech and Signal Processing.IEEE,2021:1775-1779.
[16]JIANG P,ZHANG C,HOU Q,et al.LayerCAM:Exploring Hi-erarchical Class Activation Maps for Localization.[J].IEEE Transactions on Image Processing:a Publication of the IEEE Signal Processing Society,2021,30:5875-5888.
[17]WANG H,WANG Z,DU M,et al.Score-CAM:Score-weighted visual explanations for convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:24-25.
[18]IBRAHIM R,SHAFIQ M O.Augmented Score-CAM:High reso-lution visual interpretations for deep neural networks[J].Knowledge-Based Systems,2022,252:109287.
[19]KHAKZAR A,BASELIZADEH S,KHANDUJA S,et al.Neural response interpretation through the lens of critical pathways[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:13528-13538.
[20]ZHANG Y,TIÑO P,LEONARDIS A,et al.A survey on neural network interpretability[J].IEEE Transactions on Emerging Topics in Computational Intelligence,2021,5(5):726-742.
[21]QIU Y,LENG J,GUO C,et al.Adversarial defense throughnetwork profiling based path extraction[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:4777-4786.
[22]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition[J].arXiv:1409.1556,2014.
[23]RUSSAKOVSKY O,DENG J,SU H,et al.Imagenet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115:211-252.
[24]DESAI S,RAMASWAMY H G.Ablation-CAM:Visual Explanations for Deep Convolutional Network via Gradient-free Localization[C]//Workshop on Applications of Computer Vision.IEEE,2020:972-980.
[1] HAN Bing, DENG Lixiang, ZHENG Yi, REN Shuang. Survey of 3D Point Clouds Upsampling Methods [J]. Computer Science, 2024, 51(7): 167-196.
[2] CAI Wenliang, HUANG Jun. Lane Detection Method Based on RepVGG [J]. Computer Science, 2024, 51(7): 236-243.
[3] XU Xiaohua, ZHOU Zhangbing, HU Zhongxu, LIN Shixun, YU Zhenjie. Lightweight Deep Neural Network Models for Edge Intelligence:A Survey [J]. Computer Science, 2024, 51(7): 257-271.
[4] YUAN Zhen, LIU Jinfeng. Denoising Autoencoders Based on Lossy Compress Coding [J]. Computer Science, 2024, 51(6A): 230400172-7.
[5] LU Dongsheng, LONG Hua. Method for Homologous Spectrum Monitoring Data Identification Based on Spectrum SIFT [J]. Computer Science, 2024, 51(6A): 230300177-7.
[6] HUANG Haixin, CAI Mingqi, WANG Yuyao. Review of Point Cloud Semantic Segmentation Based on Graph Convolutional Neural Networks [J]. Computer Science, 2024, 51(6A): 230400196-7.
[7] LI Wenting, XIAO Rong, YANG Xiao. Improving Transferability of Adversarial Samples Through Laplacian Smoothing Gradient [J]. Computer Science, 2024, 51(6A): 230800025-6.
[8] CAI Jiacheng, DONG Fangmin, SUN Shuifa, TANG Yongheng. Unsupervised Learning of Monocular Depth Estimation:A Survey [J]. Computer Science, 2024, 51(2): 117-134.
[9] CUI Zhenyu, ZHOU Jiahuan, PENG Yuxin. Survey on Cross-modality Object Re-identification Research [J]. Computer Science, 2024, 51(1): 13-25.
[10] SONG Xinyang, YAN Zhiyuan, SUN Muyi, DAI Linlin, LI Qi, SUN Zhenan. Review of Talking Face Generation [J]. Computer Science, 2023, 50(8): 68-78.
[11] WANG Xu, WU Yanxia, ZHANG Xue, HONG Ruize, LI Guangsheng. Survey of Rotating Object Detection Research in Computer Vision [J]. Computer Science, 2023, 50(8): 79-92.
[12] ZHOU Ziyi, XIONG Hailing. Image Captioning Optimization Strategy Based on Deep Learning [J]. Computer Science, 2023, 50(8): 99-110.
[13] SUN Kaiwei, WANG Zhihao, LIU Hu, RAN Xue. Maximum Overlap Single Target Tracking Algorithm Based on Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220400023-5.
[14] QIN Jing, WANG Weibin, ZOU Qijie, WANG Zumin, JI Changqing. Review of 3D Target Detection Methods Based on LiDAR Point Clouds [J]. Computer Science, 2023, 50(6A): 220400214-7.
[15] WU Han, NIE Jiahao, ZHANG Zhaowei, HE Zhiwei, GAO Mingyu. Deep Learning-based Visual Multiple Object Tracking:A Review [J]. Computer Science, 2023, 50(4): 77-87.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!