Computer Science ›› 2022, Vol. 49 ›› Issue (8): 113-119.doi: 10.11896/jsjkx.210700153

• Computer Graphics & Multimedia •

Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism

ZHU Cheng-zhang1,2,3,4, HUANG Jia-er1,3,4, XIAO Ya-long1,2, WANG Han1,3,4, ZOU Bei-ji1,3,4   

  1 School of Computer Science and Engineering, Central South University, Changsha 410083, China
  2 School of Literature and Journalism, Central South University, Changsha 410083, China
  3 Mobile Health Ministry of Education-China Mobile Joint Laboratory, Changsha 410083, China
  4 Hunan Engineering Research Center of Machine Vision and Intelligent Medicine, Changsha 410083, China
  • Received: 2021-07-14 Revised: 2021-10-23 Published: 2022-08-02
  • About author: ZHU Cheng-zhang, born in 1978, Ph.D., associate professor, master supervisor. Her main research interests include pattern recognition, computer vision, and image processing.
    XIAO Ya-long, born in 1985, Ph.D., associate professor, is a member of China Computer Federation. His main research interests include wireless sensing and computational communication.
  • Supported by:
    National Key R&D Program of China (2018AAA0102100) and Hunan Province High-tech Industry Science and Technology Innovation Leading Program (2020GK2021).

Abstract: To address the poor retrieval performance, low accuracy, and lack of interpretability of current medical image retrieval methods, a deep hash retrieval algorithm incorporating an attention mechanism is proposed. Built on deep convolutional neural networks within a Bayesian framework, the algorithm introduces an attention module guided by semantic features, and local feature descriptors containing semantic information are generated under the guidance of a classification network. Both the global features and the semantically rich local features are fed into the hash network, which is thereby guided to attend to important feature regions from both global and local perspectives, enhancing the representation capability of the hash codes. A weighted likelihood estimation function is introduced to address the imbalance between the numbers of positive and negative sample pairs. Experiments are conducted on the ChestX-ray14 dataset with MAP and NDCG as evaluation metrics, and the proposed algorithm is compared with commonly used deep hashing methods. The results show that its MAP and NDCG values are considerably higher than those of the existing deep hashing methods across different hash code lengths, which demonstrates the effectiveness of the proposed algorithm.

Key words: Attention mechanism, Bayesian framework, Convolutional neural networks, Deep hashing, Medical image retrieval

CLC Number: 

  • TP391
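
The abstract mentions a weighted likelihood estimation function for handling the imbalance between positive and negative sample pairs, but does not give its form. The following is a minimal sketch of one common way such a weighting can be written, assuming a sigmoid pairwise likelihood on the inner product of relaxed hash codes and inverse-frequency pair weights. The function names, the `alpha` scale, and the weighting scheme are illustrative assumptions, not the formulation used in the paper.

```python
import math


def pair_weights(sim):
    """Weight each pair by the inverse frequency of its class.

    Assumption: similar pairs get |S| / |S1| and dissimilar pairs get
    |S| / |S0|, so the rarer class contributes more per pair.
    """
    total = len(sim)
    n_pos = sum(sim)
    n_neg = total - n_pos
    return [total / n_pos if s == 1 else total / n_neg for s in sim]


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def weighted_pairwise_nll(codes_a, codes_b, sim, alpha=1.0):
    """Weighted negative log-likelihood over pairs of relaxed hash codes.

    Assumed model: P(s = 1 | h_i, h_j) = sigmoid(alpha * <h_i, h_j>),
    which gives a per-pair loss of softplus(z) - s * z with
    z = alpha * <h_i, h_j>. The pair list must be non-empty.
    """
    weights = pair_weights(sim)
    loss = 0.0
    for h_i, h_j, s, w in zip(codes_a, codes_b, sim, weights):
        z = alpha * sum(x * y for x, y in zip(h_i, h_j))
        loss += w * (softplus(z) - s * z)
    return loss / sum(weights)


if __name__ == "__main__":
    # One similar pair among four: it receives three times the weight
    # of each dissimilar pair.
    print(pair_weights([1, 0, 0, 0]))
```

Under this assumed weighting, a dataset with few similar pairs still yields comparable total contributions from the similar and dissimilar classes, which is the effect the abstract attributes to the weighted likelihood.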