Computer Science ›› 2021, Vol. 48 ›› Issue (1): 175-181.doi: 10.11896/jsjkx.200200023

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Domain Alignment Based Object Detection of X-ray Images

HE Yan-hui1, WU Gui-xing1,2, WU Zhi-qiang1   

  1. 1 School of Software Engineering,University of Science and Technology of China,Suzhou,Jiangsu 215123,China
    2 Suzhou Research Institute,University of Science and Technology of China,Suzhou,Jiangsu 215123,China
  • Received:2020-02-05 Revised:2020-05-24 Online:2021-01-15 Published:2021-01-15
  • About author:HE Yan-hui,born in 1994,postgra-duate.His main research interests include object detection,transfer learning and so on.
    WU Gui-xing,born in 1972,Ph.D,professor,Ph.D supervisor,researcher,is a member of China Computer Federation.His main research interests include information theory,multimedia and so on.
  • Supported by:
    Natural Science Foundation ofJiangsu Province,China(BK20141209).

Abstract: Significant progress has been made towards building accurate automatic object detection systems for a variety of parcel security check applications using convolutional neural networks.However,the performance of these systems often degrades when they are applied to new data that differs from the training data,for example,due to variations in X-ray imaging.In this paper,we propose a context-based and transmittance adaptive domain alignment method to address the above performance degradation.Firstly,by using color information existed in X-ray images,we design an attention mechanism to process each color channel of an X-ray image separately to solve the problem of color differences among different X-ray machines.Next,we develop a feature alignment method to reduce the statistics difference among different X-ray images generated by various manufacturers.Finally,we propose to use a context vector as a regularization for the improvement of adversarial training to improve the precision.Theme-thod proposed in this paper solves the problem of the accuracy degradation of the object detection in the test domain,which is different from the training domain.

Key words: Convolutional neural network, Domain alignment, Object detection, Transmittance-adptation, X-ray images

CLC Number: 

  • TP183
[1] EVERINGHAM M,VAN GOOL L,WILLIAMS C K,et al.The pascal visual object classes(voc) challenge [J].IJCV 2010,88(2):303-338.
[2] REN S,HE K,GIRSHICK R,et al.Faster rcnn:Towards real-time object detection with region proposal networks[C]//NIPS.2015:1-9.
[3] MERY D,RIFFO V,ZSCHERPEL U,et al.Gdxray:The database of x-ray images for nondestructive testing[J].Journal of Nondestructive Evaluation,2010,34(4):42.
[4] MIAO C J,XIE L X,WAN F,et al.SIXray:A Large-scale Security Inspection X-ray Benchmark for Prohibited Item Discovery in Overlapping Images[J].arXiv:1901.00303.
[5] Ailibaba.Dataset :Logistics goods restriction monitoring[EB/OL].(2019-03-05) [2019-06-08].https://tianchi.aliyun.com/competition/entrance/231703/introduction.
[6] GANIN Y,LEMPITSKY V.Unsupervised domain adaptationby backpropagation[C]//ICML.2014.
[7] BEN-DAVID S,BLITZER J,CRAMMER K,et al.A theory of learning from different domains[J].Machine Learning,2010,79(1):151-175.
[8] TZENG E,HOFFMAN J,SAENKO K,et al.Adversarial dis-criminative domain adaptation[C]//CVPR.2017:11,19.
[9] TZENG E,HOFFMAN J,ZHANG N,et al.Deep domain confusion:Maximizing for domain invariance[J].arXiv:1412.3474.
[10] LONG M,CAO Y,WANG J,et al.Learning transferable fea-tures with deep adaptation networks[C]//ICML.2015.
[11] DUAN L,TSANG I W,XU D.Domain transfer multiple kernel learning[J].TPAMI,2012,34(3):465-479.
[12] INOUE N,FURUTA R,YAMASAKI T,et al.Cross-domainweakly-supervised object detection through progressive domain adaptation[J].arXiv:1803.11365.
[13] ZHU J Y,PARK T,ISOLA P,et al.Unpaired image-to-image translation using cycle-consistent adversarial networks[J].arXiv:1703.10593.
[14] CHEN Y,LI W,SAKARIDIS C,et al.Domain adaptive faster r-cnn for object detection in the wild[J].arXiv:1803.03243.
[15] SAITO K,USHIKU Y,HARADA T,et al.Strong-Weak Distribution Alignment for Adaptive Object Detection[C]//CVPR.2019:2154-2162.
[16] ZHU X G,JIANG M,YANG C Y.Adapting Object Detectors via Selective Cross-Domain Alignment [C]//CVPR.2019:687-696.
[17] SCHWAB P,KELLER E,MUROI C.Not to Cry Wolf:Distantly Supervised Multitask Learning in Critical Care[J].arXiv:1802.05027.
[18] CAI Z,VASCONCELOS N.Cascade R-CNN:Delving into high quality object detection[J].arXiv:1712.00726.
[19] HE K M,ZHANG X Y,REN S Q,et al.Deep Residual Learning for Image Recognition[J].arXiv:1512.03385.
[20] LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[J].arXiv:1708.02002.
[21] LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//CVPR.2017:2117-2125.
[22] DENG J,DONG W,SOCHER R,et al.Imagenet:A large-scale hierarchical image database[C]//CVPR.2009:248-255.
[23] CORDTS M,OMRAN M,RAMOS S,et al.The Cityscapesdataset for semantic urban scene understanding[C]//CVPR.2016:268-276.
[1] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[2] CHEN Yong-quan, JIANG Ying. Analysis Method of APP User Behavior Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(8): 78-85.
[3] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[4] LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[5] WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[6] DAI Zhao-xia, LI Jin-xin, ZHANG Xiang-dong, XU Xu, MEI Lin, ZHANG Liang. Super-resolution Reconstruction of MRI Based on DNGAN [J]. Computer Science, 2022, 49(7): 113-119.
[7] LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.
[8] XU Ming-ke, ZHANG Fan. Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition [J]. Computer Science, 2022, 49(7): 132-141.
[9] WU Zi-bin, YAN Qiao. Projected Gradient Descent Algorithm with Momentum [J]. Computer Science, 2022, 49(6A): 178-183.
[10] ZHANG Jia-hao, LIU Feng, QI Jia-yin. Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer [J]. Computer Science, 2022, 49(6A): 370-377.
[11] WANG Jian-ming, CHEN Xiang-yu, YANG Zi-zhong, SHI Chen-yang, ZHANG Yu-hang, QIAN Zheng-kun. Influence of Different Data Augmentation Methods on Model Recognition Accuracy [J]. Computer Science, 2022, 49(6A): 418-423.
[12] CHEN Yong-ping, ZHU Jian-qing, XIE Yi, WU Han-xiao, ZENG Huan-qiang. Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss [J]. Computer Science, 2022, 49(6A): 424-428.
[13] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[14] YANG Yue, FENG Tao, LIANG Hong, YANG Yang. Image Arbitrary Style Transfer via Criss-cross Attention [J]. Computer Science, 2022, 49(6A): 345-352.
[15] YANG Jian-nan, ZHANG Fan. Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure [J]. Computer Science, 2022, 49(6A): 353-357.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!