Computer Science ›› 2024, Vol. 51 ›› Issue (10): 287-294.doi: 10.11896/jsjkx.230800013

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Infrared Dim and Small Target Detection Based on Cross-domain Migration of Visible Light andInfrared

XUE Ruxiang1, WEI Junjie2, ZHOU Huawei2, YANG Hai1, WANG Zhe1   

  1. 1 School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China
    2 Shanghai Academy of Spaceflight Technology,Shanghai 201109,China
  • Received:2023-08-03 Revised:2023-12-28 Online:2024-10-15 Published:2024-10-11
  • About author:XUE Ruxiang,born in 1999,postgra-duate.Her main research interests include image processing and object detection.
    WANG Zhe,born in 1981,Ph.D,professor,is a member of CCF(No.16666M).His main research interests include pattern recognition and image processing.
  • Supported by:
    Chinese Defense Program of Science and Technology(2021-JCJQ-JJ-0041) and China Aerospace Science and Technology Corporation Industry-University-Research Cooperation Foundation of the Eighth Research Institute(SAST2021-007).

Abstract: The task of infrared dim and small target detection is one of the key research contents in the field of infrared detection.However,due to the particularity of its application scenarios,the data containing infrared dim and small targets is rare,and often not fully labeled,which poses challenges and difficulties for data-driven deep learning object detection models.In order to solve the problems of limited datasets and lack of label information,an infrared dim and small target detection model based on cross-domain migration of visible light and infrared is proposed to migrate the more abundant visible light domain supervision information to the infrared domain,so as to achieve unsupervised training in the infrared domain.First,a channel augmentation data proces-sing method is designed on the basis of YOLOv5,utilizing low-cost channel separation techniques to convert visible light images into infrared like images,reducing the modal differences between the visible and infrared domains.Then,a multi-scale domain adaptive module is constructed,and the features of different scales extracted by the backbone network are used in the way of adversarial training.Domain confusion is performed in the feature space to reduce the impact of domain shift and improve the detection performance of dim and small target detection.Experimental results show that the improved model by the proposed method can improve the average detection precision compared to various versions of the YOLOv5 original model.Compared with other existing unsupervised domain adaptive target detection algorithms,the proposed method is obviously superior in the detection accuracy of small infrared targets.

Key words: Infrared dim and small targets, Object detection, Deep learning, Domain adaptive, Unsupervised

CLC Number: 

  • TP391
[1]ZHU Y,HAO Y G,WANG H Y.Deep learning based salientobject detection in infrared video [J].Computer Science,2023,50(9):227-234.
[2]ZHAO M,LI W,LI L,et al.Single-frame infrared small-target detection:A survey [J].IEEE Geoscience and Remote Sensing Magazine,2022,10(2):87-119.
[3]QIN Y,LI B.Effective infrared small target detection utilizing anovel local contrast method [J].IEEE Geoscience and Remote Sensing Letters,2016,13(12):1890-1894.
[4]JIAO J,XIE Y J,ZHANG H L,et al.Infrared dim and small target detection based on background prediction by wavelet filter [J].Computer Science,2016,43(2):60-63.
[5]CHEN C P,LI H,WEI Y T,et al.A local contrast method forsmall infrared target detection [J].IEEE Transactions on Geoscience and Remote Sensing,2013,52(1):574-581.
[6]DENG H,SUN X,LIU M,et al.Infrared small-target detection using multiscale gray difference weighted image entropy [J].IEEE Transactions on Aerospace and Electronic Systems,2016,52(1):60-72.
[7]GAO C,MENG D,YANG Y,et al.Infrared patch-image model for small target detection in a single image [J].IEEE Transactions on Image Processing,2013,22(12):4996-5009.
[8]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[9]REDMON J,DIVVALA S.GIRSHICK R,et al.You only look once:Unified,real-time object detection [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas,NV,USA:IEEE,2016:779-788.
[10]DAI Y,WU Y,ZHOU F,et al.Asymmetric contextual modulation for infrared small target detection [C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.IEEE,2021:950-959.
[11]ZHANG M,ZHANG R,YANG Y,et al.ISNet:Shape matters for infrared small target detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans,LA,USA:IEEE,2022:877-886.
[12]LI B,XIAO C,WANG Y,et al.Dense nested attention network for infrared small target detection [J].IEEE Transactions on Image Processing,2022,32:1745-1758.
[13]ZHUANG F,QI Z,DUAN K,et al.A comprehensive survey on transfer learning [J].Proceedings of the IEEE,2020,109(1):43-76.
[14]FARAHANI A,VOGHOEI S,RASHEED K,et al.A brief review of domain adaptation [C]//Advances in Data Science and Information Engineering:Proceedings from ICDATA 2020 and IKE 2020.2021:877-894.
[15]CHEN Y,LI W,SAKARIDIS C,et al.Domain adaptive faster r-cnn for object detection in the wild [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake City,UT,USA:IEEE,2018:3339-3348.
[16]SAITO K,USHIKU Y,HARADA T,et al.Strong-weak distribution alignment for adaptive object detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach,CA,USA:IEEE,2019:6956-6965.
[17]LIU X Y,LI W Y,YANG Q S,et al.Towards robust adaptive object detection under noisy annotations [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans,LA,USA:IEEE,2022:14207-14216.
[18]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas,NV,USA:IEEE,2016:770-778.
[19]HE K,ZHANG X,REN S,et al.Spatial pyramid pooling in deep convolutional networks for visual recognition [J].IEEE Tran-sactions on Pattern Analysis and Machine Intelligence,2015,37(9):1904-1916.
[20]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection [C]//Proceedings of the IEEE Conference on Computer Vision and PatternRecognition.Hono-lulu,HI,USA:IEEE,2017:2117-2125.
[21]LIU S,QI L,QIN H F,et al.Path aggregation network for instance segmentation [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake City,UT,USA:IEEE,2018:8759-8768.
[22]BOCHKOVSKIY A,WANG C Y,LIAO H Y M,et al.YOLOv4:Optimal Speed and Accuracy of Object Detection [J].arXiv:2004.10934,2020.
[23]XIA D X,LIU H J,XU L L,et al.Visible-infrared person re-identification with data augmentation via cycle-consistent adversarial network [J].Neurocomputing,2021,443:35-46.
[24]YE M,RUAN W J,DU B,et al.Channel augmented joint lear-ning for visible-infrared recognition [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.IEEE,2021:13567-13576.
[25]LUO Y W,ZHENG L,GUAN T,et al.Taking a closer look at domain shift:Category-level adversaries for semantics consistent domain adaptation [C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.Long Beach,CA,USA:IEEE,2019:2507-2516.
[26]GANIN Y,USTINOVA E,AJAKAN H,et al.Domain-adversarial training of neural networks [J].The Journal of Machine Learning Research,2016,17(1):2096-2030.
[27]BEN-DAVID S,BLITZER J,GRAMMAR K,et al.A theory of learning from different domains [J].Machine Learning,2010,79:151-175.
[28]COLUCCIA A,FASCISTA A,SCHUMANN A,et al.Drone-vs-bird detection challenge at IEEE AVSS2021 [C]//2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance(AVSS).IEEE,2021:1-8.
[29]SUN X L,GUO L C,ZHANG W L,et al.A dataset for small infrared moving target detection under clutter background[EB/OL].https://doi.org/10.11922/sciencedb.j00001.00.
[30]XU C,WANG J W,YANG W,et al.Dot distance for tiny object detection in aerial images [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.IEEE,2021:1192-1201.
[31]LIU T,ZHAO Y,WEI Y C,et al.Concealed object detection for activate millimeter wave image [J].IEEE Transactions on Industrial Electronics,2019,66(12):9909-9917.
[1] DU Yu, YU Zishu, PENG Xiaohui, XU Zhiwei. Padding Load:Load Reducing Cluster Resource Waste and Deep Learning Training Costs [J]. Computer Science, 2024, 51(9): 71-79.
[2] XU Jinlong, GUI Zhonghua, LI Jia'nan, LI Yingying, HAN Lin. FP8 Quantization and Inference Memory Optimization Based on MLIR [J]. Computer Science, 2024, 51(9): 112-120.
[3] WANG Jiahui, PENG Guangling, DUAN Liang, YUAN Guowu, YUE Kun. Few-shot Shadow Removal Method for Text Recognition [J]. Computer Science, 2024, 51(9): 147-154.
[4] LI Yunchen, ZHANG Rui, WANG Jiabao, LI Yang, WANG Ziqi, CHEN Yao. Re-parameterization Enhanced Dual-modal Realtime Object Detection Model [J]. Computer Science, 2024, 51(9): 162-172.
[5] SUN Yumo, LI Xinhang, ZHAO Wenjie, ZHU Li, LIANG Ya’nan. Driving Towards Intelligent Future:The Application of Deep Learning in Rail Transit Innovation [J]. Computer Science, 2024, 51(8): 1-10.
[6] KONG Lingchao, LIU Guozhu. Review of Outlier Detection Algorithms [J]. Computer Science, 2024, 51(8): 20-33.
[7] TANG Ruiqi, XIAO Ting, CHI Ziqiu, WANG Zhe. Few-shot Image Classification Based on Pseudo-label Dependence Enhancement and NoiseInterferenceReduction [J]. Computer Science, 2024, 51(8): 152-159.
[8] XIAO Xiao, BAI Zhengyao, LI Zekai, LIU Xuheng, DU Jiajin. Parallel Multi-scale with Attention Mechanism for Point Cloud Upsampling [J]. Computer Science, 2024, 51(8): 183-191.
[9] PU Bin, LIANG Zhengyou, SUN Yu. Monocular 3D Object Detection Based on Height-Depth Constraint and Edge Fusion [J]. Computer Science, 2024, 51(8): 192-199.
[10] ZHANG Junsan, CHENG Ming, SHEN Xiuxuan, LIU Yuxue, WANG Leiquan. Diversified Label Matrix Based Medical Image Report Generation [J]. Computer Science, 2024, 51(8): 200-208.
[11] GUO Fangyuan, JI Genlin. Video Anomaly Detection Method Based on Dual Discriminators and Pseudo Video Generation [J]. Computer Science, 2024, 51(8): 217-223.
[12] CHEN Siyu, MA Hailong, ZHANG Jianhui. Encrypted Traffic Classification of CNN and BiGRU Based on Self-attention [J]. Computer Science, 2024, 51(8): 396-402.
[13] YANG Heng, LIU Qinrang, FAN Wang, PEI Xue, WEI Shuai, WANG Xuan. Study on Deep Learning Automatic Scheduling Optimization Based on Feature Importance [J]. Computer Science, 2024, 51(7): 22-28.
[14] LI Jiaying, LIANG Yudong, LI Shaoji, ZHANG Kunpeng, ZHANG Chao. Study on Algorithm of Depth Image Super-resolution Guided by High-frequency Information ofColor Images [J]. Computer Science, 2024, 51(7): 197-205.
[15] LOU Zhengzheng, ZHANG Xin, HU Shizhe, WU Yunpeng. Foggy Weather Object Detection Method Based on YOLOX_s [J]. Computer Science, 2024, 51(7): 206-213.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!