Computer Science ›› 2024, Vol. 51 ›› Issue (2): 151-160.doi: 10.11896/jsjkx.221200045

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Infrared Small Target Detection Based on Dilated Convolutional Conditional GenerativeAdversarial Networks

ZHANG Guodong1, CHEN Zhihua1, SHENG Bin2   

  1. 1 School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China
    2 School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,Shanghai 200240,China
  • Received:2022-12-07 Revised:2023-03-24 Online:2024-02-15 Published:2024-02-22
  • About author:ZHANG Guodong,born in 1997,postgraduate,is a member of CCF(No.E2434G).His main research interests include computer vision and small target detection.CHEN Zhihua,born in 1969,Ph.D,professor,Ph.Dsupervisor,is a member of CCF(No.12441D).His main research interests include computer vision,machine learning,object detection and image video processing.
  • Supported by:
    National Natural Science Foundation of China(62272164)and Science and Technology on Space Intelligent Control Laboratory(HTKJ2022KL502010).

Abstract: Deep-learning based object detection methods have achieved great performance in general object detection tasks by virtue of their powerful modeling capabilities.However,the design of deeper network and the abuse of pooling operations also lead to semantic information loss which suppress their performance when detecting infrared small targets with low signal-noise-ratio and small pixel essential features.This paper proposes a novel infrared small target detection algorithm based on dilated convolution conditional generative adversarial network.A dilated convolution stacked generative network makes full use of context information to establish layer-to-layer correlations and facilitate semantic information retainment of infrared small targets in the deep network.In addition,the generative network integrates the channel-space-mixed attention module which selectively amplifies target information and suppresses background clusters.Furthermore,a self-attention association module is proposed to deal with semantic conflict generated during the fusion process between layers.A variety of evaluation metrics are used to compare the proposed method with other state-of-the-arts at present to demonstrate the superiority of the proposed method in complex backgrounds.On the public SIRST dataset,the F score of the proposed model is 64.70% which is 8.29% higher than the traditional method and 7.29% higher than the deep learning method.On the public ISOS dataset,the F score is 64.54%,which is 23.59% higher than the traditional method and 6.58% higher than the deep learning method.

Key words: Infrared small target detection, Conditional generative adversarial network, Feature fusion, Attention mechanism, Dilated convolution

CLC Number: 

  • TP391
[1]HAN R Z,FENG W,GUO Q,et al.A review of the research progress of video single target tracking [J].Chinese Journal of Computers,2022,45(9):1877-1907.
[2]CHEN Q,WU C,WANG Y.Robust principal component analysis-based infrared small target detection[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:9925-9926.
[3]LIU T,LIU H,LI Y F,et al.Flexible FTIR spectral imaging enhancement for industrial robot infrared vision sensing[J].IEEE Transactions on Industrial Informatics,2019,16(1):544-554.
[4]ZHU X,HU Z,HUANG S,et al.Infrared Invisible Clothing:Hiding from Infrared Detectors at Multiple Angles in Real World[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:13317-13326.
[5]YU Q,XIE L,WANG Y,et al.Recurrent saliency transformation network:Incorporating multi-stage visual cues for small organ segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8280-8289.
[6]ZHANG M,ZHANG R,YANG Y,et al.ISNet:Shape Matters for Infrared Small Target Detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:877-886.
[7]YANG C,HUANG Z,WANG N.QueryDet:Cascaded sparsequery for accelerating high-resolution small object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:13668-13677.
[8]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single shotmultibox detector[C]// 2016 Computer Vision-ECCV,2016:21-37.
[9]LIM J S,ASTRID M,YOON H J,et al.Small object detection using context and attention[C]//2021 International Conference on Artificial Intelligence in Information and Communication(ICAIIC).IEEE,2021:181-186.
[10]HAMAGUCHI R,FUJITA A,NEMOTO K,et al.Effective use of dilated convolutions for segmenting small object instances in remote sensing imagery[C]//2018 IEEE Winter Conference on Applications of Computer Vision(WACV).IEEE,2018:1442-1450.
[11]WANG H,ZHOU L,WANG L.Miss detection vs.false alarm:Adversarial learning for small object segmentation in infrared images[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8509-8518.
[12]ZHOU P,XIE L,NI B,et al.Omni-gan:On the secrets of cgans and beyond[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:14061-14071.
[13]BAI K,WANG Y,SONG Q.Patch similarity based edge-preserving background estimation for singleframe infrared small target detection[C]//2016 IEEE International Conference on Image Processing(ICIP).IEEE,2016:181-185.
[14]DENG H,SUN X,ZHOU X.A multiscale fuzzy metric for detecting small infrared targets against chaotic cloudy/sea-skybackgrounds[J].IEEE Transactions on Cybernetics,2018,49(5):1694-1707.
[15]LIANG Z,LIU W,YAO R.Contrast enhancement by nonlinear diffusion filtering[J].IEEE Transactions on Image Processing,2015,25(2):673-686.
[16]CHEN C L P,LI H,WEI Y,et al.A local contrast method for small infrared target detection[J].IEEE Transactions on Geo-science and Remote Sensing,2013,52(1):574-581.
[17]DAI Y,WU Y,ZHOU F,et al.Asymmetric contextual modulation for infrared small target detection[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2021:950-959.
[18]WANG K,DU S,LIU C,et al.Interior Attention-Aware Network for Infrared Small Target Detection[J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-13.
[19]CHEN Q,ZHANG W,ZHOU N,et al.Adaptive fractional dila-ted convolution network for image aesthetics assessment[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:14114-14123.
[20]ZHUANG C,LU Z,WANG Y,et al.ACDNet:Adaptively combined dilated convolution for monocular panorama depth estimation[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:3653-3661.
[21]LI Y,CHEN Y,WANG N,et al.Scale-aware trident networks for object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6054-6063.
[22]TAKAHASHI N,MITSUFUJI Y.Densely connected multi-dilated convolutional networks for dense prediction tasks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:993-1002.
[23]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[24]XU H,YAO L,ZHANG W,et al.Auto-fpn:Automatic network architecture adaptation for object detection beyond classification[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6649-6658.
[25]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[26]ZHAO B,WANG C,FU Q,et al.A novel pattern for infrared small target detection with generative adversarial network[J].IEEE Transactions on Geoscience and Remote Sensing,2020,59(5):4481-4492.
[27]RONNEBERGER O,FISCHER P,BROX T.U-net:Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Compu-ter-assisted Intervention.Cham:Springer,2015:234-241.
[28]VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[J].arXiv:1706.03762,2017.
[29]WANG X,GIRSHICK R,GUPTA A,et al.Non-local neural networks[C]//Proceedings of the IEEE Conference on Compu-ter Vision and Pattern Recognition.2018:7794-7803.
[30]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[31]FU J,LIU J,TIAN H,et al.Dual attention network for scene segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:3146-3154.
[32]BEHERA A,WHARTON Z,HEWAGE P R P G,et al.Con-text-aware attentional pooling(cap) for fine-grained visual classification[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:929-937.
[33]LI C,QIU Z,CAO X,et al.Hybrid dilated convolution with multi-scale residual fusion network for hyperspectral image classification[J].Micromachines,2021,12(5):545.
[34]GOODFELLOW I,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial networks[J].Communications of the ACM,2020,63(11):139-144.
[35]DAI Y,WU Y,ZHOU F,et al.Attentional local contrast networks for infrared small target detection[J].IEEE Transactions on Geoscience and Remote Sensing,2021,59(11):9813-9824.
[36]WEI Y,YOU X,LI H.Multiscale patch-based contrast measure for small infrared target detection[J].Pattern Recognition,2016,58:216-226.
[37]DAI Y,WU Y,SONG Y,et al.Non-negative infrared patch-image model:Robust target-background separation via partial sum minimization of singular values[J].Infrared Physics & Technology,2017,81:182-194.
[38]DAI Y,WU Y.Reweighted infrared patch-tensor model withboth nonlocal and local priors for single-frame small target detection[J].IEEE journal of selected topics in applied earth observations and remote sensing,2017,10(8):3752-3767.
[39]GAO C,MENG D,YANG Y,et al.Infrared patch-image model for small target detection in a single image[J].IEEE transactions on image processing,2013,22(12):4996-5009.
[40]LI B,XIAO C,WANG L,et al.Dense nested attention network for infrared small target detection[J].arXiv:2106.00487,2021.
[1] ZHANG Mingdao, ZHOU Xin, WU Xiaohong, QING Linbo, HE Xiaohai. Unified Fake News Detection Based on Semantic Expansion and HDGCN [J]. Computer Science, 2024, 51(4): 299-306.
[2] WANG Ruiping, WU Shihong, ZHANG Meihang, WANG Xiaoping. Review of Vision-based Neural Network 3D Dynamic Gesture Recognition Methods [J]. Computer Science, 2024, 51(4): 193-208.
[3] XUE Jinqiang, WU Qin. Progressive Multi-stage Image Denoising Algorithm Combining Convolutional Neural Network and
Multi-layer Perceptron
[J]. Computer Science, 2024, 51(4): 243-253.
[4] WANG Zihong, SHAO Yingxia, HE Jiyuan, LIU Jinbao. Sequential Recommendation Based on Multi-space Attribute Information Fusion [J]. Computer Science, 2024, 51(3): 102-108.
[5] HAO Ran, WANG Hongjun, LI Tianrui. Deep Neural Network Model for Transmission Line Defect Detection Based on Dual-branch Sequential Mixed Attention [J]. Computer Science, 2024, 51(3): 135-140.
[6] ZHANG Yang, XIA Ying. Object Detection Method with Multi-scale Feature Fusion for Remote Sensing Images [J]. Computer Science, 2024, 51(3): 165-173.
[7] LI Yu, YANG Xiangli, ZHANG Le, LIANG Yalin, GAO Xian, YANG Jianxi. Combined Road Segmentation and Contour Extraction for Remote Sensing Images Based on Cascaded U-Net [J]. Computer Science, 2024, 51(3): 174-182.
[8] LIAO Meng, JIA Zhen, LI Tianrui. Chinese Named Entity Recognition Based on Label Information Fusion and Multi-task Learning [J]. Computer Science, 2024, 51(3): 198-204.
[9] SUN Shounan, WANG Jingbin, WU Renfei, YOU Changkai, KE Xifan, HUANG Hao. TMGAT:Graph Attention Network with Type Matching Constraint [J]. Computer Science, 2024, 51(3): 235-243.
[10] QIAO Fan, WANG Peng, WANG Wei. Multivariate Time Series Classification Algorithm Based on Heterogeneous Feature Fusion [J]. Computer Science, 2024, 51(2): 36-46.
[11] LIU Xuheng, BAI Zhengyao, XU Zhu, DU Jiajin, XIAO Xiao. Multi-guided Point Cloud Registration Network Combined with Attention Mechanism [J]. Computer Science, 2024, 51(2): 142-150.
[12] ZHAO Jiangfeng, HE Hongjie, CHEN Fan, YANG Shubin. Two-stage Visible Watermark Removal Model Based on Global and Local Features for Document Images [J]. Computer Science, 2024, 51(2): 172-181.
[13] ZHANG Feng, HUANG Shixin, HUA Qiang, DONG Chunru. Novel Image Classification Model Based on Depth-wise Convolution Neural Network andVisual Transformer [J]. Computer Science, 2024, 51(2): 196-204.
[14] LIAO Xingbin, QIAN Yangge, WANG Qianlei, QIN Xiaolin. Hierarchical Document Classification Method Based on Improved Self-attention Mechanism and Representation Learning [J]. Computer Science, 2024, 51(2): 238-244.
[15] WANG Weijia, XIONG Wenzhuo, ZHU Shengjie, SONG Ce, SUN He, SONG Yulong. Method of Infrared Small Target Detection Based on Multi-depth Feature Connection [J]. Computer Science, 2024, 51(1): 175-183.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!