计算机科学 ›› 2024, Vol. 51 ›› Issue (2): 151-160.doi: 10.11896/jsjkx.221200045

• 计算机图形学&多媒体 • 上一篇    下一篇

基于扩张卷积条件生成对抗网络的红外小目标检测

张国栋1, 陈志华1, 盛斌2   

  1. 1 华东理工大学信息科学与工程学院 上海200237
    2 上海交通大学电子信息与电气工程学院 上海200240
  • 收稿日期:2022-12-07 修回日期:2023-03-24 出版日期:2024-02-15 发布日期:2024-02-22
  • 通讯作者: 陈志华(czh@ecust.edu.cn)
  • 作者简介:(1831551351@qq.com)
  • 基金资助:
    国家自然科学基金(62272164);空间智能控制技术实验室开放基金(HTKJ2022KL502010)

Infrared Small Target Detection Based on Dilated Convolutional Conditional GenerativeAdversarial Networks

ZHANG Guodong1, CHEN Zhihua1, SHENG Bin2   

  1. 1 School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China
    2 School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,Shanghai 200240,China
  • Received:2022-12-07 Revised:2023-03-24 Online:2024-02-15 Published:2024-02-22
  • About author:ZHANG Guodong,born in 1997,postgraduate,is a member of CCF(No.E2434G).His main research interests include computer vision and small target detection.CHEN Zhihua,born in 1969,Ph.D,professor,Ph.Dsupervisor,is a member of CCF(No.12441D).His main research interests include computer vision,machine learning,object detection and image video processing.
  • Supported by:
    National Natural Science Foundation of China(62272164)and Science and Technology on Space Intelligent Control Laboratory(HTKJ2022KL502010).

摘要: 基于深度神经网络的目标检测方法凭借自身强大的建模能力,在通用目标检测任务中取得了良好的表现。然而,在红外小目标信号弱、像素小的本质特征的影响下,深度神经网络层次的加深和池化操作的大量使用导致小目标语义信息丢失,使得现有方法的检测效果并不理想。文中从红外小目标特性这一关键问题出发,提出了一种新颖的基于扩张卷积条件生成对抗网络的目标检测算法。所提方法应用扩张卷积设计了生成网络,充分利用上下文信息建立层与层之间的关联,将红外小目标更多的语义信息保留到深层网络中,增强目标特征,进而提高检测性能。此外,设计了融合通道与空间维度的混合注意力模块,在特征提取时有选择性地放大目标信息,抑制背景信息;设计了自注意关联模块处理层与层之间信息融合过程中产生的语义冲突问题。文中使用多种评价指标将所提网络模型与目前先进的其他红外小目标检测方法进行对比,证明了该方法在复杂背景下目标检测性能的优越性。在公开的SIRST数据集上,所提模型的F分数为64.70%,相比传统方法提高了8.29%,相比深度学习方法提高了7.29%;在公开的ISOS数据集上,所提模型的F分数为64.54%,相比传统方法提高了23.59%,相比深度学习方法提高了6.58%。

关键词: 红外小目标检测, 条件生成对抗网络, 特征融合, 注意力机制, 扩张卷积

Abstract: Deep-learning based object detection methods have achieved great performance in general object detection tasks by virtue of their powerful modeling capabilities.However,the design of deeper network and the abuse of pooling operations also lead to semantic information loss which suppress their performance when detecting infrared small targets with low signal-noise-ratio and small pixel essential features.This paper proposes a novel infrared small target detection algorithm based on dilated convolution conditional generative adversarial network.A dilated convolution stacked generative network makes full use of context information to establish layer-to-layer correlations and facilitate semantic information retainment of infrared small targets in the deep network.In addition,the generative network integrates the channel-space-mixed attention module which selectively amplifies target information and suppresses background clusters.Furthermore,a self-attention association module is proposed to deal with semantic conflict generated during the fusion process between layers.A variety of evaluation metrics are used to compare the proposed method with other state-of-the-arts at present to demonstrate the superiority of the proposed method in complex backgrounds.On the public SIRST dataset,the F score of the proposed model is 64.70% which is 8.29% higher than the traditional method and 7.29% higher than the deep learning method.On the public ISOS dataset,the F score is 64.54%,which is 23.59% higher than the traditional method and 6.58% higher than the deep learning method.

Key words: Infrared small target detection, Conditional generative adversarial network, Feature fusion, Attention mechanism, Dilated convolution

中图分类号: 

  • TP391
[1]HAN R Z,FENG W,GUO Q,et al.A review of the research progress of video single target tracking [J].Chinese Journal of Computers,2022,45(9):1877-1907.
[2]CHEN Q,WU C,WANG Y.Robust principal component analysis-based infrared small target detection[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:9925-9926.
[3]LIU T,LIU H,LI Y F,et al.Flexible FTIR spectral imaging enhancement for industrial robot infrared vision sensing[J].IEEE Transactions on Industrial Informatics,2019,16(1):544-554.
[4]ZHU X,HU Z,HUANG S,et al.Infrared Invisible Clothing:Hiding from Infrared Detectors at Multiple Angles in Real World[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:13317-13326.
[5]YU Q,XIE L,WANG Y,et al.Recurrent saliency transformation network:Incorporating multi-stage visual cues for small organ segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8280-8289.
[6]ZHANG M,ZHANG R,YANG Y,et al.ISNet:Shape Matters for Infrared Small Target Detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:877-886.
[7]YANG C,HUANG Z,WANG N.QueryDet:Cascaded sparsequery for accelerating high-resolution small object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:13668-13677.
[8]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single shotmultibox detector[C]// 2016 Computer Vision-ECCV,2016:21-37.
[9]LIM J S,ASTRID M,YOON H J,et al.Small object detection using context and attention[C]//2021 International Conference on Artificial Intelligence in Information and Communication(ICAIIC).IEEE,2021:181-186.
[10]HAMAGUCHI R,FUJITA A,NEMOTO K,et al.Effective use of dilated convolutions for segmenting small object instances in remote sensing imagery[C]//2018 IEEE Winter Conference on Applications of Computer Vision(WACV).IEEE,2018:1442-1450.
[11]WANG H,ZHOU L,WANG L.Miss detection vs.false alarm:Adversarial learning for small object segmentation in infrared images[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8509-8518.
[12]ZHOU P,XIE L,NI B,et al.Omni-gan:On the secrets of cgans and beyond[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:14061-14071.
[13]BAI K,WANG Y,SONG Q.Patch similarity based edge-preserving background estimation for singleframe infrared small target detection[C]//2016 IEEE International Conference on Image Processing(ICIP).IEEE,2016:181-185.
[14]DENG H,SUN X,ZHOU X.A multiscale fuzzy metric for detecting small infrared targets against chaotic cloudy/sea-skybackgrounds[J].IEEE Transactions on Cybernetics,2018,49(5):1694-1707.
[15]LIANG Z,LIU W,YAO R.Contrast enhancement by nonlinear diffusion filtering[J].IEEE Transactions on Image Processing,2015,25(2):673-686.
[16]CHEN C L P,LI H,WEI Y,et al.A local contrast method for small infrared target detection[J].IEEE Transactions on Geo-science and Remote Sensing,2013,52(1):574-581.
[17]DAI Y,WU Y,ZHOU F,et al.Asymmetric contextual modulation for infrared small target detection[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2021:950-959.
[18]WANG K,DU S,LIU C,et al.Interior Attention-Aware Network for Infrared Small Target Detection[J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-13.
[19]CHEN Q,ZHANG W,ZHOU N,et al.Adaptive fractional dila-ted convolution network for image aesthetics assessment[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:14114-14123.
[20]ZHUANG C,LU Z,WANG Y,et al.ACDNet:Adaptively combined dilated convolution for monocular panorama depth estimation[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:3653-3661.
[21]LI Y,CHEN Y,WANG N,et al.Scale-aware trident networks for object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6054-6063.
[22]TAKAHASHI N,MITSUFUJI Y.Densely connected multi-dilated convolutional networks for dense prediction tasks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:993-1002.
[23]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[24]XU H,YAO L,ZHANG W,et al.Auto-fpn:Automatic network architecture adaptation for object detection beyond classification[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6649-6658.
[25]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[26]ZHAO B,WANG C,FU Q,et al.A novel pattern for infrared small target detection with generative adversarial network[J].IEEE Transactions on Geoscience and Remote Sensing,2020,59(5):4481-4492.
[27]RONNEBERGER O,FISCHER P,BROX T.U-net:Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Compu-ter-assisted Intervention.Cham:Springer,2015:234-241.
[28]VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[J].arXiv:1706.03762,2017.
[29]WANG X,GIRSHICK R,GUPTA A,et al.Non-local neural networks[C]//Proceedings of the IEEE Conference on Compu-ter Vision and Pattern Recognition.2018:7794-7803.
[30]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[31]FU J,LIU J,TIAN H,et al.Dual attention network for scene segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:3146-3154.
[32]BEHERA A,WHARTON Z,HEWAGE P R P G,et al.Con-text-aware attentional pooling(cap) for fine-grained visual classification[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:929-937.
[33]LI C,QIU Z,CAO X,et al.Hybrid dilated convolution with multi-scale residual fusion network for hyperspectral image classification[J].Micromachines,2021,12(5):545.
[34]GOODFELLOW I,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial networks[J].Communications of the ACM,2020,63(11):139-144.
[35]DAI Y,WU Y,ZHOU F,et al.Attentional local contrast networks for infrared small target detection[J].IEEE Transactions on Geoscience and Remote Sensing,2021,59(11):9813-9824.
[36]WEI Y,YOU X,LI H.Multiscale patch-based contrast measure for small infrared target detection[J].Pattern Recognition,2016,58:216-226.
[37]DAI Y,WU Y,SONG Y,et al.Non-negative infrared patch-image model:Robust target-background separation via partial sum minimization of singular values[J].Infrared Physics & Technology,2017,81:182-194.
[38]DAI Y,WU Y.Reweighted infrared patch-tensor model withboth nonlocal and local priors for single-frame small target detection[J].IEEE journal of selected topics in applied earth observations and remote sensing,2017,10(8):3752-3767.
[39]GAO C,MENG D,YANG Y,et al.Infrared patch-image model for small target detection in a single image[J].IEEE transactions on image processing,2013,22(12):4996-5009.
[40]LI B,XIAO C,WANG L,et al.Dense nested attention network for infrared small target detection[J].arXiv:2106.00487,2021.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!