Computer Science (计算机科学), 2024, Vol. 51, Issue (2): 151-160. doi: 10.11896/jsjkx.221200045
张国栋1, 陈志华1, 盛斌2
ZHANG Guodong1, CHEN Zhihua1, SHENG Bin2
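Abstract: Object detection methods based on deep neural networks, with their strong modeling capacity, perform well on general object detection tasks. However, infrared small targets are inherently weak in signal and occupy only a few pixels, so deepening the network and relying heavily on pooling causes the semantic information of small targets to be lost, and existing methods detect them poorly. Starting from the characteristics of infrared small targets, this paper proposes a novel target detection algorithm based on a dilated convolutional conditional generative adversarial network. The generator is designed with dilated convolutions; it makes full use of contextual information to build associations between layers, carries more of the semantic information of infrared small targets into the deep layers, and enhances target features, thereby improving detection performance. In addition, a hybrid attention module that fuses the channel and spatial dimensions is designed to selectively amplify target information and suppress background information during feature extraction, and a self-attention association module is designed to resolve the semantic conflicts that arise when information is fused between layers. The proposed model is compared with current state-of-the-art infrared small target detection methods using several evaluation metrics, demonstrating its superior detection performance against complex backgrounds. On the public SIRST dataset, the proposed model achieves an F-score of 64.70%, which is 8.29% higher than traditional methods and 7.29% higher than deep learning methods; on the public ISOS dataset, it achieves an F-score of 64.54%, which is 23.59% higher than traditional methods and 6.58% higher than deep learning methods.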
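The abstract's central claim is that dilated convolution enlarges the context a layer sees without pooling, so a target only a few pixels wide is not downsampled away. The following is a minimal one-dimensional sketch of that idea only; it is not the authors' generator network, and the function names `dilated_conv1d` and `receptive_field` are hypothetical, introduced here for illustration.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1D cross-correlation whose kernel taps are `dilation` samples apart."""
    # Number of input samples covered by one output value.
    span = (len(kernel) - 1) * dilation + 1
    out = []
    for start in range(len(signal) - span + 1):
        acc = 0
        for i, weight in enumerate(kernel):
            acc += weight * signal[start + i * dilation]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Receptive field of a stack of stride-1 dilated convolutions (no pooling)."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


if __name__ == "__main__":
    signal = [1, 2, 3, 4, 5]
    kernel = [1, 1, 1]
    # Ordinary convolution: each output sums 3 adjacent samples.
    print(dilated_conv1d(signal, kernel, dilation=1))
    # Dilation 2: the same 3 weights now cover 5 samples.
    print(dilated_conv1d(signal, kernel, dilation=2))
    # Three stacked 3-tap layers, with and without growing dilation.
    print(receptive_field(3, [1, 1, 1]), receptive_field(3, [1, 2, 4]))
```

With the same three weights per layer, growing the dilation roughly doubles the receptive field of the stack while the stride stays at 1, so the spatial resolution is kept. The dilation rates used here are illustrative and are not taken from the paper.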