基于小目标特征增强RT-DETR的SAR图像舰船目标检测方法

doi:10.11896/jsjkx.250100097

Abstract

Abstract: In ship detection tasks,SAR images are widely used in maritime resource management,search and rescue,and other scenarios due to their excellent imaging conditions.However,traditional target detection algorithms perform poorly due to issues such as the small size of ships and sea surface clutter.Recently,many algorithms have introduced the attention mechanism of Transformer to achieve better semantic interpretation or adopted more complex network structures to improve feature extraction capabilities.This has improved detection accuracy to some extent but has sacrificed detection speed.This paper proposes a ship detection method for SAR images based on small target feature enhanced RT-DETR.The method consists of three parts:1)Large model prompt generation network:Leveraging the zero-shot learning capability of multimodal large models,prompts are generated to extract more discriminative information from the image modality;2)AIFI-EAA module:Using RT-DETR as the baseline,the scale-invariant feature interaction module is improved by introducing an efficient additive attention mechanism to reduce the computational complexity of the algorithm;3)Lightweight small target feature enhancement fusion network:A small target detection layer is added to the multi-scale feature fusion network,and the CSP-OmniKernel module is designed for multi-scale feature fusion to enhance small target detection performance.Experiments on three public datasets(SSDD,HRSID,and SAR-Ship-Dataset) demonstrate that the proposed method has advantages in terms of accuracy.

Key words: Ship detection,SAR image,Lightweight,RT-DETR,Small target detection

CLC Number:

TP751

ZHANG Hongsen, WU Wei, XU Jian, WU Fei, JI Yimu. Ship Detection Method for SAR Images Based on Small Target Feature Enhanced RT-DETR[J].Computer Science, 2025, 52(10): 151-158.

References

[1]CUI Z,QUAN H,CAO Z,et al.Sar target cfar detection via gpu oarallel operation[J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2018,11(12):4884-4894.
[2]LIN H,LIU J,LI X,et al.Dcea:Detr with concentrated deformable attention for end-to-end ship detection in sar images[J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2024,17:17292-17307.
[3]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,39(6):1137-1149.
[4]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:Unified,real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[5]CARION N,MASSA F,SYNNAEVE G,et al.End-to-end object detection with transformers[C]//European Conference on Computer Vision.2020:213-229.
[6]ZHAO Y,LV W,XU S,et al.Detrs beat yolos on real-time object detection[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:16965-16974.
[7]LIU L,FU L,ZHANG Y,et al.Clfr-det:Cross-level feature refinement detector for tiny-ship detection in sar images[J].Knowledge-Based Systems,2024,284:111284.
[8]CAI X,LAI Q,WANG Y,et al.Poly kernel inception network for remote sensing detection[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:27706-27716.
[9]QIN C,WANG X,LIU Y,et al.A novel end-to-end transformer network for small scale ship detection in sar images[C]//International Geoscience and Remote Sensing Symposium.2024:8158-8162.
[10]BEYER L,STEINER A,PINTO A S,et al.Paligemma:A versatile 3b vlm for transfer[J].arXiv:2407.07726,2024.
[11]SHAKER A,MAAZ M,RASHEED H,et al.Swiftformer:Efficient additive attention for transformer-based real-time mobile vision applications[C]//IEEE/CVF International Conference on Computer Vision.2023:17425-17436.
[12]CUI Y,REN W,KNOLL A.Omni-kernel network for image restoration[C]//AAAI Conference on Artificial Intelligence.2024:1426-1434.
[13]LI J,QU C,SHAO J.Ship detection in sar images based on an improved faster r-cnn[C]//SAR in Big Data Era:Models,Me-thods and Applications.2017:1-6.
[14]WEI S,ZENG X,QU Q,et al.HRSID:A high-resolution sar images dataset for ship detection and instance segmentation[J].IEEE Access,2020,8:120234-120254.
[15]WANG Y,WANG C,ZHANG H,et al.A sar dataset of ship detection for deep learning under complex backgrounds[J].Remote Sensing,2019,11(7):765.
[16]HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[17]CHEN J,LEI B,SONG Q,et al.A hierarchical graph network for 3d object detection on point clouds[C]//IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2020:392-401.
[18]DING X,ZHANG X,MA N,et al.Repvgg:Making vgg-styleconvnets great again[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:13733-13742.
[19]YAN H,LIU Y L,JIN L W,et al.The development,applica-tion,and future of llm similar to chatgpt[J].Journal of Image and Graphics,2023,28(9):2749-2762.
[20]LI L H,ZHANG P,ZHANG H,et al.Grounded language-image pre-training[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:10965-10975.
[21]XU Y,ZHANG M,FU C,et al.Multi-modal queried object detection in the wild[C]//Advances in Neural Information Processing Systems.2024:1-18.
[22]ZHOU K,YANG J,LOY C C,et al.Learning to prompt for vision-language models[J].International Journal of Computer Vision,2022,130(9):2337-2348.
[23]KHATTAK M U,RASHEED H,MAAZ M,et al.Maple:Multi-modal prompt learning[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:19113-19122.
[24]ZHANG H,LI F,LIU S,et al.Dino:Detr with improved denoi-sing anchor boxes for end-to-end object detection[J].arXiv:2203.03605,2022.
[25]ZHU X,SU W,LU L,et al.Deformable detr:Deformable transformers for end-to-end object detection[C]//International Conference on Learning Representations.2021:1-16.
[26]ZONG Z,SONG G,LIU Y.Detrs with collaborative hybrid assignments training[C]//IEEE/CVF International Conference on Computer Vision.2023:6748-6758.
[27]JOCHER G,NISHIMURA K,MINEEVA T,et al.Yolov8 byultralytics[EB/OL].https://github.com/ultralytics/ultraly-tics.
[28]SHEN J,BAI L,ZHANG Y,et al.Ellk-net:An efficient light-weight large kernel network for sar ship detection[J].IEEE Transactions on Geoscience and Remote Sensing,2024,62:5221514.

Related Articles 15

[1]	YANG Feixia, LI Zheng, MA Fei. Research on Hyperspectral Image Super-resolution Methods Based on Tensor Ring SubspaceSmoothing and Graph Regularization [J]. Computer Science, 2025, 52(8): 240-250.
[2]	FEI Chunguo, CHEN Shihong. FOD Segmentation Method Based on Dual-channel Sparrow Search Algorithm-enhanced OTSU [J]. Computer Science, 2025, 52(6A): 240700089-7.
[3]	ZHANG Dabin, WU Qin, ZHOU Haojie. Oriented Object Detection Based on Multi-scale Perceptual Enhancement [J]. Computer Science, 2025, 52(6): 247-255.
[4]	YUAN Lin, HUANG Ling, HAO Kaile, ZHANG Jiawei, ZHU Mingrui, WANG Nannan, GAO Xinbo. Adversarial Face Privacy Protection Based on Makeup Style Patch Activation [J]. Computer Science, 2025, 52(6): 405-413.
[5]	SI Weina, YE Jun, JIANG Bin. Hyperspectral Image Denoising Combining Group Sparse and Representative Coefficient Bidirectional Spatial Spectral Total Variation [J]. Computer Science, 2024, 51(12): 199-208.
[6]	GUO Zhangxiang, YAN Tianhong, ZHOU Guoqiang. Research Progress of 3D Point Cloud Data Processing Methods [J]. Computer Science, 2024, 51(11A): 240100132-13.
[7]	DONG Yan, WEI Minghong, GAO Guangshuai, LIU Zhoufeng, LI Chunlei. Remote Sensing Orineted Object Detection Method Based on Dual-label Assignment [J]. Computer Science, 2024, 51(11A): 240100058-9.
[8]	GAO Wenbin. A Robust Method for Range Grating Lobe Suppression in Stepped Frequency SAR [J]. Computer Science, 2024, 51(8): 209-216.
[9]	SUN Jifei, JIA Kebin. Classification and Detection Algorithm of Ground-based Cloud Images Based on Multi-scale Features [J]. Computer Science, 2024, 51(6A): 230400041-6.
[10]	HE Xinyu, LU Chenxin, FENG Shuyi, OUYANG Shangrong, MU Wentao. Ship Detection and Recognition of Optical Remote Sensing Images for Embedded Platform [J]. Computer Science, 2024, 51(6A): 230700117-7.
[11]	JIANG Bin, YE Jun, ZHANG Lihong, SI Weina. Hyperspectral Image Recovery Model Based on Bi-smoothing Function Rank Approximation andGroup Sparse [J]. Computer Science, 2024, 51(5): 151-161.
[12]	LIU Changxin, WU Ning, HU Lirui, GAO Ba, GAO Xueshan. Recursive Gated Convolution Based Super-resolution Network for Remote Sensing Images [J]. Computer Science, 2024, 51(2): 205-216.
[13]	CHEN Meiying, BI Xiuli, LIU Bo. Image Retargeting Method Based on Grids and Superpixels [J]. Computer Science, 2023, 50(11A): 221100153-8.
[14]	SHI Ying, HE Xinguang, LIU Binrui. Remote Sensing Image Fusion Method Combining Edge Detection and Parameter-adaptive PCNN [J]. Computer Science, 2023, 50(11A): 220900264-6.
[15]	WANG Shan, LIU Lu. Soil Moisture Data Reconstruction Based on Low Rank Matrix Completion Method [J]. Computer Science, 2023, 50(11A): 230300073-6.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Ship Detection Method for SAR Images Based on Small Target Feature Enhanced RT-DETR

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0