Computer Science ›› 2025, Vol. 52 ›› Issue (6): 247-255.doi: 10.11896/jsjkx.240300076

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Oriented Object Detection Based on Multi-scale Perceptual Enhancement

ZHANG Dabin1, WU Qin1,2, ZHOU Haojie1   

  1. 1 School of Artificial Intelligence and Computer Science,Jiangnan University,Wuxi,Jiangsu 214122,China
    2 Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence,Wuxi,Jiangsu 214122,China
  • Received:2024-03-12 Revised:2024-07-12 Online:2025-06-15 Published:2025-06-11
  • About author:ZHANG Dabin,born in 1999,postgra-duate,is a member of CCF(No.T4849G).His main research interests include computer vision and deep lear-ning.
    HOU Haojie,born in 1981,Ph.D,asso-ciate professor,is a member of CCF(No.19225S).His main research interests include system architecture,intelligent system and distributed computing.

Abstract: Oriented object detection in remote sensing images is more challenging due to the issues of complex background,dense distribution and with arbitrary direction,large-scale variation,high aspect-ratio of objects.To address these issues,this paper proposes a framework for oriented object detection in remote sensing images based on multi-scale perception enhancemen.Firstly,a multi-scale perceptual enhancement module is proposed in the feature extraction stage,which employs different convolutional blocks for extracting features for different levels of feature maps to ensure that the low-level feature maps retain enough detail information and the high-level feature maps extract enough semantic information.So that the extracted multilevel feature maps have the ability of adaptive feature learning for different scales.Meanwhile,an adaptive channel attention module is used to adaptively learn the channel weights to mitigate the effects of the complex background.Secondly,a size-sensitive rotated Itersection over Union(IoU) loss is proposed to supervise the network to learn the size information of the target and increase the sensitivity to high aspect ratio targets by adding the loss terms of objects' aspect ratio and area in the loss.The proposed method achieves 77.64%,98.32%,and 66.14% mAP on the publicly available remote sensing image datasets DOTA,HRSC2016,and DIOR-R,respectively.The detection accuracies of the proposed framework outperform existing state-of-the-art remote sensing image detection networks.

Key words: Remote sensing image, Oriented object detection, Multi-scale perceptual enhancement, Adaptive feature learning, Rotated IoU loss

CLC Number: 

  • TP751
[1]YU Z,CHU H,BO S,et al.Multrlayer Feature Selection Based Hierarchal Component Model for Aero-planeDetection on Remote Sensing Image [J].Geomatics and Information Science of Wuhan University,2014,39(12):1406-1411.
[2]TANG F,WANG W,LI J,et al.Aircraft rotation detection inremote sensing image based on multi-feature fusion and rotation-aware anchor [J].Applied Sciences,2022,12(3):1291.
[3]JI H,GAO Z,MEI T,et al.Improved faster R-CNN with multiscale feature fusion and homography augmentation for vehicle detection in remote sensing images [J].IEEE Geoscience and Remote Sensing Letters,2019,16(11):1761-1765.
[4]ZHANG Y_Z,GUO W,LI W_B.Omnidirectional accurate detection algorithm for dense small objects in remote sensing images[J].Journal of Jilin University(Engineering and Technology Edition),2024,54(4):1105-1113.
[5]WANG F_S,BAO P.Intelligent Recognition of sensitive small targets with fine grains in complex background remote sensing images[J].Journal of Jilin University(Engineering and Techno-logy Edition),2024,54(11):3289-3295.
[6]LIU Z,WANG H,WENG L,et al.Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds [J].IEEE Geoscience and Remote Sensing Letters,2016,13(8):1074-1078.
[7]RAN B,YOU Y,LI Z,et al.Arbitrary-Oriented Ship Detection Method Based on Improved Regression Model for Target Direction Detection Network[C]//IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium.IEEE,2020:964-967.
[8]CHEN L,WENG T,XING J,et al.A new deep learning net-work for automatic bridge detection from SAR images based on balanced and attention mechanism [J].Remote Sensing,2020,12(3):441.
[9]LIN T Y,DOLLáR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[10]WANG J,WANG Y,WU Y,et al.FRPNet:A feature-reflowing pyramid network for object detection of remote sensing images [J].IEEE Geoscience and Remote Sensing Letters,2020,19:1-5.
[11]ZHANG W,JIAO L,LI Y,et al.Laplacian feature pyramid network for object detection in VHR optical remote sensing images [J].IEEE Transactions on Geoscience and Remote Sensing,2021,60:1-14.
[12]GU L,EUGENE P,DONG G.Fast Fourier Convolution Based Remote Sensor Image Object Detection for Earth Observation [J].arXiv:2209.00551,2022.
[13]HUANG Z,LI W,XIA X G,et al.A novel nonlocal-aware pyramid and multiscale multitask refinement detector for object detection in remote sensing images [J].IEEE Transactions on Geo-science and Remote Sensing,2021,60:1-20.
[14]LIU Y,LI Q,YUAN Y,et al.ABNet:Adaptive balanced network for multiscale object detection in remote sensing imagery [J].IEEE Transactions on Geoscience and Remote Sensing,2021,60:1-14.
[15]ZHOU Y,CHEN S,ZHAO J,et al.CLT-Det:Correlation lear-ning based on transformer for detecting dense objects in remote sensing images [J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-15.
[16]YU J,JIANG Y,WANG Z,et al.Unitbox:An advanced object detection network [C]//Proceedings of the 24th ACM International Conference on Multimedia.2016:516-520.
[17]REZATOFIGHI H,TSOI N,GWAK J,et al.Generalized intersection over union:A metric and a loss for bounding box regression [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:658-666.
[18]ZHENG Z,WANG P,LIU W,et al.Distance-IoU loss:Fasterand better learning for bounding box regression[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2020:12993-13000.
[19]ZHANG Y F,REN W,ZHANG Z,et al.Focal and efficient IOU loss for accurate bounding box regression [J].Neurocomputing,2022,506:146-157.
[20]GUO Z,LIU C,ZHANG X,et al.Beyond bounding-box:Con-vex-hull feature adaptation for oriented and densely packed object detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:8792-8801.
[21]XIE E,SUN P,SONG X,et al.Polarmask:Single shot instance segmentation with polar representation [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:12193-12202.
[22]YANG X,YANG J,YAN J,et al.Scrdet:Towards more robust detection for small,cluttered and rotated objects[C]//Procee-dings of the IEEE/CVF International Conference on Computer Vision.2019:8232-8241.
[23]CHEN Z,CHEN K,LIN W,et al.Piou loss:Towards accurate oriented object detection in complex environments [C]//European Conference on Computer Vision.Springer,2020:195-211.
[24]YANG X,YAN J,MING Q,et al.Rethinking rotated object detection with gaussian wasserstein distance loss [C]//International Conference on Machine Learning.PMLR,2021:11830-11841.
[25]YANG X,YANG X,YANG J,et al.Learning high-precisionbounding box for rotated object detection via kullback-leibler divergence [J].Advances in Neural Information Processing Systems,2021,34:18381-18394.
[26]YANG X,ZHOU Y,ZHANG G,et al.The KFIoU loss for rotated object detection [J].arXiv:2201.12558,2022.
[27]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[28]YU F,KOLTUN V,FUNKHOUSER T.Dilated residual net-works [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:472-480.
[29]LI H,LI J,WEI H,et al.Slim-neck by GSConv:A better design paradigm of detector architectures for autonomous vehicles [J].arXiv:2206.02424,2022.
[30]XIE X,CHENG G,WANG J,et al.Oriented R-CNN for object detection [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:3520-3529.
[31]XIA G S,BAI X,DING J,et al.DOTA:A large-scale dataset for object detection in aerial images [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3974-3983.
[32]CHENG G,WANG J,LI K,et al.Anchor-free oriented proposal generator for object detection [J].arXiv:2110.01931,2022.
[33]DENG J,DONG W,SOCHER R,et al.Imagenet:A large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2009:248-255.
[34]CHEN K,WANG J,PANG J,et al.MMDetection:Open mmlab detection toolbox and benchmark [J].arXiv:1906.07155,2019.
[35]LIN TY,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection [C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[36]TIAN Z,SHEN C,CHEN H,et al.Fcos:Fully convolutional one-stage object detection [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:9627-9636.
[37]DING J,XUE N,LONG Y,et al.Learning RoI transformer for oriented object detection in aerial images[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:2849-2858.
[38]LI C,XU C,CUI Z,et al.Feature-attentioned object detection in remote sensing imagery[C]//IEEE International Conference on Image Processing.IEEE,2019:3886-3890.
[39]ZHAO P,QU Z,BU Y,et al.Polardet:A fast,more precise detector for rotated target in aerial images [J].International Journal of Remote Sensing,2021,42(15):5831-5861.
[40]FU K,CHANG Z,ZHANG Y,et al.Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images [J].ISPRS Journal of Photogrammetry and Remote Sensing,2020,161:294-308.
[41]CHENG G,YAO Y,LI S,et al.Dual-aligned oriented detector [J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-11.
[42]LU D,LI D,LI Y,et al.OSKDet:Orientation-sensitive keypoint localization for rotated object detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:1182-1192.
[43]MING Q,ZHOU Z,MIAO L,et al.Dynamic anchor learning for arbitrary-oriented object detection [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:2355-2363.
[44]MA J,SHAO W,YE H,et al.Arbitrary-oriented scene text detection via rotation proposals [J].IEEE Transactions on Multimedia,2018,20(11):3111-3122.
[45]XU Y,FU M,WANG Q,et al.Gliding vertex on the horizontal bounding box for multi-oriented object detection [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,43(4):1452-1459.
[46]WANG J,YANG W,LI H C,et al.Learning center probability map for detecting objects in aerial images [J].IEEE Transactions on Geoscience and Remote Sensing,2020,59(5):4307-4323.
[47]HOU L,LU K,XUE J,et al.Shape-adaptive selection and mea-surement for oriented object detection [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:923-932.
[48]YANG X,YAN J,FENG Z,et al.R3det:Refined single-stagedetector with feature refinement for rotating object[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:3163-3171.
[49]HAN J,DING J,XUE N,et al.Redet:A rotation-equivariant detector for aerial object detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:2786-2795.
[50]HAN J,DING J,LI J,et al.Align deep features for oriented object detection [J].IEEE Transactions on Geoscience and Remote Sensing,2021,60:1-11.
[51]ZENG Y,YANG X,LI Q,et al.Ars-detr:Aspect ratio sensitive oriented object detection with transformer [J].arXiv:2303.04989,2023.
[1] CHEN Tianpeng, HU Jianwen, LI Haitao. Ships Detection in Remote Sensing Images Based on Improved PPYOLOE-R [J]. Computer Science, 2025, 52(6A): 240600118-8.
[2] JIANG Wenwen, XIA Ying. Improved U-Net Multi-scale Feature Fusion Semantic Segmentation Network for RemoteSensing Images [J]. Computer Science, 2025, 52(5): 212-219.
[3] WANG Qian, HE Lang, WANG Zhanqing, HUANG Kun. Road Extraction Algorithm for Remote Sensing Images Based on Improved DeepLabv3+ [J]. Computer Science, 2024, 51(8): 168-175.
[4] HOU Linhao, LIU Fan. Remote Sensing Image Fusion Combining Multi-scale Convolution Blocks and Dense Convolution Blocks [J]. Computer Science, 2024, 51(6A): 230400110-6.
[5] CHEN Tianpeng, HU Jianwen. Ships Detection in Remote Sensing Images Based on Improved FCOS [J]. Computer Science, 2024, 51(6A): 230700166-7.
[6] HE Xinyu, LU Chenxin, FENG Shuyi, OUYANG Shangrong, MU Wentao. Ship Detection and Recognition of Optical Remote Sensing Images for Embedded Platform [J]. Computer Science, 2024, 51(6A): 230700117-7.
[7] ZHANG Yang, XIA Ying. Object Detection Method with Multi-scale Feature Fusion for Remote Sensing Images [J]. Computer Science, 2024, 51(3): 165-173.
[8] LI Yu, YANG Xiangli, ZHANG Le, LIANG Yalin, GAO Xian, YANG Jianxi. Combined Road Segmentation and Contour Extraction for Remote Sensing Images Based on Cascaded U-Net [J]. Computer Science, 2024, 51(3): 174-182.
[9] LIU Changxin, WU Ning, HU Lirui, GAO Ba, GAO Xueshan. Recursive Gated Convolution Based Super-resolution Network for Remote Sensing Images [J]. Computer Science, 2024, 51(2): 205-216.
[10] SHI Jingye, ZUO Yiping, ZHI Ruicong, LIU Jiqiang, ZHANG Mengge. Multi-scale Dual Self-attention Based Remote Sensing Image Change Detection [J]. Computer Science, 2024, 51(11A): 231000097-9.
[11] WANG Kunyang, LIU Yang, YE Ning, ZHANG Kai. Road Extraction from Complex Urban Remote Sensing Images Based on Multi-task Learning [J]. Computer Science, 2024, 51(11A): 240300095-8.
[12] DONG Yan, WEI Minghong, GAO Guangshuai, LIU Zhoufeng, LI Chunlei. Remote Sensing Orineted Object Detection Method Based on Dual-label Assignment [J]. Computer Science, 2024, 51(11A): 240100058-9.
[13] FAN Xuejing, XUE Xiaorong, DU Yichao. Spatiotemporal Fusion Method for Remote Sensing Images Based on Dual Attention Mechanisms [J]. Computer Science, 2024, 51(11A): 240200097-6.
[14] YAN Yan, SUI Yi, SI Jianwei. Remote Sensing Image Pan-sharpening Method Based on Generative Adversarial Network [J]. Computer Science, 2023, 50(8): 133-141.
[15] GU Yuhang, HAO Jie, CHEN Bing. Semi-supervised Semantic Segmentation for High-resolution Remote Sensing Images Based on DataFusion [J]. Computer Science, 2023, 50(6A): 220500001-6.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!