Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250600036-7.doi: 10.11896/jsjkx.250600036

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Aerial Image Object Detection Model Based on Dual-domain Attention and Feature Fusion

MAO Lihong1, TANG Jianjun2,3, CHEN Tong1, ZHANG Rui1   

  1. 1 School of Information Engineering,Tarim University,Alar,Xinjiang 843300,China
    2 School of Software,Nanchang University,Nanchang 330000,China
    3 School of Network Engineering,Jiangxi University of Software Professional Technology,Nanchang 330000,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:MAO Lihong,born in 1995,postgra-duate,lecturer.His main research in-terests include object detection and energy Internet.
    TANG Jianjun,born in 1983,bachelor,teaching assistant.His main research interests include object detection and blockchain.

Abstract: With the escalating strategic importance of China's low-altitude economy,precise object detection in aerial imagery under complex scenarios has emerged as a pivotal technology.However,challenges including dense object clustering,intricate background environments,and the prevalence of small targets in aerial images continue to impede feature extraction and detection accuracy for deep learning-based models.This paper introduces an aerial imagery object detection framework integrating dual-domain attention mechanisms and hierarchical feature fusion.Firstly,a channel-spatial dual-domain attention module is engineered to suppress background interference while amplifying salient feature channels through adaptive weight calibration.Secondly,a cross-layer multi-scale feature fusion architecture is developed,incorporating residual fusion pathways and learnable weighting coefficients to enable effective multi-resolution feature interactions.Finally,a dedicated 20×20 pixel small-object detection branch is appended to enhance fine-grained target recognition capabilities.Experimental evaluations on the VisDrone2019 dataset demonstrate substantial performance gains over state-of-the-art baselines:the proposed model achieves relative mAP0.5 improvements of 97.3%,18.6%,22.7%,33.7%,12.2%,18.4%,and 37.9% compared to Faster R-CNN,LFET-NetYOLOv5x,YOLOv8,YOLOv9,YOLOv10,and YOLOv11 respectively.These results fully demonstrate the effectiveness of the proposed model inaerial image object detection tasks.

Key words: Aerial imagery, Object detection, YOLOv5, Dual-domain attention module, Cross-level multi-scale feature fusion network

CLC Number: 

  • TP391.4
[1] CHEN H.Brief Discussion on the Role and Significance of UAV Channel Surveying and Mapping in the Development of Low-Altitude Economy Strategy [J].Pearl River Water Transport,2024,(22):11-14.
[2] LI B H,LI Z.Research on Small Target Detection Algorithm for Drone Aerial Photography Based on Improved YOLOv8n [J].Intelligent Computer and Applications,2025,48(3):26-34.
[3] XU H,WANG L,HAN W,et al.A survey on UAV applications in smart city management:Challenges,advances,and opportunities [J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2023,16:8982-9010.
[4] BAI J,HU H,LIU X,et al.UAV image object detection based on self-attention guidance and global feature fusion [J].Image and Vision Computing,2024,151:105262.
[5] CHENG X,WANG F,HU X,et al.IRWT-YOLO:A Back-ground Subtraction-Based Method for Anti-Drone Detection [J].Drones,2025,9(4):297.
[6] ZHAO Z Q,ZHENG P,XU S T,et al.Object Detection with Deep Learning:A Review [J].arXiv:1807.05511,2018.
[7] ZHANG Y,ZHAO Y T,LI G X,et al.Research Progress on Edge Detection Technology of Images [J].Computer and Digital Engineering,2020,48(5):1176-118,1232.
[8] TANG C,FENG Y,YANG X,et al.The object detection based on deep learning[C]//2017 4th International Conference on Information Science and Control Engineering(ICISCE).2017:723-728.
[9] CHEN Z,WANG H,WU X,et al.Object detection in aerial images using DOTA dataset:A survey [J].International Journal of Applied Earth Observation and Geoinformation,2024,134:104208.
[10] BI H,DAI R,HAN F,et al.DR-YOLO:An improved multi-scale small object detection model for drone aerial photography scenes based on YOLOv7 [J].Digital Signal Processing,2025,164:105265.
[11] WANG C Y,BOCHKOVSKIY A,LIAO H Y M.YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:7464-7475.
[12] LU S,GUO Y,LONG J,et al.Dense small target detection algorithm for UAV aerial imagery [J].Image and Vision Computing,2025,156:105485.
[13] DU D,ZHU P,WEN L,et al.VisDrone-DET2019:The vision meets drone object detection in image challenge results[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops.2019.
[14] HAROON M,SHAHZAD M,FRAZ M M.Multisized object detection using spaceborne optical imagery [J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2020,13:3032-3046.
[15] HSIEH M-R,LIN Y-L,HSU W H.Drone-based object counting by spatially regularized regional proposal network[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:4145-4153.
[16] BATTISH N,KAUR D,CHUGH M,et al.SDMNet:spatially dilated multi-scale network for object detection for drone aerial imagery [J].Image and Vision Computing,2024,150:105232.
[17] DU D,QI Y,YU H,et al.The unmanned aerial vehicle benchmark:Object detection and tracking[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:370-386.
[18] FENG C,WANG C,ZHANG D,et al.Enhancing Dense SmallObject Detection in UAV Images Based on Hybrid Transformer [J].Computers,Materials & Continua,2024,78(3):3993-4013.
[19] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need [C]//Proceedings of the 31st International Confe-rence on Neural Information Processing Systems.2017:6000-6010.
[20] CHEN H Y,ZHEN X J,ZHAO T T.Small object detection model based on feature fusion of attention mechanism [J].Journal of Huazhong University of Science and Technology(Natural Science Edition),2023,51(3):60-66.
[21] LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[22] LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[23] WANG H T,ZHANG S,CHEN X,et al.Target detection algorithm of lightweight UAV aerial photography [J].Electronic Measurement Technology,2022,45(19):167-174.
[24] HE Z X,CHEN L L,WANG X,et al.DMF-YOLOv11:TargetDetection Algorithm for UAV Images Based on Improved YOLOv11n [J].Computer Engineering and Applications,2025,61(14):88-100.
[25] SHEN Z Y,ZHU F H,WANG Z X,et al.Uav aerial image target detection based on efficient feature extrac technology.receptive field [J].CAAI Transactions on Intelligent Systems,2025,20(4):813-821.
[1] ZHANG Shouyi, SHEN Qiang, GUO Yiran, WANG Hanyu. Rain and Fog Weather Object Detection Algorithm Based on Improved YOLOv8 Model [J]. Computer Science, 2026, 53(6A): 250300090-7.
[2] LIU Dai, AN Pengyu, WANG Kai. Improved YOLOv5s-based Algorithm for Emergency Situation Detection in Airport Terminals [J]. Computer Science, 2026, 53(6A): 250300174-7.
[3] DONG Ye, LIAN Xinyue, WANG Yuyang, OU Xinyu. RGB-IR Multi-modal Fusion-based Tomato Small Object Detection [J]. Computer Science, 2026, 53(6A): 250700173-8.
[4] ZHOU Wenwu, LEI Lei, XUAN Xin. Armory Equipment Detection Based on Improved YOLOv5 [J]. Computer Science, 2026, 53(6A): 250800049-6.
[5] SHAN Chengcheng, MEI Chun, LI Weiting, GUO Yuanyuan, QIAN Weixing, XIONG Zhi. Semantic Perception Active Learning Method for the Datum Map of Scene Matching Navigation System [J]. Computer Science, 2026, 53(6A): 250600228-8.
[6] CHEN Nuo, ZHAO Peng, HUAN Haisheng. Review of Small Object Detection Based on Deep Learning [J]. Computer Science, 2026, 53(6A): 250700022-9.
[7] ZHENG Haibin, LIN Xiuhao, HAN Ye, CHEN Jinyin, LI Beibei. Black-box Physical Adversarial Attack Against Multimodal Object Detector [J]. Computer Science, 2026, 53(6A): 250700023-10.
[8] QU Jiewu, LU Xinxi, SUN Jian, LIU Yan, GAO Ling, XU Binbin. Object Detection Method Based on Phased Training Strategy and Multi-scale Feature Fusion [J]. Computer Science, 2026, 53(6A): 250700088-7.
[9] JI Wenyu, LI Yang, WANG Jiabao, FU Ruizhi, LIU Xiaoyu, MIAO Zhuang. Review of 3D Object Detection Based on LiDAR-camera Fusion [J]. Computer Science, 2026, 53(6): 214-231.
[10] LI Peng, ZHANG Zihao, HAN Yahong. Primitive Dynamic Weighting for Multi-modal Salient Object Detection [J]. Computer Science, 2026, 53(6): 242-251.
[11] LIU Jikang, HUANG Lei, ZHANG Ke, NIE Jie, WEI Zhiqiang. Object Detection Method Based on Dynamic Feature Fusion [J]. Computer Science, 2026, 53(6): 263-269.
[12] SONG Jianhua, LIU Chun, ZHANG Yan. Lightweight Camouflaged Object Detection Model Based on Structured Knowledge Distillation [J]. Computer Science, 2026, 53(4): 299-307.
[13] ZHAO Binbei, ZHU Li, ZHAO Hongli, LI Yutong. Computer Vision Applications in Rail Transit Systems [J]. Computer Science, 2026, 53(3): 214-224.
[14] HUANG Jing, WANG Teng, LIU Jian, HU Kai, PENG Xin, HUANG Yamin, WEN Yuanqiao. Multimodal Visual Detection for Underwater Sonar Target Images [J]. Computer Science, 2026, 53(2): 227-235.
[15] ZHOU Bingquan, JIANG Jie, CHEN Jiangmin, ZHAN Lixin. EvR-DETR:Event-RGB Fusion for Lightweight End-to-End Object Detection [J]. Computer Science, 2026, 53(1): 153-162.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!