Computer Science ›› 2025, Vol. 52 ›› Issue (6A): 240700097-5.doi: 10.11896/jsjkx.240700097

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Small Target Detection Algorithm in UAV Images Integrating Multi-scale Features

HUANG Hong1, SU Han1,2, MIN Peng1   

  1. 1 College of Computer Science,Sichuan Normal University,Chengdu 610101,China
    2 Visual Computing and Virtual Reality Key Laboratory of Sichuan Province,Chengdu 610101,China
  • Online:2025-06-16 Published:2025-06-12
  • About author:SU Han,born in 1979,Ph.D,professor,is a member of CCF(No.A3506M).Her main research interests include computer vision,intelligent information processing and biometric recognition.
  • Supported by:
    Ministry of Human Resources and Social Security’s Returned Overseas Students Selection Program,Sichuan Natural Science Foundation(2023NSFSC1080,2023NSFSC0210,2023YFS0202),National Forestry and Grassland Administration International Cooperation Fund Project (Hudonghan (2023) No. 94),Chengdu Science and Technology Plan(2022-YF09-00019-SN) and Chengdu Research Base of Giant Panda Breeding(2021CPB-B06).

Abstract: To address the issues of missed and false detections caused by overly dense distributions of small objects in the task of small object detection in drone aerial imagery,which can lead to mutual occlusion,a lightweight object detection method with multi-scale feature fusion is proposed.Firstly,a multi-scale occlusion module is introduced to enhance the network’s multi-scale information extraction capability,reduce semantic differences between different scales,and improve detection performance for occluded small objects.Secondly,a more efficient shared detection head strategy is proposed,which shares feature information of different scales through shared convolution across different detection heads,significantly reducing the model’s parameter count and achieving model lightweighting.Finally,a softened non-maximum suppression(NMS) method is introduced to solve the pro-blem of missed and false detections in dense occlusion scenarios caused by traditional greedy NMS,further improving detection accuracy.The effectiveness of the improved model is evaluated on the Visdrone-2019 and RSOD datasets,with the mean average precision(mAP) of the improved model increases by 9.0% and 6.0% respectively compared to the baseline model,and the model parameters reduce by 12.6%.Experimental results show that the improved algorithm can enhance the accuracy of object detection in drone aerial imagery while ensuring lightweighting,helping drone systems to identify and track targets more accurately,thereby improving the reliability and efficiency of task execution.

Key words: Drone aerial photography, Small object detection, Deep learning, Non-maximum suppression, YOLOv8

CLC Number: 

  • TP391
[1]WANG S,JIANG H,LI Z,et al.PHSI-RTDETR:A Light-weight Infrared Small Target Detection Algorithm Based on UAV Aerial Photography[J].Drones,2024,8(6):240.
[2]BAI Z,PEI X,QIAO Z,et al.Improved YOLOv7 Target Detection Algorithm Based on UAV Aerial Photography[J].Drones,2024,8(3):104.
[3]ZHU X,LYU S,WANG X,et al.TPH-YOLOv5:ImprovedYOLOv5 based on transformer prediction headfor object detection on drone-captured scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.Piscataway:IEEE,2021:2778-2788.
[4]ZHOU J,SU T,LI K,et al.Small Target-YOLOv5:Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5[J].Sensors,2023,24(1):134.
[5]ZHANG M,ZHANG B,LIU M,et al.Robust object detection in aerial imagery based on multi-scale detector and soft densely connected[J].IEEE Access,2020,8:92791-92801.
[6]MA J,SHAO W,YE H,et al.Arbitrary-oriented scene text detection via rotation proposals[J].IEEE Transactions on Multimedia,2018,20(11):3111-3122.
[7]ZHAO L L,ZHU M L.MS-YOLOv7:YOLOv7 based on multi-scale for object detection on UAV aerial photography[J].Drones,2023,7(3):188.
[8]LIU W,LU H,FU H,et al.Learning to Upsample by Learning to Sample[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:6027-6037.
[9]KANG M,TING C M,TING F F,et al.ASF-YOLO:A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation[J].arXiv:2312.06458,2023.
[10]TIAN Z,SHEN C,CHEN H,et al.FCOS:A simple and strong anchor-free object detector[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,44(4):1922-1933.
[11]NEUBECK A,VAN GOOL L.Efficient non-maximum suppression[C]//18th International Conference on Pattern Recognition(ICPR’06).IEEE,2006:850-855.
[12]BODLA N,SINGH B,CHELLAPPAR,et al.Soft-NMS-improving object detection with one line of code[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:5561-5569.
[13]DU D,ZHANG Y,SHI B L,et al.VisDrone-SOT2019:The vision meets drone single object tracking challenge results[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshop(ICCVW).Seoul,Republic of Korea,2019:27-28.
[14]UTTARKABAT S,SUNKARA N D,PATRA B K.RSOD:Efficient Technique for Outlier Detection using Reverse Nearest Neighbors Statistics[C]//2020 4th International Conference on Computational Intelligence and Networks(CINE).2020.
[15]WANG J,YU J,HE Z.ARFP:A novel adaptive recursive feature pyramid for object detection in aerial images[J].Applied Intelligence,2022,52(11):12844-12859.
[16]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[J].arXiv:1804.02767,2018.
[17]CHEN H,LIU H,SUN T,et al.MC-YOLOv5:A Multi-ClassSmall Object Detection Algorithm[J].Biomimetics,2023,8(4):342.
[18]CHEN N,LI Y,YANG Z,et al.LODNU:lightweight object detection network in UAV vision[J].The Journal of Supercomputing,2023,79(9):10117-10138.
[19]CAI Z,HONG Z,YU W,et al.CNXResNet:A Light-weightBackbone based on PP-YOLOE for Drone-captured Scenarios[C]//2023 8th International Conference on Signal and Image Processing(ICSIP).IEEE,2023:460-464.
[20]BOCHKOVSKIY A,WANG C Y,LIAOH Y M.Yolov4:Optimal speed and accuracy of object detection[J].arXiv:2004.10934,2020.
[21]DING W,ZHANG L.Building detection in remote sensing image based on improved YOLOv5[C]//2021 17th International Conference on Computational Intelligence and Security(CIS).IEEE,2021:133-136.
[22]LANG L,XU K,ZHANG Q,et al.Fast and accurate object detection in remote sensing images based on lightweight deep neural network[J].Sensors,2021,21(16):5460.
[23]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//Computer Vision-ECCV 2016:14th European Conference,Amsterdam,The Netherlands,October 11-14,2016,Proceedings,Part I 14.Springer International Publishing,2016:21-37.
[24]HUANG W,LI G,CHEN Q,et al.CF2PN:A cross-scale feature fusion pyramid network based remote sensing target detection[J].Remote Sensing,2021,13(5):847.
[1] WANG Jiamin, WU Wenhong, NIU Hengmao, SHI Bao, WU Nier, HAO Xu, ZHANG Chao, FU Rongsheng. Review of Concrete Defect Detection Methods Based on Deep Learning [J]. Computer Science, 2025, 52(6A): 240900137-12.
[2] HAO Xu, WU Wenhong, NIU Hengmao, SHI Bao, WU Nier, WANG Jiamin, CHU Hongkun. Survey of Man-Machine Distance Detection Method in Construction Site [J]. Computer Science, 2025, 52(6A): 240700098-10.
[3] HUANG Bocheng, WANG Xiaolong, AN Guocheng, ZHANG Tao. Transmission Line Fault Identification Method Based on Transfer Learning and Improved YOLOv8s [J]. Computer Science, 2025, 52(6A): 240800044-8.
[4] WU Zhihua, CHENG Jianghua, LIU Tong, CAI Yahui, CHENG Bang, PAN Lehao. Human Target Detection Algorithm for Low-quality Laser Through-window Imaging [J]. Computer Science, 2025, 52(6A): 240600069-6.
[5] CHEN Shijia, YE Jianyuan, GONG Xuan, ZENG Kang, NI Pengcheng. Aircraft Landing Gear Safety Pin Detection Algorithm Based on Improved YOlOv5s [J]. Computer Science, 2025, 52(6A): 240400189-7.
[6] BAI Shaokang, WANG Baohui, CHEN Jixuan. Improved Helmet Detection Algorithm of Electric Bicycle Based on YOLOv8n [J]. Computer Science, 2025, 52(6A): 240900167-8.
[7] GAO Junyi, ZHANG Wei, LI Zelin. YOLO-BFEPS:Efficient Attention-enhanced Cross-scale YOLOv10 Fire Detection Model [J]. Computer Science, 2025, 52(6A): 240800134-9.
[8] ZHANG Hang, WEI Shoulin, YIN Jibin. TalentDepth:A Monocular Depth Estimation Model for Complex Weather Scenarios Based onMultiscale Attention Mechanism [J]. Computer Science, 2025, 52(6A): 240900126-7.
[9] WANG Baohui, GAO Zhan, XU Lin, TAN Yingjie. Research and Implementation of Mine Gas Concentration Prediction Algorithm Based on Deep Learning [J]. Computer Science, 2025, 52(6A): 240400188-7.
[10] LIU Chengming, LI Haixia, LI Shaochuan, LI Yinghao. Ensemble Learning Model for Stock Manipulation Detection Based on Multi-scale Data [J]. Computer Science, 2025, 52(6A): 240700108-8.
[11] ZHOU Lei, SHI Huaifeng, YANG Kai, WANG Rui, LIU Chaofan. Intelligent Prediction of Network Traffic Based on Large Language Model [J]. Computer Science, 2025, 52(6A): 241100058-7.
[12] GUAN Xin, YANG Xueyong, YANG Xiaolin, MENG Xiangfu. Tumor Mutation Prediction Model of Lung Adenocarcinoma Based on Pathological [J]. Computer Science, 2025, 52(6A): 240700010-8.
[13] TAN Jiahui, WEN Chenyan, HUANG Wei, HU Kai. CT Image Segmentation of Intracranial Hemorrhage Based on ESC-TransUNet Network [J]. Computer Science, 2025, 52(6A): 240700030-9.
[14] RAN Qin, RUAN Xiaoli, XU Jing, LI Shaobo, HU Bingqi. Function Prediction of Therapeutic Peptides with Multi-coded Neural Networks Based on Projected Gradient Descent [J]. Computer Science, 2025, 52(6A): 240800024-6.
[15] FAN Xing, ZHOU Xiaohang, ZHANG Ning. Review on Methods and Applications of Short Text Similarity Measurement in Social Media Platforms [J]. Computer Science, 2025, 52(6A): 240400206-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!