Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 231100119-8.doi: 10.11896/jsjkx.231100119

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Improved Lightweight Aerial Photography Object Detection Model Based on YOLOv5s

CHEN Haiyan, MAO Lihong   

  1. School of Computer and Communication,Lanzhou University of Technology,Lanzhou 730050,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:CHEN Haiyan,born in 1978,Ph.D,associate professor.Her main research interests include image processing and pattern recognition.
  • Supported by:
    National Natural Science Foundation of China(62161019,62061024).

Abstract: The difficulty of target detection is increased by complex backgrounds,dense targets,and a high proportion of small objects in the unmanned aerial vehicle(UAV)aerial images.Deployment on embedded devices by drones is difficult due to the high computational complexity of the target detection model based on deep learning.Aiming at the above problems,an improved lightweight aerial image object detection model based on YOLOv5s is proposed.Firstly,the C3 module BottleNeck of the YOLOv5s backbone network is replaced with lightweight ShuffleNetv2 to reduce model parameters and computational complexity.Secondly,cross-layer information cross-fusion,SE channel attention mechanism,and residual connections are introduced in the ShuffleNetv2 network to alleviate the problem of reducing the number of feature channels caused by convolution operations and insufficient information utilization of feature maps in the middle layer of the network.Then,the SE channel attention mechanism is introduced into the YOLOv5s multi-scale feature fusion network,augmenting the network′s ability to capture and extract key features.Finally,the proposed target detection model in this paper is further lightened by channel pruning.Experimental results on the NWPU VHR-10 dataset show that,compared with the YOLOv5s model,the proposed model is increased of 3.5% in precision and 1.9% in mean average precision.The number of parameters and computational workload is reduced by 76% and 48.7%,the model size is compressed by 73.8% and detection speed improved by 48%.

Key words: Object detection, Lightweight network, YOLOv5s, SE channel attention mechanism, Channel pruning

CLC Number: 

  • TP391.4
[1]JUNOS M H,KHAIRUDDIN A S M,DAHARI M.Automated object detection on aerial images for limited capacity embedded device using a lightweight CNN model [J].Alexandria Engineering Journal,2022,61(8):6023-6041.
[2]LIU S,D LIU Y H,SUN Y M,et al.Small object detection in UAV aerial images based on inverted residual attention[J].Journal of Beijing University of Aeronautics and Astronautics,2023,49(3):514-524.
[3]DENG L,BI L,LI H,et al.Lightweight aerial image object detection algorithm based on improved YOLOv5s [J].Scientific Reports,2023,13(1):7817.
[4]CORTES C,VAPNIK V N.Support Vector Networks [J].Machine Learning,1995,20(3):273-297.
[5]STATISTICS L B,BREIMAN L.Random Forests [J].Machine Learning,2001,45(1):5-32.
[6]LOWE D G.Distinctive Image Features from Scale-InvariantKeypoints [J].International Journal of Computer Vision,2004,60(2):91-110.
[7]GUO L,WANG Q L,XUE W,et al.A Small Object Detection Algorithm Based on Improved YOLOv5[J].Journal of University of Electronic Science and Technology of China,2022,51(2):251-258.
[8]LIU P,LI K,ZHOU X,et al.RFP-based Faster R-CNN in Aeri-al Image Detection [C]//Proceedings of the 2020 International Conference on Artificial Intelligence and Computer Engineering(ICAICE).IEEE,2020:325-329.
[9]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks [J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[10]HUANG Y C,LI L H,MA J J,et al.Research on aerial photography target detection algorithm based on improved SSD algorithm[J].Journal of Telemetry,Tracking and Command,2022,43(3):79-85.
[11]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector [C]// proceedings of the Computer Vision-ECCV 2016:14th European Conference,Amsterdam,The Netherlands,Part I 14.Springer,2016:21-37.
[12]FENG Z Q,XIE Z J,BAO Z W,et al.Real-time dense small object detection algorithm for UAV based on improved YOLOv5[J].Acta Aeronautica et Astronautica Sinica,2023,44(7):251-265.
[13]XIE C H,WU J M,XU H Y.Small Object Detection Algorithm Based on Improved YOLOv5 in UAV Image[J].Computer Engineering and Applications,2023,59(9):198-206.
[14]HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:Efficient convolutional neural networks for mobile vision applications [J].arXiv:1704.04861,2017.
[15]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:In-verted residuals and linear bottlenecks [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4510-4520.
[16]HOWARD A,SANDLER M,CHU G,et al.Searching for mobilenetv3 [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:1314-1324.
[17]ZHANG X,ZHOU X,LIN M,et al.Shufflenet:An extremelyefficient convolutional neural network for mobile devices [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:6848-6856.
[18]MA N,ZHANG X,ZHENG H T,et al.Shufflenet v2:Practical guidelines for efficient cnn architecture design [C]// Proceedings of the European Conference on Computer Vision(ECCV).2018:116-131.
[19]HAN K,WANG Y,TIAN Q,et al.Ghostnet:More featuresfrom cheap operations [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:1580-1589.
[20]YE L.AugShuffleNet:Communicate More,Compute Less [J].arXiv:2203.06589,2022.
[21]ZHANG L,CAI J.Target detection system based on lightweight Yolov5 algorithm[J].Computer Technology and Development,2022,32(11):134-139.
[22]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[23]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[1] XU Jinlong, GUI Zhonghua, LI Jia'nan, LI Yingying, HAN Lin. FP8 Quantization and Inference Memory Optimization Based on MLIR [J]. Computer Science, 2024, 51(9): 112-120.
[2] WANG Jiahui, PENG Guangling, DUAN Liang, YUAN Guowu, YUE Kun. Few-shot Shadow Removal Method for Text Recognition [J]. Computer Science, 2024, 51(9): 147-154.
[3] LI Yunchen, ZHANG Rui, WANG Jiabao, LI Yang, WANG Ziqi, CHEN Yao. Re-parameterization Enhanced Dual-modal Realtime Object Detection Model [J]. Computer Science, 2024, 51(9): 162-172.
[4] HU Pengfei, WANG Youguo, ZHAI Qiqing, YAN Jun, BAI Quan. Night Vehicle Detection Algorithm Based on YOLOv5s and Bistable Stochastic Resonance [J]. Computer Science, 2024, 51(9): 173-181.
[5] PU Bin, LIANG Zhengyou, SUN Yu. Monocular 3D Object Detection Based on Height-Depth Constraint and Edge Fusion [J]. Computer Science, 2024, 51(8): 192-199.
[6] LOU Zhengzheng, ZHANG Xin, HU Shizhe, WU Yunpeng. Foggy Weather Object Detection Method Based on YOLOX_s [J]. Computer Science, 2024, 51(7): 206-213.
[7] ZHENG Shenhai, GAO Xi, LIU Pengwei, LI Weisheng. Occluded Video Instance Segmentation Method Based on Feature Fusion of Tracking and Detection in Time Sequence [J]. Computer Science, 2024, 51(6A): 230600186-6.
[8] LIU Hongli, WANG Yulin, SHAO Lei, LI Ji. Study on Monocular Vision Vehicle Ranging Based on Lower Edge of Detection Frame [J]. Computer Science, 2024, 51(6A): 231000077-6.
[9] CHEN Yuzhang, WANG Shiqi, ZHOU Wen, ZHOU Wanting. Small Object Detection for Fish Based on SPD-Conv and NAM Attention Module [J]. Computer Science, 2024, 51(6A): 230500176-7.
[10] QUE Yue, GAN Menghan, LIU Zhiwei. Object Detection with Receptive Field Expansion and Multi-branch Aggregation [J]. Computer Science, 2024, 51(6A): 230600151-6.
[11] HE Xinyu, LU Chenxin, FENG Shuyi, OUYANG Shangrong, MU Wentao. Ship Detection and Recognition of Optical Remote Sensing Images for Embedded Platform [J]. Computer Science, 2024, 51(6A): 230700117-7.
[12] LI Yuanxin, GUO Zhongfeng, YANG Junlin. Container Lock Hole Recognition Algorithm Based on Lightweight YOLOv5s [J]. Computer Science, 2024, 51(6A): 230900021-6.
[13] ZHANG Lanxin, XIANG Ling, LI Xianze, CHEN Jinpeng. Intelligent Fault Diagnosis Method for Rolling Bearing Based on SAMNV3 [J]. Computer Science, 2024, 51(6A): 230700167-6.
[14] JIAO Ruodan, GAO Donghui, HUANG Yanhua, LIU Shuo, DUAN Xuanfei, WANG Rui, LIU Weidong. Study and Verification on Few-shot Evaluation Methods for AI-based Quality Inspection in Production Lines [J]. Computer Science, 2024, 51(6A): 230700086-8.
[15] LI Yuehao, WANG Dengjiang, JIAN Haifang, WANG Hongchang, CHENG Qinghua. LiDAR-Radar Fusion Object Detection Algorithm Based on BEV Occupancy Prediction [J]. Computer Science, 2024, 51(6): 215-222.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!