Computer Science ›› 2021, Vol. 48 ›› Issue (11A): 265-269.doi: 10.11896/jsjkx.201000152

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Light-weight Object Detection Network Optimized Based on YOLO Family

XU Yu-jun, LI Chen   

  1. School of Electronic Science and Engineering,Southeast University,Nanjing 210096,China
  • Online:2021-11-10 Published:2021-11-12
  • About author:XU Yu-jun,born in 1996,postgraduate.His main research interests include deep learning and computer vision.
    LI Chen,born in 1982,Ph.D,associate professor.His main research interests include physical electronics and so on.

Abstract: Object detection is an active research field in the computer vision field.It is a very effective method to improve object detection precision by designing a large-scale deep convolutional neural network.However,it is unfavorable to deploy a large-scale object detection network in memory-limited applications.To solve the above problems,this paper proposes a light-weight object detection network which is based on design principles from the YOLO family of single-shot object detection network architectures.This network integrates the Ghost Module in GhostNet,in addition,a better Efficient Channel Attention (ECA) module is added to the convolution block by referring to the Squeeze-and-Excitation (SE) module in MobileNet-v3.This module can make better use of the available network capacity,making the network achieve a strong balance between reducing the complexity of architecture and computation and improving the performance of the model.In addition,Distance-IoU loss is used to solve the problem of inaccurate regression position of bounding box and effectively speeds up network convergence.Finally,the number of parameters of the model was compressed to 1.54 MB less than YOLO Nano (4.0MB),and the mAP on the VOC2007 data set was 72.1% higher than the existing YOLO Nano (69.1%).

Key words: Light-weight, Object detection, Pascal VOC, YOLO deep convolutional neural network

CLC Number: 

  • TP391
[1]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//European Conference on Computer Vision.Springer,Cham,2016:21-37.
[2]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[3]HE K,GKIOXARI G,DOLLÁR P,et al.Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[4]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems.2015:91-99.
[5]ZOPH B,CUBUK E D,GHIASI G,et al.Learning data augmentation strategies for object detection[J].arXiv:1906.11172,2019.
[6]LIU Z,LI J,SHEN Z,et al.Learning efficient convolutional networks through network slimming[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2736-2744.
[7]ZHANG D,YANG J,YE D,et al.Lq-nets:Learned quantization for highly accurate and compact deep neural networks[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:365-382.
[8]REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7263-7271.
[9]REDMON J,FARHADI A.Yolov3:An incremental improvement[J].arXiv:1804.02767,2018.
[10]HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:Efficient convolutional neural networks for mobile vision applications[J].arXiv:1704.04861,2017.
[11]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4510-4520.
[12]HOWARD A,SANDLER M,CHU G,et al.Searching for mobilenetv3[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:1314-1324.
[13]TAN M,PANG R,LE Q V.Efficientdet:Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10781-10790.
[14]HAN K,WANG Y,TIAN Q,et al.GhostNet:More features from cheap operations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:1580-1589.
[15]ZHANG X,ZHOU X,LIN M,et al.Shufflenet:An extremely efficient convolutional neural network for mobile devices[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:6848-6856.
[16]MA N,ZHANG X,ZHENG H T,et al.Shufflenet v2:Practical guidelines for efficient cnn architecture design[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:116-131.
[17]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.Squeeze-Net:AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size[J].arXiv:1602.07360,2016.
[18]WONG A,FAMUORI M,SHAFIEE M J,et al.YOLO nano:A highly compact you only look once convolutional neural network for object detection[J].arXiv:1910.01271,2019.
[19]WANG Q,WU B,ZHU P,et al.ECA-net:Efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:11534-11542.
[20]RADOSAVOVIC I,KOSARAJU R P,GIRSHICK R,et al.Designing network design spaces[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10428-10436.
[21]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[22]ORHAN A E,PITKOW X.Skip connections eliminate singularities[J].arXiv:1701.09175,2017.
[23]HE K,ZHANG X,REN S,et al.Deep Residual Learning for Image Recognition[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2016:770-778.
[24]ZHENG Z,WANG P,LIU W,et al.Distance-IoU Loss:Faster and Better Learning for Bounding Box Regression[C]//AAAI.2020:12993-13000.
[25]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[26]LI Y,HUANG H,XIE Q,et al.Research on a Surface Defect Detection Algorithm Based on MobileNet-SSD[J].Applied Sciences,2018,8(9):1678.
[1] LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[2] WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[3] CHEN Yong-ping, ZHU Jian-qing, XIE Yi, WU Han-xiao, ZENG Huan-qiang. Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss [J]. Computer Science, 2022, 49(6A): 424-428.
[4] CHEN Jia-zhou, ZHAO Yi-bo, XU Yang-hui, MA Ji, JIN Ling-feng, QIN Xu-jia. Small Object Detection in 3D Urban Scenes [J]. Computer Science, 2022, 49(6): 238-244.
[5] HU Fu-yuan, WAN Xin-jun, SHEN Ming-fei, XU Jiang-lang, YAO Rui, TAO Zhong-ben. Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 10-24.
[6] XU Tao, CHEN Yi-ren, LYU Zong-lei. Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm [J]. Computer Science, 2022, 49(4): 239-246.
[7] YUAN Lei, LIU Zi-yan, ZHU Ming-cheng, MA Shan-shan, CHEN Lin-zhou-ting. Improved YOLOv3 Remote Sensing Target Detection Based on Improved Dense Connection and Distributional Ranking Loss [J]. Computer Science, 2021, 48(9): 168-173.
[8] GONG Hao-tian, ZHANG Meng. Lightweight Anchor-free Object Detection Algorithm Based on Keypoint Detection [J]. Computer Science, 2021, 48(8): 106-110.
[9] LI Lin, LIU Xue-liang, ZHAO Ye, JI Ping. Low Light Image Fusion Detection Method Based on Lego Filter and SSD [J]. Computer Science, 2021, 48(7): 213-218.
[10] XIN Yuan-xue, SHI Peng-fei, XUE Rui-yang. Moving Object Detection Based on Region Extraction and Improved LBP Features [J]. Computer Science, 2021, 48(7): 233-237.
[11] ZHANG Man, LI Jie, ZHU Xin-zhong, SHEN Ji, CHENG Hao-tian. Augmentation Technology of Remote Sensing Dataset Based on Improved DCGAN Algorithm [J]. Computer Science, 2021, 48(6A): 80-84.
[12] PAN Ming-yuan, SONG Hui-hui, ZHANG Kai-hua, LIU Qing-shan. Learning Global Guided Progressive Feature Aggregation Lightweight Network for Salient Object Detection [J]. Computer Science, 2021, 48(6): 103-109.
[13] ZHANG Shao-qin, DU Sheng-dong, ZHANG Xiao-bo, LI Tian-rui. Social Rumor Detection Method Based on Multimodal Fusion [J]. Computer Science, 2021, 48(5): 117-123.
[14] SHI Xian-rang, SONG Ting-lun, TANG De-zhi, DAI Zhen-yong. Novel Deep Learning Algorithm for Monocular Vision:H_SFPN [J]. Computer Science, 2021, 48(4): 130-137.
[15] YUAN Xing-xing, WU Qin. Object Detection in Remote Sensing Images Based on Saliency Feature and Angle Information [J]. Computer Science, 2021, 48(4): 174-179.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!