基于YOLO优化的轻量级目标检测网络

doi:10.11896/jsjkx.201000152

Abstract

Abstract: Object detection is an active research field in the computer vision field.It is a very effective method to improve object detection precision by designing a large-scale deep convolutional neural network.However,it is unfavorable to deploy a large-scale object detection network in memory-limited applications.To solve the above problems,this paper proposes a light-weight object detection network which is based on design principles from the YOLO family of single-shot object detection network architectures.This network integrates the Ghost Module in GhostNet,in addition,a better Efficient Channel Attention (ECA) module is added to the convolution block by referring to the Squeeze-and-Excitation (SE) module in MobileNet-v3.This module can make better use of the available network capacity,making the network achieve a strong balance between reducing the complexity of architecture and computation and improving the performance of the model.In addition,Distance-IoU loss is used to solve the problem of inaccurate regression position of bounding box and effectively speeds up network convergence.Finally,the number of parameters of the model was compressed to 1.54 MB less than YOLO Nano (4.0MB),and the mAP on the VOC2007 data set was 72.1% higher than the existing YOLO Nano (69.1%).

Key words: Light-weight, Object detection, Pascal VOC, YOLO deep convolutional neural network

CLC Number:

TP391

XU Yu-jun, LI Chen. Light-weight Object Detection Network Optimized Based on YOLO Family[J].Computer Science, 2021, 48(11A): 265-269.

References

[1]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//European Conference on Computer Vision.Springer,Cham,2016:21-37.
[2]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[3]HE K,GKIOXARI G,DOLLÁR P,et al.Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[4]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems.2015:91-99.
[5]ZOPH B,CUBUK E D,GHIASI G,et al.Learning data augmentation strategies for object detection[J].arXiv:1906.11172,2019.
[6]LIU Z,LI J,SHEN Z,et al.Learning efficient convolutional networks through network slimming[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2736-2744.
[7]ZHANG D,YANG J,YE D,et al.Lq-nets:Learned quantization for highly accurate and compact deep neural networks[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:365-382.
[8]REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7263-7271.
[9]REDMON J,FARHADI A.Yolov3:An incremental improvement[J].arXiv:1804.02767,2018.
[10]HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:Efficient convolutional neural networks for mobile vision applications[J].arXiv:1704.04861,2017.
[11]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4510-4520.
[12]HOWARD A,SANDLER M,CHU G,et al.Searching for mobilenetv3[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:1314-1324.
[13]TAN M,PANG R,LE Q V.Efficientdet:Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10781-10790.
[14]HAN K,WANG Y,TIAN Q,et al.GhostNet:More features from cheap operations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:1580-1589.
[15]ZHANG X,ZHOU X,LIN M,et al.Shufflenet:An extremely efficient convolutional neural network for mobile devices[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:6848-6856.
[16]MA N,ZHANG X,ZHENG H T,et al.Shufflenet v2:Practical guidelines for efficient cnn architecture design[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:116-131.
[17]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.Squeeze-Net:AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size[J].arXiv:1602.07360,2016.
[18]WONG A,FAMUORI M,SHAFIEE M J,et al.YOLO nano:A highly compact you only look once convolutional neural network for object detection[J].arXiv:1910.01271,2019.
[19]WANG Q,WU B,ZHU P,et al.ECA-net:Efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:11534-11542.
[20]RADOSAVOVIC I,KOSARAJU R P,GIRSHICK R,et al.Designing network design spaces[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10428-10436.
[21]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[22]ORHAN A E,PITKOW X.Skip connections eliminate singularities[J].arXiv:1701.09175,2017.
[23]HE K,ZHANG X,REN S,et al.Deep Residual Learning for Image Recognition[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2016:770-778.
[24]ZHENG Z,WANG P,LIU W,et al.Distance-IoU Loss:Faster and Better Learning for Bounding Box Regression[C]//AAAI.2020:12993-13000.
[25]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[26]LI Y,HUANG H,XIE Q,et al.Research on a Surface Defect Detection Algorithm Based on MobileNet-SSD[J].Applied Sciences,2018,8(9):1678.

Related Articles 15

[1]	LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[2]	WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[3]	CHEN Yong-ping, ZHU Jian-qing, XIE Yi, WU Han-xiao, ZENG Huan-qiang. Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss [J]. Computer Science, 2022, 49(6A): 424-428.
[4]	CHEN Jia-zhou, ZHAO Yi-bo, XU Yang-hui, MA Ji, JIN Ling-feng, QIN Xu-jia. Small Object Detection in 3D Urban Scenes [J]. Computer Science, 2022, 49(6): 238-244.
[5]	HU Fu-yuan, WAN Xin-jun, SHEN Ming-fei, XU Jiang-lang, YAO Rui, TAO Zhong-ben. Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 10-24.
[6]	XU Tao, CHEN Yi-ren, LYU Zong-lei. Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm [J]. Computer Science, 2022, 49(4): 239-246.
[7]	YUAN Lei, LIU Zi-yan, ZHU Ming-cheng, MA Shan-shan, CHEN Lin-zhou-ting. Improved YOLOv3 Remote Sensing Target Detection Based on Improved Dense Connection and Distributional Ranking Loss [J]. Computer Science, 2021, 48(9): 168-173.
[8]	GONG Hao-tian, ZHANG Meng. Lightweight Anchor-free Object Detection Algorithm Based on Keypoint Detection [J]. Computer Science, 2021, 48(8): 106-110.
[9]	LI Lin, LIU Xue-liang, ZHAO Ye, JI Ping. Low Light Image Fusion Detection Method Based on Lego Filter and SSD [J]. Computer Science, 2021, 48(7): 213-218.
[10]	XIN Yuan-xue, SHI Peng-fei, XUE Rui-yang. Moving Object Detection Based on Region Extraction and Improved LBP Features [J]. Computer Science, 2021, 48(7): 233-237.
[11]	ZHANG Man, LI Jie, ZHU Xin-zhong, SHEN Ji, CHENG Hao-tian. Augmentation Technology of Remote Sensing Dataset Based on Improved DCGAN Algorithm [J]. Computer Science, 2021, 48(6A): 80-84.
[12]	PAN Ming-yuan, SONG Hui-hui, ZHANG Kai-hua, LIU Qing-shan. Learning Global Guided Progressive Feature Aggregation Lightweight Network for Salient Object Detection [J]. Computer Science, 2021, 48(6): 103-109.
[13]	ZHANG Shao-qin, DU Sheng-dong, ZHANG Xiao-bo, LI Tian-rui. Social Rumor Detection Method Based on Multimodal Fusion [J]. Computer Science, 2021, 48(5): 117-123.
[14]	SHI Xian-rang, SONG Ting-lun, TANG De-zhi, DAI Zhen-yong. Novel Deep Learning Algorithm for Monocular Vision:H_SFPN [J]. Computer Science, 2021, 48(4): 130-137.
[15]	YUAN Xing-xing, WU Qin. Object Detection in Remote Sensing Images Based on Saliency Feature and Angle Information [J]. Computer Science, 2021, 48(4): 174-179.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Light-weight Object Detection Network Optimized Based on YOLO Family

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0