Computer Science ›› 2020, Vol. 47 ›› Issue (4): 108-111.doi: 10.11896/jsjkx.190600067

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Light-weight Object Detection Network Based on Grouping Heterogeneous Convolution

YAN Xiao-tian, HUANG Shan   

  1. College of Electrical Engineering,Sichuan University,Chengdu 610065,China
  • Received:2019-06-13 Online:2020-04-15 Published:2020-04-15
  • Contact: HUANG Shan,born in 1969,Ph.D,professor,Ph.D supervisor.His main research interests include digital image processing.
  • About author:YAN Xiao-tian,born in 1992,postgra-duate.His main research interests include lightweight object detection networks and so on.

Abstract: The current object detection model has disadvantages like large number of parameters,large size and slow detection speed,and it cannot be applied in real-time scenarios.For example,automatic driving technology requires not only accurate detection to ensure safety,but also rapid detection to ensure real-time decision-making of vehicles.For the above questions,this paper presented an end-to-end light-weight object detection network FGHDet.Firstly,grouping heterogeneous convolution(GHConv) is proposed to solve the problem of low efficiency of HetConv channel by channel convolution.Secondly,the basic module FGH Module is built by combining GHConv and Fire Module.Finally,the end-to-end light-weight object detection network FGHDet is built based on FGH Module.FGHDet reduces the amount of parameters mainly in two ways.One is to reduce the number of input channels of the 3×3 filter,and other is to replace the traditional convolution kernel with GHConv.This paper took KITTI data set as experimental data to complete the training and evaluation of the model on the deep learning framework Keras.The experimental results indicate that the mAP of FGHDet in KITTI data set can reach 74.4%,higher than 70.8% of Faster R-CNN,and the model detection speed is 28.7FPS,better than SqueezeDet,the fastest model in the comparison models.Moreover,the size of the proposed model is only 2.6MB,1/200 times the volume of the Faster R-CNN model.

Key words: FGHDet, Grouping heterogeneous convolution, KITTI, Light-weight, Object detection

CLC Number: 

  • TP391
[1]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks [M]//Advances in Neural Information Processing Systems.Berlin:Springer,2015:91-98.
[2]CHENG Y,WANG D,ZHOU P,et al.A survey of model compression and acceleration for deep neural networks [J].arXiv:1710.09282,2017,[3]CHENG J,WANG P-S,LI G,et al.Recent advances in efficient computation of deep convolutional neural networks [J].Frontiers of Information Technology & Electronic Engineering,2018,19(1):64-77.
[4]JI R R,LIN S H,CHAO F,et al.A review of deep neural network compression and acceleration[J].Journal of Computer Research and Development,2018,55(9):1871-1888.
[5]MOLCHANOV P,TYREE S,KARRAS T,et al.Pruning convolutional neural networks for resource efficient transfer learning [J].arXiv:1611.06440,2016.
[6]LI H,KADAV A,DURDANOVIC I,et al.Pruning filters for efficient convnets [J].arXiv:1608.08710,2016.
[7]YU X,LIU T,WANG X,et al.On compressing deep models by low rank and sparse decomposition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017.
[8]ZHANG D,YANG J,YE D,et al.Lq-nets:Learned quantization for highly accurate and compact deep neural networks[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018.
[9]HINTON G,VINYALS O,DEAN J.Distilling the knowledge in a neural network [J].arXiv:1503.02531,2015.
[10]MISHRA A,MARR D.Apprentice:Using knowledge distillation techniques to improve low-precision network accuracy [J].arXiv:1711.05852,2017.
[11]HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:Efficient convolutional neural networks for mobile vision applications [J].arXiv:1704.04861,2017.
[12]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:Inverted residuals and linear bottlenecks [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018.
[13]HOWARD A,SANDLER M,CHU G,et al.Searching for mobilenetv3 [J].arXiv:1905.02244,2019.
[14]ZHANG X,ZHOU X,LIN M,et al.Shufflenet:An extremely efficient convolutional neural network for mobile devices[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018.
[15]MA N,ZHANG X,ZHENG H-T,et al.Shufflenet v2:Practical guidelines for efficient cnn architecture design[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018.
[16]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.SqueezeNet:AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size [J].arXiv:1602.07360,2016.
[17]SINGH P,VERMA V K,RAI P,et al.HetConv:Heterogeneous Kernel-Based Convolutions for Deep CNNs [J].arXiv:1903.04120,2019.
[18]LIN M,CHEN Q,YAN S.Network in network [J].arXiv:1312.4400,2013.
[19]GEIGER A,LENZ P,STILLER C,et al.Vision meets robotics:The KITTI dataset [J].The International Journal of Robotics Research,2013,32(11):1231-1237.
[20]WU B,IANDOLA F,JIN P H,et al.Squeezedet:Unified,small,low power fully convolutional neural networks for real-time object detection for autonomous driving[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.2017.
[1] LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[2] WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[3] CHEN Yong-ping, ZHU Jian-qing, XIE Yi, WU Han-xiao, ZENG Huan-qiang. Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss [J]. Computer Science, 2022, 49(6A): 424-428.
[4] CHEN Jia-zhou, ZHAO Yi-bo, XU Yang-hui, MA Ji, JIN Ling-feng, QIN Xu-jia. Small Object Detection in 3D Urban Scenes [J]. Computer Science, 2022, 49(6): 238-244.
[5] HU Fu-yuan, WAN Xin-jun, SHEN Ming-fei, XU Jiang-lang, YAO Rui, TAO Zhong-ben. Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 10-24.
[6] XU Tao, CHEN Yi-ren, LYU Zong-lei. Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm [J]. Computer Science, 2022, 49(4): 239-246.
[7] YUAN Lei, LIU Zi-yan, ZHU Ming-cheng, MA Shan-shan, CHEN Lin-zhou-ting. Improved YOLOv3 Remote Sensing Target Detection Based on Improved Dense Connection and Distributional Ranking Loss [J]. Computer Science, 2021, 48(9): 168-173.
[8] GONG Hao-tian, ZHANG Meng. Lightweight Anchor-free Object Detection Algorithm Based on Keypoint Detection [J]. Computer Science, 2021, 48(8): 106-110.
[9] LI Lin, LIU Xue-liang, ZHAO Ye, JI Ping. Low Light Image Fusion Detection Method Based on Lego Filter and SSD [J]. Computer Science, 2021, 48(7): 213-218.
[10] XIN Yuan-xue, SHI Peng-fei, XUE Rui-yang. Moving Object Detection Based on Region Extraction and Improved LBP Features [J]. Computer Science, 2021, 48(7): 233-237.
[11] ZHANG Man, LI Jie, ZHU Xin-zhong, SHEN Ji, CHENG Hao-tian. Augmentation Technology of Remote Sensing Dataset Based on Improved DCGAN Algorithm [J]. Computer Science, 2021, 48(6A): 80-84.
[12] PAN Ming-yuan, SONG Hui-hui, ZHANG Kai-hua, LIU Qing-shan. Learning Global Guided Progressive Feature Aggregation Lightweight Network for Salient Object Detection [J]. Computer Science, 2021, 48(6): 103-109.
[13] ZHANG Shao-qin, DU Sheng-dong, ZHANG Xiao-bo, LI Tian-rui. Social Rumor Detection Method Based on Multimodal Fusion [J]. Computer Science, 2021, 48(5): 117-123.
[14] SHI Xian-rang, SONG Ting-lun, TANG De-zhi, DAI Zhen-yong. Novel Deep Learning Algorithm for Monocular Vision:H_SFPN [J]. Computer Science, 2021, 48(4): 130-137.
[15] YUAN Xing-xing, WU Qin. Object Detection in Remote Sensing Images Based on Saliency Feature and Angle Information [J]. Computer Science, 2021, 48(4): 174-179.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!