Computer Science ›› 2021, Vol. 48 ›› Issue (6A): 118-121.doi: 10.11896/jsjkx.200700122

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Tiny YOLOv3 Target Detection Algorithm Based on Region Activation Strategy

YU Han-qing, YANG Zhen, YIN Zhi-jian   

  1. School of Communication and Electronics,Jiangxi Science and Technology Normal University,Nanchang 330000,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:YU Han-qing,born in 1996,postgra-duate.Her main research interests include object detection and so on.
    YANG Zhen,born in 1985,Ph.D.His main research interests include pattern recognition and intelligent systems,machine learning and image processing.
  • Supported by:
    National Natural Science Foundation of China(61866016),Youth Top-notch Project of Jiangxi Science and Technology Normal University(2018QNBJRC002),General Project of Jiangxi Provincial Department of Education (GJJ190587),General Project of Natural Science Foundation of Jiangxi Province(20202BABL202014) and Oracle Information Processing Ministry of Education Key Laboratory Open Project Funding Project(OIP2019E008).

Abstract: Aiming at the problem of low detection accuracy of Tiny YOLOv3 model,a method to introduce segmentation information into deep convolutional neural network structure is proposed.During the model training,the real position information of the target is added to the network layer,and these target areas are manually activated.The size of the excitation gradually decreases as the training proceeds until it drops to zero.The test results show that on the VOC2007 data set,the average accuracy of the improved Tiny YOLOv3 model is increased to 58.9%,and the detection speed is consistent with the original model to meet the needs of real-time detection.

Key words: Deep convolutional neural network, Location information, Segmentation information, Tiny YOLOv3

CLC Number: 

  • TP391
[1] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machince Intelligence,2017,39(6):1137-1149.
[2] CHEN C,QI F.The development of convolutional neuralnet-work and its application in the field of computer vision[J].Computer Science,2019,46(3):69-79.
[3] REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-Time Object Detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[4] REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017.
[5] REDMON J,FARHADI A.YOLOv3:An Incremental Improvement[J].arXiv:1804.02767,2018.
[6] BOCHKOVSKIY A,WANG C Y,LIAO H Y M.YOLOv4:Op-timal Speed and Accuracy of Object Detection[J].arXiv:2004.10934,2020.
[7] LIU Z H.Structural model learning based on local feat-ures and its application in target detection and location[D].Shanghai:Shanghai Jiaotong University,2012.
[8] DAI J F,HE K M,SUN J.Instance-aware semantic segmentation via multi-task network cascades[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:3150-3158.
[9] HARIHARAN B,ARBELAEZ P,GIRSHICK R,et al.Simultaneous detection and segmentation[C]//European Conferenc on Computer Vision.Springer,2014:297-312.
[10] GIDARIS S,KOMODAKIS N.Object detection via a multi-region and semantic segmentation-aware cnn model[C]//Procee-dings of the IEEE International Conference on Computer Vision.2015:1134-1142.
[11] ZHANG Z S,QIAO S Y,XIE C H,et al.Single-shot object detection with enriched semantics[R].Technical report,Center for Brains,Minds and Machines (CBMM),2018.
[12] HE P,HUANG W L,HE T,et al.Single Shot Text Detector with Regional Attention[C]//2017 IEEE International Conference on Computer Vision(ICCV).IEEE,2017.
[13] DERAKHSHANI M M,MASOUDNIA S,SHAKER A H,et al.Assisted Excitation of Activations:A Learning Technique to Improve Object Detectors[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2020.
[14] HE T,ZHANG Z,ZHANG H,et al.Bag of Tricks for ImageClassification with Convolutional Neural Networks[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2019.
[15] EVERINGHAM M,VAN GOOL L,WILLIAMS C K,et al.The pascal visual object classes (voc) challenge[J].International Journal of Computer Vision,2010,88(2):303-338.
[16] BENGIO Y S,LOURADOUR J,COLLOBERT R,et al.Curriculum learning[C]//Proceedings of the 26th Annual International Conference on Machine Learning.ACM,2009:41-48.
[1] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[2] HU Fu-yuan, WAN Xin-jun, SHEN Ming-fei, XU Jiang-lang, YAO Rui, TAO Zhong-ben. Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 10-24.
[3] CHENG Song-sheng, PAN Jin-shan. Video Super-resolution Method Based on Deep Learning Feature Warping [J]. Computer Science, 2021, 48(7): 184-189.
[4] SHI Xian-rang, SONG Ting-lun, TANG De-zhi, DAI Zhen-yong. Novel Deep Learning Algorithm for Monocular Vision:H_SFPN [J]. Computer Science, 2021, 48(4): 130-137.
[5] XU Yu-jun, LI Chen. Light-weight Object Detection Network Optimized Based on YOLO Family [J]. Computer Science, 2021, 48(11A): 265-269.
[6] XU Hua-jie, YANG Yang, LI Gui-lan. Material Recognition Method Based on Attention Mechanism and Deep Convolutional Neural Network [J]. Computer Science, 2021, 48(10): 220-225.
[7] CHEN Qing-chao, WANG Tao, YIN Shi-zhuang, FENG Wen-bo. Chain Merging Method for Unknown Text Protocol Candidate Keyword Stored in Multi-levelDictionary [J]. Computer Science, 2020, 47(12): 332-335.
[8] QU Jia, SHI Zeng-lin, YE Yang-dong. Unbalanced Crowd Density Estimation Based on Convolutional Features [J]. Computer Science, 2018, 45(8): 236-241.
[9] LEI Qian, HAO Cun-ming,ZHANG Wei-ping. Vehicle Recognition Based on Super-resolution and Deep Neural Networks [J]. Computer Science, 2018, 45(6A): 230-233.
[10] PAN Xiang and WANG Heng. Similar Character Recognition of License Plates Based on Deep Learning [J]. Computer Science, 2017, 44(Z6): 229-231.
[11] CHEN Xiang-tao and XIAO Bi-wen. Emerging Sequences Pattern Mining Based on Location Information [J]. Computer Science, 2017, 44(7): 175-179.
[12] WANG Qing-yun and CHENG Chun-ling. Mobile SNS Data Dynamic Partitioning and Replication Algorithm Based on Location Information [J]. Computer Science, 2017, 44(3): 220-225.
[13] XI Rui, LI Yu-jun and HOU Meng-shu. Survey on Indoor Localization [J]. Computer Science, 2016, 43(4): 1-6.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!