Computer Science ›› 2021, Vol. 48 ›› Issue (10): 233-238.doi: 10.11896/jsjkx.200900172

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Small Object Detection Oriented Improved-RetinaNet Model and Its Application

LUO Yue-tong, JIANG Pei-feng, DUAN Chang, ZHOU Bo   

  1. Visualization & Cooperative Computing,Hefei University of Technology,Hefei 230601,China
  • Received:2020-09-24 Revised:2021-02-02 Online:2021-10-15 Published:2021-10-18
  • About author:LUO Yue-tong,born in 1978,Ph.D,professor,is a member of Chinese Compu-ter Society,Computer Aided Design and Graphics Committee.His main research interests include image processing and scientific visualization.
    ZHOU Bo,born in 1981,Ph.D,associate professor.His main research interests include digital terrain analysis and object detection.
  • Supported by:
    National Natural Science Foundation of China(61602146),National Basic Research Program of China(2017YFB1402200) and Key Science and Technology Program of Anhui Province,China(1604d0802009).

Abstract: Object detection algorithms based on deep learning are widely used in industrial detection.The RetinaNet algorithm has attracted much attention because of its advantages in both speed and accuracy.However,for small objects smaller than 32×32 pixels,the detection accuracy of this algorithm cannot meet the requirements of industrial detection.To this end,this article takes the enhancement of small object training as the basic idea,and makes the following improvements to the RetinaNet algorithm:in the sampling phase,the low-level feature map P2 is added to the FPN to ensure that the small object can be fully sampled,and adaptive training sample selection(ATSS) strategy is introduced to ensure that the detection speed is still fast enough after the feature layer is increased;the loss weight adjustment strategy is adopted in the later training stage to improve the fit of difficult samples in small objects.For the public data set MS COCO 2017 and the LED dispensing industrial data set in practical applications,the detection accuracy of this method for objects smaller than 32×32 increases by 4.1% and 10.7%,respectively,indicating that this method can significantly improve the detection ability of small objects.

Key words: Adaptive sampling, Deep learning, RetinaNet, Small object detection

CLC Number: 

  • TP391
[1]HU G X,YANG Z,HU L,et al.Small object detection withmultiscale features[J].International Journal of Digital Multimedia Broadcasting,2018(2):1-10.
[2]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[3]GIRSHICK R.Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:1440-1448.
[4]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems.2015:91-99.
[5]UIJLINGS J R R,VAN DE SANDE K E A,GEVERS T,et al.Selective search for object recognition[J].International Journal of Computer Vision,2013,104(2):154-171.
[6]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:Unified,real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[7]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//European Conference on Computer Vision.Springer,Cham,2016:21-37.
[8]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[9]SINGH B,DAVIS L S.An analysis of scale invariance in object detection snip[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3578-3587.
[10]SINGH B,NAJIBI M,DAVIS L S.SNIPER:Efficient multi-scale training[C]//Advances in Neural Information Processing Systems.2018:9310-9320.
[11]HU P,RAMANAN D.Finding tiny faces[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:951-959.
[12]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[13]QIN Z,LI Z,ZHANG Z,et al.ThunderNet:Towards Real-Time Generic Object Detection on Mobile Devices[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:6718-6727.
[14]PANG J,CHEN K,SHI J,et al.Libra r-cnn:Towards balanced learning for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:821-830.
[15]BAI Y,ZHANG Y,DING M,et al.Sod-mtgan:Small object detection via multi-task generative adversarial network[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:206-221.
[16]LI J,LIANG X,WEI Y,et al.Perceptual generative adversarial networks for small object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1222-1230.
[17]NOH J,BAE W,LEE W,et al.Better to Follow,Follow to Be Better:Towards Precise Supervision of Feature Super-Resolution for Small Object Detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:9725-9734.
[18]ZHU C,TAO R,LUU K,et al.Seeing Small Faces from Robust Anchor's Perspective[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5127-5136.
[19]ZHANG S,CHI C,YAO Y,et al.Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection[J].arXiv:1912.02424,2019.
[20]CHEN Y,ZHANG P,LI Z,et al.Stitcher:Feedback-driven Data Provider for Object Detection[J].arXiv:2004.12432,2020.
[21]YU X,GONG Y,JIANG N,et al.Scale Match for Tiny Person Detection[C]//The IEEE Winter Conference on Applications of Computer Vision.2020:1257-1265.
[22]KISANTAL M,WOJNA Z,MURAWSKI J,et al.Augmentation for small object detection[J].arXiv:1902.07296,2019.
[23]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[24]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[25]LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440.
[26]DUAN K,DU D,QI H,et al.Detecting small objects using a channel-aware deconvolutional network[C]//IEEE Transactions on Circuits and Systems for Video Technology.2019.
[27]LI B,LIU Y,WANG X.Gradient harmonized single-stage detector[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019,33:8577-8584.
[28]CHEN K,WANG J,PANG J,et al.MMDetection:Open mmlab detection toolbox and benchmark[J].arXiv:1906.07155,2019.
[1] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[2] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[5] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[6] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[7] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[8] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[9] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[10] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[11] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[12] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[13] LIU Wei-ye, LU Hui-min, LI Yu-peng, MA Ning. Survey on Finger Vein Recognition Research [J]. Computer Science, 2022, 49(6A): 1-11.
[14] SUN Fu-quan, CUI Zhi-qing, ZOU Peng, ZHANG Kun. Brain Tumor Segmentation Algorithm Based on Multi-scale Features [J]. Computer Science, 2022, 49(6A): 12-16.
[15] KANG Yan, XU Yu-long, KOU Yong-qi, XIE Si-yu, YANG Xue-kun, LI Hao. Drug-Drug Interaction Prediction Based on Transformer and LSTM [J]. Computer Science, 2022, 49(6A): 17-21.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!