A Novel Deep Learning Algorithm for Monocular Vision: H_SFPN

doi:10.11896/jsjkx.200400090

Computer Science, 2021, Vol. 48, Issue (4): 130-137. doi: 10.11896/jsjkx.200400090

• Computer Graphics & Multimedia •

Novel Deep Learning Algorithm for Monocular Vision: H_SFPN

SHI Xian-rang¹, SONG Ting-lun¹,², TANG De-zhi², DAI Zhen-yong¹

1 College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210001, China
2 Chery Advanced Engineering & Technology Center, Wuhu, Anhui 241006, China

Received: 2020-06-24 Revised: 2020-07-29 Online: 2021-04-15 Published: 2021-04-09
About author: SHI Xian-rang, born in 1996, postgraduate. His main research interests include autonomous driving, object detection and pattern recognition. (nuaasxr@163.com)
SONG Ting-lun, born in 1965, Ph.D, professor, Ph.D supervisor. His main research interests include simulation driven vehicle architecture design and development, autonomous driving vehicles, and data driven energy management strategies for new energy vehicles.
Supported by:
Anhui Provincial Development and Reform Commission’s Major R&D Project.

Abstract

Abstract: This paper proposes H_SFPN, a single-stage deep learning algorithm for monocular visual object detection. Compared with the existing YOLOv3 and CenterNet algorithms, it improves the accuracy of small object detection without sacrificing real-time performance. First, a new backbone network is designed that uses an improved Hourglass model to extract feature maps, making full use of both the high resolution of low-level features and the rich semantic information of high-level features. Second, for the feature map fusion stage, a method named SFPN, based on weighted fusion of feature maps, is proposed. Finally, the loss function for object position and size is improved, which reduces the training error and accelerates convergence. Experimental results on the MSCOCO dataset show that H_SFPN significantly outperforms existing mainstream deep learning object detection algorithms such as Faster-RCNN, YOLOv3 and EfficientDet. In particular, its small object detection metric AP_s is the highest among the compared algorithms, reaching 32.7.
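
The abstract does not say how SFPN computes its fusion weights, so the following is only a minimal sketch of the general idea of weighted feature-map fusion, not the authors' method. It assumes a normalized-weight scheme in the style of the fast normalized fusion used by EfficientDet's BiFPN; the function names, the single-channel list-of-lists representation, and the `eps` value are all hypothetical choices for illustration.

```python
from typing import List, Sequence

# One channel stored as H x W nested lists; purely illustrative.
FeatureMap = List[List[float]]


def upsample_nearest(fmap: FeatureMap, factor: int) -> FeatureMap:
    """Nearest-neighbour upsampling so a coarse map matches a finer one."""
    out: FeatureMap = []
    for row in fmap:
        wide = [value for value in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def weighted_fusion(maps: Sequence[FeatureMap],
                    raw_weights: Sequence[float],
                    eps: float = 1e-4) -> FeatureMap:
    """Fuse same-sized maps using non-negative, normalized weights.

    ASSUMPTION: weights are clipped at zero and divided by their sum
    (plus eps). The paper's SFPN may weight the maps differently.
    """
    if len(maps) != len(raw_weights):
        raise ValueError("need exactly one weight per feature map")
    weights = [max(0.0, w) for w in raw_weights]  # keep weights >= 0
    total = sum(weights) + eps
    height, width = len(maps[0]), len(maps[0][0])
    fused = [[0.0] * width for _ in range(height)]
    for fmap, w in zip(maps, weights):
        for r in range(height):
            for c in range(width):
                fused[r][c] += (w / total) * fmap[r][c]
    return fused


# A fine 2x2 low-level map and a coarse 1x1 high-level map.
low_level = [[1.0, 2.0], [3.0, 4.0]]
high_level = upsample_nearest([[10.0]], factor=2)
fused = weighted_fusion([low_level, high_level], raw_weights=[1.0, 3.0])
```

In a trained network the raw weights would be learned parameters; here they are fixed numbers so that the arithmetic can be checked by hand.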

Key words: Backbone, Deep convolutional neural network, Loss function, Object detection, Weighted fusion

CLC Number:

TP391.41

SHI Xian-rang, SONG Ting-lun, TANG De-zhi, DAI Zhen-yong. Novel Deep Learning Algorithm for Monocular Vision: H_SFPN[J]. Computer Science, 2021, 48(4): 130-137.


