计算机科学 ›› 2021, Vol. 48 ›› Issue (11): 268-275.doi: 10.11896/jsjkx.200900098

• 计算机图形学&多媒体 • 上一篇    下一篇

基于改进YOLO v4的安全帽佩戴检测算法

金雨芳, 吴祥, 董辉, 俞立, 张文安   

  1. 浙江工业大学信息工程学院 杭州310023
    浙江省嵌入式系统联合重点实验室 杭州310023
  • 收稿日期:2020-09-13 修回日期:2021-02-09 出版日期:2021-11-15 发布日期:2021-11-10
  • 通讯作者: 吴祥(xiangwu@zjut.edu.cn)
  • 作者简介:m17681535896@163.com
  • 基金资助:
    :浙江省重点研发计划(2020C01109);NSFC-浙江两化融合联合基金(U1709213);浙江新苗人才计划(GZ20411030017)

Improved YOLO v4 Algorithm for Safety Helmet Wearing Detection

JIN Yu-fang, WU Xiang, DONG Hui, YU Li, ZHANG Wen-an   

  1. College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China
    United Key Laboratory of Embedded System of Zhejiang Province,Hangzhou 310023,China
  • Received:2020-09-13 Revised:2021-02-09 Online:2021-11-15 Published:2021-11-10
  • About author:JIN Yu-fang,born in 1994,master.Her main research interests include deep learning and target detection.
    WU Xiang,born in 1990,Ph.D.His main research interests include intelligent learning algorithm and networked motion control.
  • Supported by:
    Key Research and Development Program of Zhejiang Province(2020C01109),NSFC-Zhejiang Joint Fund for the Integration of Industrialization and Development Plan of Zhejiang Province(U1709213) and Zhejiang Xinmiao Talent Projects(GZ20411030017).

摘要: 安全生产管理是建筑、重工业等高危企业发展的重要方针,安全帽在施工生产环境中对人员头部防护起着关键作用,因此加强安全帽佩戴监管十分必要。近年来,基于图像视觉的安全帽佩戴监测方法成为了企业实施管理的主要手段,如何提高安全帽佩戴检测精度和检测速度是应用的关键难题。针对上述问题,文中提出了一种基于改进YOLO v4的安全帽佩戴检测算法。首先,在YOLO v4算法的3个特征图输出的基础上增加了128×128特征图输出,从而将特征图输出的8倍下采样改为4倍下采样,为后续特征融合提供了更多小目标特征。其次,基于密集连接的思想对特征融合模块进行改进以实现特征重用,使得负责小目标检测的Yolo Head分类器可以结合不同层次特征层的特征,从而得到更好的目标检测分类结果。最后,对比实验的结果表明,所提方法的平均精度高达91.17%,相比原网络检测精度提高了2.96%,检测速度基本不变,可达52.9 frame/s,从而在满足实时检测需求的同时可以得到更优的检测精度,有效实现了安全帽佩戴的高速高精度检测。

关键词: K-means聚类, YOLO v4, 安全帽佩戴检测, 深度学习, 小目标检测

Abstract: Safety production management is an important policy for the development of high-risk enterprises such as the construction industry and heavy industry,and safety helmets play a key role in head protection in the production environment.Therefore,it is necessary to strengthen the supervision of helmet wearing.In recent years,the monitoring method of helmet wearing based on image vision has become the main means for enterprises to implement management.How to improve the detection accuracy and speed of helmet wearing is a crucial issue for applications.To deal with this issue,an improved YOLO v4 algorithm is proposed to promote the accuracy and efficiency of safety helmet wearing detection in this paper.First,a 128×128 feature map output is added to the original three feature map outputs of the YOLO v4 algorithm,and the 8 times downsampling of the feature map output is changed to 4 times downsampling to provide more small target features for subsequent feature fusion.Second,the feature fusion module is improved based on the idea of dense connection to realize feature reuse,so that the Yolo Head classifier responsible for small target detection can utilize the features of different levels,to obtain better target detection and classification results.Finally,comparative experiments are carried out.The results show that the average accuracy of the proposed method is 2.96% higher than the original network detection accuracy to be 91.17%,and the detection speed is basically unchanged to be 52.9 frame/s.Thereupon,the proposed algorithm can achieve better detection accuracy while meeting real-time detection requirements,and effectively realize the high-speed and high-precision detection of helmet wearing.

Key words: K-means clustering, Deep learning, Safety helmet wearing detection, Small target detection, YOLO v4

中图分类号: 

  • TP391
[1]ZHANG P,SONG Y F,ZONG L B,et al.Advances in 3D Object Detection:A Brief Survey[J].Computer Science,2020,47(4):94-102.
[2]LIN J,DANG W C,PAN L H,et al.Safety Helmet Detection Based on YOLO[J].Computer Systems & Applications,2019,28(9):174-179.
[3]FANG M,SUN T T,SHAO Z.Fast Helmet-wearing-condition Detection Based on Improved YOLO v2[J].Optics and Precision Engineering,2019,27(5):1196-1205.
[4]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single ShotMultiBox Detector[C]//European Conference on Computer Vision,Amsterdam.Netherlands,2016:21-37.
[5]WANG H L,QI X L,WU G S.Research Progress of Object Detection Technology Based on Convolutional Neural Network in Deep Learning[J].Computer Science,2018,45(9):11-19.
[6]GIRSHICK R,DONAHUE J,DARRELLAND T,et al.RichFeature Hierarchies for Object Detection and Semantic Segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition,Columbus,OH.IEEE,2014:580-587.
[7]GIRSHICK R.Fast R-CNN[C]//International Conference on Computer Vision.IEEE,2015:1440-1448.
[8]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[9]DAI J F,LI Y,HE K M,et al.R-FCN:Object Detection via Region-based Fully Convolutional Networks[C]//Conference and Workshop on Neural Information Processing Systems(NIPS).Barcelona,SPAIN,2016:379-387.
[10]REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-time Object Detection[C]//IEEE Confe-rence on Computer Vision and Pattern Recognition,Las Vegas,NV.IEEE,2016:779-788.
[11]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//IEEE Conference on Computer Vision and Pattern Recognition,Honolulu,HI.IEEE,2017:6517-6525.
[12]REDMON J,FARHADI A.YOLOv3:An Incremental Improvement[C]//IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake City,UT,2018.
[13]BOCHKOVSKIY A,WANG C Y,LIAO H Y.YOLO v4:Optimal Speed and Accuracy of Object Detection[J].arXiv:2004.10934v1,2020.
[14]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal Loss for Dense Object Detection[C]//2017 IEEE International Conference on Computer Vision (ICCV).Venice,2017:2999-3007.
[15]DUAN K,BAI S,XIE L,et al.CenterNet:Keypoint Triplets for Object Detection[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV).Seoul,Korea (South),2019:6569-6578.
[16]LAW H,DENG J.CornerNet:Detecting Objects as Paired Keypoints[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:734-750.
[17]LAW H,TENG Y,RUSSAKOVSKY O,et al.CornerNet-Lite:Efficient Keypoint Based Object Detection[J].arXiv:1904.08900,2019.
[18]TIAN Z,SHEN C,CHEN H,et al.FCOS:Fully Convolutional One-Stage Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV).Seoul,Korea (South),2020:9627-9636.
[19]YANG L Q,CAI L Q,GU S.Detection on Wearing Behavior of Safety Helmet Based on Machine Learning Method[J].Journal of Safety Science and Technology,2019,15(10):152-157.
[20]LONG X T,CUI W P,ZHENG Z.Safety Helmet Wearing Detection Based on Deep Learning[C]//2019 IEEE 3rd Information Technology,Networking,Electronic and Automation Control Conference (ITNEC).Chengdu,China,2019:2495-2499.
[21]LIU J,XIE Y H.Implementation of an Improved YOLO Algorithm for Intelligent Video Surveillance System[J].Information Technology and Network Security,2019,38(4):102-106.
[22]WU J X,CAI N,CHEN W J,et al.Automatic Detection ofHardhats Worn by Construction Personnel:A Deep Learning Approach and Benchmark Dataset[J].Automation in Construction,2019,106:102894.
[23]ZHANG M,LI J,DING R L,et al.Remote Sensing Image Object Detection Technology Based on Improved YOLO V2 Algorithm[J].Computer Science,2020,47(6A):176-180.
[24]ZHAO Y,CHEN Q,CAO W G,et al.Deep Learning for Risk Detection and Trajectory Tracking at Construction Sites[J].IEEE Access,2019,7:30905-30912.
[25]WU F,JIN G Q,GAO M Y,et al.Helmet Detection BasedonImproved YOLO V3 Deep Model[C]//2019 IEEE 16th International Conference on Networking,Sensing and Control (ICNSC),Banff,AB.Canada,2019:363-368.
[26]HUANG G,LIU Z,VAN D M L,et al.Densely Connected Convolutional Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,Honolulu,HI.2017:4700-4708.
[27]XU S K,NI C H,JI C C,et al.Image Caption of Safety Helmets Wearing in Construction Scene Based on YOLO v3[J].Compu-ter Science,2020,47(8):233-240.
[28]GU Y W,XU S K,WANG Y R,et al.An Advanced DeepLearning Approach for Safety Helmet Wearing Detection[C]//2019 International Conference on Internet of Things and IEEE Green Computing and Communications and IEEE Cyber.Physical and Social Computing (CPSCom) and IEEE Smart Data.Atlanta,GA,USA,2019:669-674.
[29]FANG Q,LI H,LUO X C,et al.Detecting Non-hardhat-use by a Deep Learning Method from Far-field Surveillance Videos[J].Automation in Construction,2018,85(JAN.):1-9.
[30]ZHENG Z,WANG P,LIU W,et al.Distance-IOU Loss:Faster and Better Learning for Bounding Box Regression[C]//AAAI Conference on Artificial Intelligence.NY,USA,2020.
[31]LIU S,QI L,QIN H F,et al.Path Aggregation Network for Instance Segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City,UT,2018:8759-8768.
[32]WANG C Y,LIAO H Y,YEH I H,et al.CSPNet:A New Back-bone that can Enhance Learning Capability of CNN[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).Seattle,WA,USA,2020:1571-1580.
[33]CHOI J,CHUN D,KIM H,et al.Gaussian YOLO v3:An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV).Seoul,Korea (South),2019:502-511.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[3] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[8] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[9] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[10] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[11] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[12] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[13] 刘伟业, 鲁慧民, 李玉鹏, 马宁.
指静脉识别技术研究综述
Survey on Finger Vein Recognition Research
计算机科学, 2022, 49(6A): 1-11. https://doi.org/10.11896/jsjkx.210400056
[14] 孙福权, 崔志清, 邹彭, 张琨.
基于多尺度特征的脑肿瘤分割算法
Brain Tumor Segmentation Algorithm Based on Multi-scale Features
计算机科学, 2022, 49(6A): 12-16. https://doi.org/10.11896/jsjkx.210700217
[15] 康雁, 徐玉龙, 寇勇奇, 谢思宇, 杨学昆, 李浩.
基于Transformer和LSTM的药物相互作用预测
Drug-Drug Interaction Prediction Based on Transformer and LSTM
计算机科学, 2022, 49(6A): 17-21. https://doi.org/10.11896/jsjkx.210400150
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!