Computer Science ›› 2023, Vol. 50 ›› Issue (11A): 230200163-6.doi: 10.11896/jsjkx.230200163

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Electrolyzer Equipment and Sample Detection Method Based on Multi-scale Improved YOLOv5

WU Jiaojiao, LIU Zheng   

  1. School of Electrical and Information Engineering,Changsha University of Science and Technology,Changsha 410114,China
  • Published:2023-11-09
  • About author:WU Jiaojiao,born in 1998,postgra-duate,is a member of China Computer Federation.Her main research interest is visual image processing.
    LIU Zheng,born in 1977,Ph.D.His main research interests include power electronics and power robots.

Abstract: Aiming at the real-time recognition problem of electrolytic cell transfer robot in electrolytic aluminum workshop,there is a problem that the size difference of the recognition object is too large for the target detection of electrolytic cell equipment and aluminum ingot samples.Generally,the parameters of the target detection algorithm are large,and it is difficult to meet the requirements of real-time detection when deployed on electrolyzer transfer robots.Therefore,a lightweight multi-scale YOLOv5 network model that solves the excessive difference in target size is proposed,and the backbone feature extraction network is replaced by a lightweight Shufflenet V2 network.Add SE attention mechanism to improve the accuracy of small target recognition.In the enhanced feature extraction network,a shallow detection layer is added as the detection layer for smaller targets to achieve the recognition accuracy of multi-scale and large size changes.Experimental results show that the average detection accuracy of the improved YOLOv5 algorithm in the electrolyzer equipment and sample identification of the electrolyzer transfer robot is 93.5%,which is 1.5% higher than the average detection accuracy of the YOLOv5 algorithm,the number of model parameters reduces by about 39.4%,and the average detection speed of each picture increases by 2.5 milliseconds,which is conducive to deployment to the Electrolyzer transfer robot.

Key words: YOLOv5, Lightweight, Attention mechanism, Multi-scale object detection, Target size difference

CLC Number: 

  • TP391
[1]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single ShotMulti Box Detector[C]//European Conference on Computer Vision.Cham:Springer,2015:21-37.
[2]LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of The IEEE Conference on Computer Vision and Pattern Recognition.2017:936-944.
[3]GIRSHICK R.Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:1440-1448.
[4]ZHANG Y,ZHANG M L,LV X L,et al.Review of deep lear-ning small target detection algorithm[J].Computer Engineering and Application,2022,58(15):1-17.
[5]FAN L L,ZHAO H W,ZHAO H Y,et al.Review of object detection based on deep convolutional neural network[J].Optics and Precision Engineering,2020,28(5):1152-1164.
[6]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[7]TAN M,PANG R,LE Q V.Efficientdet:Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10781-10790.
[8]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.Sque-ezeNet:AlexNet-level accuracy with 50x fewer parameters and <0.5 MB model size[J].arXiv:1602.07360,2016.
[9]ZHANG X,ZHOU X,LINM,et al.Shufflenet:An extremely efficient convolutional neural network for mobile devices[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:6848-6856.
[10]CHOLLET F.Xception:Deep learning with depthwise separable convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1251-1258.
[11]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4510-4520.
[12]SZEGEDY C,VANHOUCKE V,IOFFE S,et al.Rethinking the inception architecture for computer vision[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:2818-2826.
[13]XIONG K F,FAN S S,WU J.Identification method of powerequipment in foggy substation based on improved YOLOv4[J].Radio Engineering,2022,52(8):1504-1512.
[14]PI J,LIU Y H,LI J H.Research on lightweight forest fire detection algorithm based on YOLOv5s[J].Journal of Graphics,2023,44(1):26-32.
[15]WANG P F,HUANG H M,WANG M Q.Improved complex road target detection algorithm for YOLOv5[J].Computer Engineering and Applications,2022,58(17):81-92.
[16]WANG G W.Multi-scale feature fusion to improve pedestrian and vehicle detection in YOLOv3 network[D].Nanjing University of Posts and Telecommunications,2019.
[17]DUBEY S R,CHAKRABORTY S,ROY S K,et al.DiffGrad:an optimization method for convolutional neural networks[J].IEEE transactions on neural networks and learning systems,2019,31(11):4500-4511.
[18]LOSHCHILOV I,HUTTER F.Sgdr:Stochastic gradient de-scent with warm restarts[J].arXiv:1608.03983,2016.
[1] LIU Yubo, GUO Bin, MA Ke, QIU Chen, LIU Sicong. Design of Visual Context-driven Interactive Bot System [J]. Computer Science, 2023, 50(9): 260-268.
[2] YI Liu, GENG Xinyu, BAI Jing. Hierarchical Multi-label Text Classification Algorithm Based on Parallel Convolutional Network Information Fusion [J]. Computer Science, 2023, 50(9): 278-286.
[3] LUO Yuanyuan, YANG Chunming, LI Bo, ZHANG Hui, ZHAO Xujian. Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension [J]. Computer Science, 2023, 50(9): 287-294.
[4] ZHONG Yue, GU Jieming, CAO Honglin. Survey of Lightweight Block Cipher [J]. Computer Science, 2023, 50(9): 3-15.
[5] LI Ke, YANG Ling, ZHAO Yanbo, CHEN Yonglong, LUO Shouxi. EGCN-CeDML:A Distributed Machine Learning Framework for Vehicle Driving Behavior Prediction [J]. Computer Science, 2023, 50(9): 318-330.
[6] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[7] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[8] ZHANG Xiao, DONG Hongbin. Lightweight Multi-view Stereo Integrating Coarse Cost Volume and Bilateral Grid [J]. Computer Science, 2023, 50(8): 125-132.
[9] WANG Jiahao, ZHONG Xin, LI Wenxiong, ZHAO Dexin. Human Activity Recognition with Meta-learning and Attention [J]. Computer Science, 2023, 50(8): 193-201.
[10] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[11] YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[12] GAO Xiang, TANG Jiqiang, ZHU Junwu, LIANG Mingxuan, LI Yang. Study on Named Entity Recognition Method Based on Knowledge Graph Enhancement [J]. Computer Science, 2023, 50(6A): 220700153-6.
[13] ZHANG Tao, CHENG Yifei, SUN Xinxu. Graph Attention Networks Based on Causal Inference [J]. Computer Science, 2023, 50(6A): 220600230-9.
[14] CUI Lin, CUI Chenlu, LIU Zhengwei, XUE Kai. Speech Emotion Recognition Based on Improved MFCC and Parallel Hybrid Model [J]. Computer Science, 2023, 50(6A): 220800211-7.
[15] DUAN Jianyong, YANG Xiao, WANG Hao, HE Li, LI Xin. Document-level Relation Extraction of Graph Attention Convolutional Network Based onInter-sentence Information [J]. Computer Science, 2023, 50(6A): 220800189-6.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!