计算机科学 ›› 2022, Vol. 49 ›› Issue (5): 84-91.doi: 10.11896/jsjkx.210400142
李发光, 伊力哈木·亚尔买买提
LI Fa-guang, YILIHAMU·Yaermaimaiti
摘要: 针对无人机在电力巡检过程中对绝缘子及其缺陷检测的准确率较低、实时性较差的问题,提出一种改进CenterNet的绝缘子缺陷检测模型。首先,使用轻量级网络EfficientNet-B0代替原始模型的特征提取网络ResNet18,在保证模型提取能力的同时加快了其推理速度;其次,搭建特征加强模块(Feature Enhancement Module,FEM),并对经过上采样后的特征通道权重进行合理分配,抑制无效特征,并借鉴FPN(Feature Pyramid Networks)融合浅层与深层特征,使特征层信息更加丰富;然后在CenterNet-Head中引入空间和通道混合的注意力机制CA(Coordinate Attention),使类别和位置信息的预测更加准确;最后,使用Soft-NMS解决在模型检测过程中由中心点预测不准导致的“单目标多框”问题。实验结果表明,改进的CenterNet比原始模型的精度提高了11.92%,速度提高了8.95 FPS,模型大小减小了54 MB。与其他检测模型相比,改进模型的精度与速度均有提高,证明了其实时性和鲁棒性。
中图分类号:
[1]LING S,NGUYEN K,ROUX L A,et al.A lattic E based group signature scheme with verifier-local revocation[J].Theoretical Computer Science,2018,730(19):1-20. [2]ZHANG M J,LI H W,ZAHO W H,et al.Applic ation of deep learning in unmanned aerial vehicle patrl inspection of military optical cable line[J].Optical Communication Research,2018(6):57-61. [3]YANG G,SUN C W,WANG D W,et al.A Comparative Study of Transmission Line Component Detection Models Based on UAV Front-end SSD Algorithm[J].Journal of Taiyuan University of Technology,2020,51(2):212-219. [4]REN S Q,HE K M,GIRSHICK R,et al.Faster rcnn:Towards real-time obiect dection with region proposal networks[C]//Advances in Neural Information Processing Systems.Montreal:MIT press,2015:91-99. [5]DAI J,LI Y,HE K M,et al.R-FCN:Objectdetection via regionbased fully convolutional neworks[C]//Advancesin Neural Information Processing System.2016:379-387. [6]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE,2014:580-587. [7]GIRSHICK R.Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision.Piscataway:IEEE,2015:1440-1448. [8]LIU W,ANGUELOV D,ERHAN D,et al.SSD:single shotmultibox detector[C]//Proceedings of the 14th European Conference on Computer Vision(ECCV).Amsterdam,Netherlands:Springer,2016:21-37. [9]FU C Y,LIU W,RANGA A,et al.DSSD:Deconvolutional single shot detector[J].arXiv:1701.06659,2017. [10]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(2):318-327. [11]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unified,real-time object detection[C]//Computer Vision and Pattern Recognltion.IEEE,2016:779-788. [12]REDMON J,FARHADI A,GIRSHICK R.YOLO900:faster,stronger[C]//Computer Vision and Pattern Recognition.IEEE,2017:6517-6525. [13]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[OL].[2019-06-03].https://arXiv.org/abs/1804.02767. [14]LAW H,DENG J.Cornernet:Detecting obiect as paired key-points[C]//Proceedings of the 15th Europe an Conference on Computer Vision.Munich,Germany,2018:734-750. [15]LAW H,TENG Y,RUSSAKOVSKY O,et al.CornerNet-Lite:Efficient keypoint based object[J].arXiv:1904.08900,2019. [16]TIAN Z,SHEN C H,CHEN H,et al.FCOS:Fully Convolu-tional one-stage object detection[J].arXiv:1904.01355,2019. [17]ZHAO R,ZHAO G W,ZHANG J,et al.Real-time fault detection method for high voltage transmission line based on improved algorithm[J].Computer Engineering and Applications,2021,57(17):7. [18]YI J Y,CHEN C F,GONG G Q.Aerial Insulator Detection of Transmission Line Based on Improved Faster RCNN[J].Computer Engineering,2021,47(6):292-298,304. [19]DUAN K W,BAI S,XIE L X,et al.CenterNet:Keypoint triplets for object detection[J].arXiv:1904.08189,2019. [20]AN M,LE Q V.EfficientNet:Rethinking Model Scaling forConvolutional Neural Networks[C]//International Conference on Machine Learning.Long Beach,USA:ICML,2019:6105-6114. [21]LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition.2017. [22]HOU Q B,ZHOU D Q,FENG J S.Corinate Attention for Efficient Mobile Network Design[C]//Proceedings of the 2021 IEEE Conference on Compvision and Patten Recognition.2021. [23]BODLA N,SINGH B,CHELLAPPA R,et al.Soft-nms improving object detection with one line of code[C]//ICCV.Venice,USA:IEEE Press,2017:5562-5570. [24]NEUBECK A,VAN G L.Efficient non-maximum suppression[C]//Proceedings of 18th International Conference on Pattern Recognition.2006:850-855. [25]HE K M,ZHANG X Y,REN S Q,et al.Deep residual learning for image recognition[C]//CVPR.Las Vegas,USA:IEEE Press,2016:770-778. [26]ZHANG T,TAO D,YANG J,et al.Discriminative LocalityAlignment[C]//European Conference on Computer Vision.2008:725-738. [27]NEWELL A,YANG K Y,DENG J.Stacked hourglass networks for human pose estimation[C]//Proceedings of the 14th European Conference on Computer Vision.Amsterdam,the Netherlands,2016:483-499. [28]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense bject detection[C]//Proceedings of the 2017 IEEE Interna-tional Conference on Computer Vision.Piscataway:IEEE,2017:2980-2988. [29]MARK S,ANDREW H,MENGLONG Z,et al.MobileNet v2:Inverted residuals and linear bottlencks[C]//CVPR.Salt Lake City,USA:IEEE Press,2018:4510-4520. [30]JIE H,LI S,GANG S.Squeeze-and-excitation networks[C]//CVPR.Salt Lake City,USA:IEEE Press,2018:7132-7141. [31]ZHAO Q,SHENG T,WANG Y,et al.M2det:A single-shot object detector based on multi-level feature pyramid network[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:9259-9266. [32]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceeding of the IEEE International Conference on Computer Vision.2017:2980-2988. [33]TAN M,PANG R,LE Q,et al.EfficientDet:Scalable and efficient object detection[J].arXiv:1911.09070,2020. [34]WANG C Y,LIAO H Y.YOLOv4:Optional speed and accuracy of objection dection[OL].[2020-04-23].https://arxiv.org/abs/2004.10934. |
[1] | 周芳泉, 成卫青. 基于全局增强图神经网络的序列推荐 Sequence Recommendation Based on Global Enhanced Graph Neural Network 计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085 |
[2] | 戴禹, 许林峰. 基于文本行匹配的跨图文本阅读方法 Cross-image Text Reading Method Based on Text Line Matching 计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032 |
[3] | 周乐员, 张剑华, 袁甜甜, 陈胜勇. 多层注意力机制融合的序列到序列中国连续手语识别和翻译 Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion 计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026 |
[4] | 熊丽琴, 曹雷, 赖俊, 陈希亮. 基于值分解的多智能体深度强化学习综述 Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization 计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112 |
[5] | 饶志双, 贾真, 张凡, 李天瑞. 基于Key-Value关联记忆网络的知识图谱问答方法 Key-Value Relational Memory Networks for Question Answering over Knowledge Graph 计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277 |
[6] | 汪鸣, 彭舰, 黄飞虎. 基于多时间尺度时空图网络的交通流量预测模型 Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction 计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188 |
[7] | 姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046 |
[8] | 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥. 基于注意力机制的医学影像深度哈希检索算法 Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism 计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153 |
[9] | 孙奇, 吉根林, 张杰. 基于非局部注意力生成对抗网络的视频异常事件检测方法 Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection 计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061 |
[10] | 闫佳丹, 贾彩燕. 基于双图神经网络信息融合的文本分类方法 Text Classification Method Based on Information Fusion of Dual-graph Neural Network 计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042 |
[11] | 张颖涛, 张杰, 张睿, 张文强. 全局信息引导的真实图像风格迁移 Photorealistic Style Transfer Guided by Global Information 计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036 |
[12] | 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨. 基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨 Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism 计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224 |
[13] | 程成, 降爱莲. 基于多路径特征提取的实时语义分割方法 Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction 计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157 |
[14] | 徐鸣珂, 张帆. Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法 Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition 计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085 |
[15] | 孟月波, 穆思蓉, 刘光辉, 徐胜军, 韩九强. 基于向量注意力机制GoogLeNet-GMP的行人重识别方法 Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism 计算机科学, 2022, 49(7): 142-147. https://doi.org/10.11896/jsjkx.210600198 |
|