计算机科学 ›› 2021, Vol. 48 ›› Issue (8): 106-110.doi: 10.11896/jsjkx.200700161
龚浩田, 张萌
GONG Hao-tian, ZHANG Meng
摘要: 针对基于关键点的目标检测参数量大、检测框误匹配的问题,提出一种轻量级的基于关键点检测的无锚框目标检测算法。首先将输入图片输入优化过的特征提取算法,通过级联角池化与中心池化,输出3个关键点的热力图与它们的嵌入向量;然后通过嵌入向量匹配热力图并画出检测框。文中的创新点在于将SqueezeNet中的轻量级模块firemodule适配至CenterNet,并用深度可分离卷积代替主干网的常规卷积,同时,针对CenterNet的检测框误匹配问题优化了算法输出形式与训练时的损失函数。实验结果表明,改良后的算法使得原有的CenterNet算法模型尺寸缩小为原来的1/7,同时检测精度与速度较YOLOv3,CornerNet-Lite等相同量级的算法仍有所提高。
中图分类号:
[1]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[J].arXiv:1804.02767. [2]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//IEEE Conference on Computer Vision & Pattern Recognition.2017:6517-6525. [3]LAW H,DENG J.Cornernet:Detecting objects as paired key-points[C]//Proceedings of the European Conference on Computer Vision.2018:734-750. [4]DUAN K W,BAI S,XIE L X.CenterNet:Keypoint Triplets for Object Detection[C]//Proceedings of the European Conference on Computer Vision.2019. [5]REN S Q,HE K M,GIRSHICK R.Faster R-CNN:TowardsReal-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149. [6]GIRSHICK R.Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV).2015:1440-1448. [7]HE K,GKIOXARI G,DOLLAR P,et al.Mask r-cnn[C]//Pro-ceedings of the IEEE International Conference on Computer Vision.2017:2961-2969. [8]ZHOU X,WEI G,FU W L,et al.Application of deep learning in object detection[C]//2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS).IEEE,2017. [9]MURPHY K P.Object detection and localization using local and global features[J].Toward Category Level Object Recognition,2006,12(1):382-400. [10]WANG W,SHEN J,SHAO L.Video Salient Object Detectionvia Fully Convolutional Networks[J].IEEE Transactions on Image Processing,2017,27(1):38-49. [11]YANG J,LIU Q S,ZHANG K H.Stacked Hourglass Network for Robust Facial Landmark Localisation[C]//IEEE Conference on Computer Vision & Pattern Recognition Workshops.IEEE Computer Society,2017. [12]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587. [13]XIAO B,WU H,WEI Y.Simple baselines for human pose estimation and tracking[C]//ECCV.2018:472-478. [14]SANDLER M,HOWARD A,ZHU M L,et al.MobileNetV2:Inverted Residuals and Linear Bottlenecks [C]//IEEEConfe-rence on Computer Vision and Pattern Recognition (CVPR).2018:4510-4520. [15]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.Squeeze-Net:AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size[J].arXiv,1602.07360. [16]SZEGEDY C,LIU W,JIA Y,et al.Going Deeper with Convolutions[J].arXiv:1409.4842. [17]HUANG G,LIU Z,LAURENS V D M,et al.Densely Connec-ted Convolutional Networks[J].arXiv:1608.06993,2016. [18]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//CVPR.2016. [19]HE K,ZHANG X,REN S,et al.Identity Mappings in Deep Residual Networks[C]//European Conference on Computer Vision.Cham:Springer,2016. [20]LAW H,TENG Y,RUSSAKOVSKY O,et al.CornerNet-Lite:Efficient Keypoint Based Object Detection[J].arXiv:1904.08900. |
[1] | 汪鸣, 彭舰, 黄飞虎. 基于多时间尺度时空图网络的交通流量预测模型 Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction 计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188 |
[2] | 刘冬梅, 徐洋, 吴泽彬, 刘倩, 宋斌, 韦志辉. 基于边框距离度量的增量目标检测方法 Incremental Object Detection Method Based on Border Distance Measurement 计算机科学, 2022, 49(8): 136-142. https://doi.org/10.11896/jsjkx.220100132 |
[3] | 王灿, 刘永坚, 解庆, 马艳春. 基于软标签和样本权重优化的Anchor Free目标检测算法 Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization 计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240 |
[4] | 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋. 改进Faster R-CNN的光学遥感飞机目标检测 Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN 计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121 |
[5] | 沈超, 何希平. 基于纹理特征增强和轻量级网络的人脸防伪算法 Face Anti-spoofing Algorithm Based on Texture Feature Enhancement and Light Neural Network 计算机科学, 2022, 49(6A): 390-396. https://doi.org/10.11896/jsjkx.210600217 |
[6] | 马宾, 付永康, 王春鹏, 李健, 王玉立. 基于GDIoU损失函数的YOLOv4绝缘子高效定位算法 High Performance Insulators Location Scheme Based on YOLOv4 with GDIoU Loss Function 计算机科学, 2022, 49(6A): 412-417. https://doi.org/10.11896/jsjkx.210600089 |
[7] | 陈永平, 朱建清, 谢懿, 吴含笑, 曾焕强. 基于外接圆半径差损失的实时安全帽检测算法 Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss 计算机科学, 2022, 49(6A): 424-428. https://doi.org/10.11896/jsjkx.220100252 |
[8] | 李健智, 王红玲, 王中卿. 基于图卷积网络的专利摘要自动生成研究 Automatic Generation of Patent Summarization Based on Graph Convolution Network 计算机科学, 2022, 49(6A): 172-177. https://doi.org/10.11896/jsjkx.210400117 |
[9] | 陈佳舟, 赵熠波, 徐阳辉, 马骥, 金灵枫, 秦绪佳. 三维城市场景中的小物体检测 Small Object Detection in 3D Urban Scenes 计算机科学, 2022, 49(6): 238-244. https://doi.org/10.11896/jsjkx.210400174 |
[10] | 赵小虎, 叶圣, 李晓. 多算法融合的骨骼重建信息动作分类方法 Multi-algorithm Fusion Behavior Classification Method for Body Bone Information Reconstruction 计算机科学, 2022, 49(6): 269-275. https://doi.org/10.11896/jsjkx.210500070 |
[11] | 胡伏原, 万新军, 沈鸣飞, 徐江浪, 姚睿, 陶重犇. 深度卷积神经网络图像实例分割方法研究进展 Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network 计算机科学, 2022, 49(5): 10-24. https://doi.org/10.11896/jsjkx.210200038 |
[12] | 徐涛, 陈奕仁, 吕宗磊. 基于改进YOLOv3的机坪工作人员反光背心检测研究 Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm 计算机科学, 2022, 49(4): 239-246. https://doi.org/10.11896/jsjkx.210200119 |
[13] | 周海榆, 张道强. 面向多中心数据的超图卷积神经网络及应用 Multi-site Hyper-graph Convolutional Neural Networks and Application 计算机科学, 2022, 49(3): 129-133. https://doi.org/10.11896/jsjkx.201100152 |
[14] | 赵越, 余志斌, 李永春. 基于互注意力指导的孪生跟踪算法 Cross-attention Guided Siamese Network Object Tracking Algorithm 计算机科学, 2022, 49(3): 163-169. https://doi.org/10.11896/jsjkx.210300066 |
[15] | 张侣, 周博文, 吴亮红. 基于改进卷积注意力模块与残差结构的SSD网络 SSD Network Based on Improved Convolutional Attention Module and Residual Structure 计算机科学, 2022, 49(3): 211-217. https://doi.org/10.11896/jsjkx.201200019 |
|