计算机科学 ›› 2021, Vol. 48 ›› Issue (8): 106-110.doi: 10.11896/jsjkx.200700161

• 计算机图形学& 多媒体 • 上一篇    下一篇

基于关键点检测的无锚框轻量级目标检测算法

龚浩田, 张萌   

  1. 东南大学国家ASIC工程中心 南京210096
  • 收稿日期:2020-07-26 修回日期:2020-12-01 发布日期:2021-08-10
  • 通讯作者: 张萌(zmeng@seu .edu.cn)

Lightweight Anchor-free Object Detection Algorithm Based on Keypoint Detection

GONG Hao-tian, ZHANG Meng   

  1. National ASIC Engineering Center,Southeast University,Nanjing 210096,China
  • Received:2020-07-26 Revised:2020-12-01 Published:2021-08-10
  • About author:GONG Hao-tian,born in 1996,postgraduate.His main research interests include deep learning and computer vision.(220184705@seu.edu.cn)ZHANG Meng,born in 1964,Ph.D,associate professor,Ph.D supervisor.His main research interests include deep learning,machine learning,digital signal processing,digital communication systems,wireless sensor networks,digital integrated circuit design,information security and assurance,etc.

摘要: 针对基于关键点的目标检测参数量大、检测框误匹配的问题,提出一种轻量级的基于关键点检测的无锚框目标检测算法。首先将输入图片输入优化过的特征提取算法,通过级联角池化与中心池化,输出3个关键点的热力图与它们的嵌入向量;然后通过嵌入向量匹配热力图并画出检测框。文中的创新点在于将SqueezeNet中的轻量级模块firemodule适配至CenterNet,并用深度可分离卷积代替主干网的常规卷积,同时,针对CenterNet的检测框误匹配问题优化了算法输出形式与训练时的损失函数。实验结果表明,改良后的算法使得原有的CenterNet算法模型尺寸缩小为原来的1/7,同时检测精度与速度较YOLOv3,CornerNet-Lite等相同量级的算法仍有所提高。

关键词: 关键点, 卷积网络, 目标检测, 轻量级, 无锚框

Abstract: According to the large number of parameters of key-point object detection network and the problem of mismatching of bounding box,this paper proposes a lightweight key point anchor-free object detection algorithm.It inputs the image into the improved hourglass network to extract features,through the cascade corner pooling module and center pooling module,outputs three key points heatmap and their embedding vectors.At last,it matchs the key points by embedding vectors and draw the bounding box.The innovation of this paper is to applying the firemodule of SqueezeNet in the CenterNet object detection network,and replace the conventional convolution in the backbone with the depth separable convolution.At the same time,aiming at the mismatching bounding box problem in CenterNet,this algorithm adjusts the network's output and loss function.Experiment results show that the model size is reduced to 1/7 of CenterNet,while the accuracy and inference speed are still higher than the same size target detection algorithm like YOLOv3 and CornerNet-Lite.

Key words: Anchor-free, Convolution network, Key point, Lightweight, Object detection

中图分类号: 

  • TP391
[1]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[J].arXiv:1804.02767.
[2]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//IEEE Conference on Computer Vision & Pattern Recognition.2017:6517-6525.
[3]LAW H,DENG J.Cornernet:Detecting objects as paired key-points[C]//Proceedings of the European Conference on Computer Vision.2018:734-750.
[4]DUAN K W,BAI S,XIE L X.CenterNet:Keypoint Triplets for Object Detection[C]//Proceedings of the European Conference on Computer Vision.2019.
[5]REN S Q,HE K M,GIRSHICK R.Faster R-CNN:TowardsReal-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[6]GIRSHICK R.Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV).2015:1440-1448.
[7]HE K,GKIOXARI G,DOLLAR P,et al.Mask r-cnn[C]//Pro-ceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[8]ZHOU X,WEI G,FU W L,et al.Application of deep learning in object detection[C]//2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS).IEEE,2017.
[9]MURPHY K P.Object detection and localization using local and global features[J].Toward Category Level Object Recognition,2006,12(1):382-400.
[10]WANG W,SHEN J,SHAO L.Video Salient Object Detectionvia Fully Convolutional Networks[J].IEEE Transactions on Image Processing,2017,27(1):38-49.
[11]YANG J,LIU Q S,ZHANG K H.Stacked Hourglass Network for Robust Facial Landmark Localisation[C]//IEEE Conference on Computer Vision & Pattern Recognition Workshops.IEEE Computer Society,2017.
[12]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[13]XIAO B,WU H,WEI Y.Simple baselines for human pose estimation and tracking[C]//ECCV.2018:472-478.
[14]SANDLER M,HOWARD A,ZHU M L,et al.MobileNetV2:Inverted Residuals and Linear Bottlenecks [C]//IEEEConfe-rence on Computer Vision and Pattern Recognition (CVPR).2018:4510-4520.
[15]IANDOLA F N,HAN S,MOSKEWICZ M W,et al.Squeeze-Net:AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size[J].arXiv,1602.07360.
[16]SZEGEDY C,LIU W,JIA Y,et al.Going Deeper with Convolutions[J].arXiv:1409.4842.
[17]HUANG G,LIU Z,LAURENS V D M,et al.Densely Connec-ted Convolutional Networks[J].arXiv:1608.06993,2016.
[18]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//CVPR.2016.
[19]HE K,ZHANG X,REN S,et al.Identity Mappings in Deep Residual Networks[C]//European Conference on Computer Vision.Cham:Springer,2016.
[20]LAW H,TENG Y,RUSSAKOVSKY O,et al.CornerNet-Lite:Efficient Keypoint Based Object Detection[J].arXiv:1904.08900.
[1] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[2] 刘冬梅, 徐洋, 吴泽彬, 刘倩, 宋斌, 韦志辉.
基于边框距离度量的增量目标检测方法
Incremental Object Detection Method Based on Border Distance Measurement
计算机科学, 2022, 49(8): 136-142. https://doi.org/10.11896/jsjkx.220100132
[3] 王灿, 刘永坚, 解庆, 马艳春.
基于软标签和样本权重优化的Anchor Free目标检测算法
Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization
计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240
[4] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
[5] 沈超, 何希平.
基于纹理特征增强和轻量级网络的人脸防伪算法
Face Anti-spoofing Algorithm Based on Texture Feature Enhancement and Light Neural Network
计算机科学, 2022, 49(6A): 390-396. https://doi.org/10.11896/jsjkx.210600217
[6] 马宾, 付永康, 王春鹏, 李健, 王玉立.
基于GDIoU损失函数的YOLOv4绝缘子高效定位算法
High Performance Insulators Location Scheme Based on YOLOv4 with GDIoU Loss Function
计算机科学, 2022, 49(6A): 412-417. https://doi.org/10.11896/jsjkx.210600089
[7] 陈永平, 朱建清, 谢懿, 吴含笑, 曾焕强.
基于外接圆半径差损失的实时安全帽检测算法
Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss
计算机科学, 2022, 49(6A): 424-428. https://doi.org/10.11896/jsjkx.220100252
[8] 李健智, 王红玲, 王中卿.
基于图卷积网络的专利摘要自动生成研究
Automatic Generation of Patent Summarization Based on Graph Convolution Network
计算机科学, 2022, 49(6A): 172-177. https://doi.org/10.11896/jsjkx.210400117
[9] 陈佳舟, 赵熠波, 徐阳辉, 马骥, 金灵枫, 秦绪佳.
三维城市场景中的小物体检测
Small Object Detection in 3D Urban Scenes
计算机科学, 2022, 49(6): 238-244. https://doi.org/10.11896/jsjkx.210400174
[10] 赵小虎, 叶圣, 李晓.
多算法融合的骨骼重建信息动作分类方法
Multi-algorithm Fusion Behavior Classification Method for Body Bone Information Reconstruction
计算机科学, 2022, 49(6): 269-275. https://doi.org/10.11896/jsjkx.210500070
[11] 胡伏原, 万新军, 沈鸣飞, 徐江浪, 姚睿, 陶重犇.
深度卷积神经网络图像实例分割方法研究进展
Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network
计算机科学, 2022, 49(5): 10-24. https://doi.org/10.11896/jsjkx.210200038
[12] 徐涛, 陈奕仁, 吕宗磊.
基于改进YOLOv3的机坪工作人员反光背心检测研究
Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm
计算机科学, 2022, 49(4): 239-246. https://doi.org/10.11896/jsjkx.210200119
[13] 周海榆, 张道强.
面向多中心数据的超图卷积神经网络及应用
Multi-site Hyper-graph Convolutional Neural Networks and Application
计算机科学, 2022, 49(3): 129-133. https://doi.org/10.11896/jsjkx.201100152
[14] 赵越, 余志斌, 李永春.
基于互注意力指导的孪生跟踪算法
Cross-attention Guided Siamese Network Object Tracking Algorithm
计算机科学, 2022, 49(3): 163-169. https://doi.org/10.11896/jsjkx.210300066
[15] 张侣, 周博文, 吴亮红.
基于改进卷积注意力模块与残差结构的SSD网络
SSD Network Based on Improved Convolutional Attention Module and Residual Structure
计算机科学, 2022, 49(3): 211-217. https://doi.org/10.11896/jsjkx.201200019
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!