计算机科学 ›› 2024, Vol. 51 ›› Issue (6): 264-271.doi: 10.11896/jsjkx.230300222
刘家森, 黄俊
LIU Jiasen, HUANG Jun
摘要: 针对Swin Transformer在提取局部特征信息和特征表达能力上存在的不足,提出了一种基于改进Swin Transformer的中心点目标检测算法,以提高其在目标检测方面的性能。通过调整网络结构和引入反卷积模块来增强网络对局部特征信息的提取能力,利用自适应二维高斯核和回归头模块检测目标中心点来增强特征表达能力,并在Swin Transformer block模块中加入dropout激活函数,以缓解网络过拟合问题。在Pascal VOC和MS COCO 2017数据集上分别对改进后的算法进行验证,实验结果表明,改进后的Swin Transformer算法在Pascal VOC数据集上的精确度达到了81.1%,在MS COCO数据集上的精确度达到了37.2%,明显优于其他主流目标检测算法。
中图分类号:
[1]CHEN K Q,ZHU Z L,DENG X M,et al.Deep learning for Multi-Scale Object Detection:A survery[J].Journal of Software,2021,32(4):1201-1227. [2]BAO S M,WANG S Q.Overview of Object Detection Algorithms Based on Deep Learning[J].Transducer and Microsystem Technologies,2022,41(4):5-9. [3]HAN C,GAO G,ZHANG Y.Real time small traffic sign detection with revised faster-RCNN[J].Multimedia Tools and Applications,2018,7(10):13263-13278. [4]REDMON J,FARHADI A.YOLOv3:An inc-remental improvement[J].arXiv:1804.02767,2018. [5]LI X J,DENG Y M,CHENG Z H,et al.Improved YOLOv5 algorithm for airport runway foreign object detection[J].Computer Engineering and Applications,2023,59(2):202-211. [6]TIAN Z,SHEN C H,CHEN H,et al.FCOS fully convolutionalone-stage object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington USA,2019:9626-9635. [7]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isAll You Need[J].arXiv:1706.03762,2017. [8]DOSOVITSKIY A,BEYER L,KOESNIKOV A,et al.An Image is Worth 16×16 Words:Transformers for Image Recognition at Scale[C]//International Conference on Learning Representations.Online:ICLR,2021:3-7. [9]LIU Z,LIN Y T,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE International Conference on Computer Vision. Montreal.Canada,2021:11-18. [10]FU C Y,LIU W,RANGA A,et al.DSSD:Deconvolutional Single Shot Detector[J].arXiv:1701.06659,2017. [11]ZHOU Y,LIU Y,LU J,et al.DIT:A Deformation InvariantTransformer Network for Unsupervised Keypoint Discovery and Description[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2020:12630-12639. [12]ZHOU X,WANG D,KRÄHENBÜL P.Objects as Points[J].arXiv:1904.07850,2019. [13]HINTON G E,SRIVASTAVA N,KRIZHEVSKY A,et al.Improving neural networks by preventing co-adaptation of feature detectors[J].arXiv:1207.0580,2012. [14]WANG C,LIU Y J,XIE Q,et al.Anchor Free object detection algorithm based on soft labeland sample weight optimization[J].Computer Science,2022,49(8):157-164. [15]LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft COCO:common objects in context[C]//Proceedings of Conference on Computer Vision.Berlin,Germany,2014:740-755. [16]LIU W,ANGUELOV D,ERHAND,et al.SSD:Single shotmultibox detector[C]//Computer Vision-SCCV 2016.Amsterdam,2016:21-37. [17]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal Loss for Dense Object Detection[C]//Proceedings of Conference on Computer Vision.Venice,2017:2980-2988. |
|