Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 210900220-6.doi: 10.11896/jsjkx.210900220

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Improved FCOS Target Detection Algorithm

CHEN Jin-ling, CHENG Mao-kai, XU Zi-han   

  1. School of Electrical Information,Southwest Petroleum University,Chengdu 610500,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:CHEN Jin-ling,born in 1975,Ph.D,professorate senior engineer.His main research interests include deep learning and image processing.
  • Supported by:
    Sichuan Provincial Key R & D Plan(Major Science and Technology Project)(2022YFS0020)and Nanchong City-School Science and Technology Strategic Cooperation Project(22SXQT0292).

Abstract: An enhanced FCOS object detection algorithm is proposed to address the problems that the classical anchorless frame object detection algorithm FCOS(fully constitutional one-stage object detection) has difficulty in extracting target information,insufficient ability to combine location and content information,and weak performance due to insufficient differentiation between positive and negative sample.The method first adds a deformable convolution module and a global attention module to the ResNet50 feature extraction network to improve the feature information capture capability.Then,the FPN feature pyramid is combined with the deep link layer to form a multi-scale feature fusion module to improve the feature extraction effect.Finally,the adaptive division of positive and negative samples module is added to enhance the accuracy of the test frame to achieve the effect of improving the regression accuracy.In order to test the detection effect of the algorithm,the COCO dataset and VOC dataset are used for experiments.Compared with the original FCOS algorithm,the average accuracy of the proposed algorithm on the two datasets improves by 2.3% and 1.8%,respectively.Among them,there is a significant improvement for the detection of small targets in the COCO dataset.

Key words: Target detection, Deformable convolution, Global attention, Multi-scale features, Feature pyramid, Positive and negative samples

CLC Number: 

  • TP391.41
[1]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE,2017:6517-6525.
[2]REDMON J,FARHADI A.YOLOv3:An Incremental Improvement[J].arXiv:1804.02767,2018.
[3]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single ShotMultiBox Detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[4]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal Loss for Dense Object Detection[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2020,42(2):318-327.
[5]REN S Q,HE K M,GIRSHICK,et al.FASTER R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[6]TIAN Z,SHEN C,CHEN H,et al.FCOS:A simple and strong anchor-free object detector[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,44(4):1922-1933.
[7]ZHOU X,WANG D,KRHENBÜHL P.Objects as Points[J].arXiv:1904.07850,2019.
[8]LAW H,DENG J.CornerNet:Detecting Objects as Paired Keypoints[J].International Journal of Computer Vision,2020,128(3):642-656.
[9]YANG Z,LIU S,HU H,et al.RepPoints:Point Set Representation for Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision(ICCV).IEEE,2019.
[10]LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,39(4):640-651.
[11]LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature Pyramid Networks for Object Detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).IEEE Computer Society,2017.
[12]SRINIVAS A,LIN T Y,PARMAR N,et al.Bottleneck Transformers for Visual Recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:16519-16529.
[13]HE K,ZHANG X,REN S,et al.Deep Residual Learning forImage Recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[14]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Proceedings of the 31st International Confe-rence on Neural Information Processing Systems(NIPS’17).2017:6000-6010.
[15]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[16]LIN T Y,MAIRE M,BELONGIES,et al.Microsoft COCO:common objects in context [C]//European Conference on Computer Vision(ECCV).Cham:Springer,2014:745-755.
[17]EVERINGHAM M,VAN G L,WILLIAMS C K I,et al.Thepascal visual object classes(VOC) challenge [J].International Journal of Computer Vision,2010 88(2):303-338.
[18]KANTOR P B.Foundations of Statistical Natural LanguageProcessing[J].Information Retrieval,2001,4(1):80-81.
[19]CARION N,MASSA F,SYNNAEVE G,et al.End-to-End Object Detection with Transformers[C]//European Conference on Computer Vision.Cham:Springer,2020:213-229.
[20]DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.AnImage is Worth 16x16 Words:Transformers for Image Recognition at Scale[J].arXiv:2010.11929,2020.
[21]BA J L,KIROS J R,HINTON G E.Layer Normalization[J].arXiv:1607.06450,2016.
[22]LOFFE S,SZEGEDY C.Batch normalization:Accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning.PMLR,2015:448-456.
[23]LECUN Y,BENGIO Y.Convolutional networks for images,speech,and time series[M]//The Handbook of Brain Theory and Neural Networks.MIT press,1998:255-258.
[24]DAI J,QI H,XIONG Y,et al.Deformable Convolutional Networks[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:764-773.
[25]ZHANG S,CHI C,YAO Y,et al.Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection[C]//2020 IEEE/CVF Conference on Compu-ter Vision and Pattern Recognition(CVPR).IEEE,2020.
[26]FANG L P,HE H J,ZHOU G M.A review of target detection algorithm research[J].Computer Engineering and Applications,2018,54(13):11-18,33.
[1] SUN Fu-quan, CUI Zhi-qing, ZOU Peng, ZHANG Kun. Brain Tumor Segmentation Algorithm Based on Multi-scale Features [J]. Computer Science, 2022, 49(6A): 12-16.
[2] ZHU Wen-tao, LAN Xian-chao, LUO Huan-lin, YUE Bing, WANG Yang. Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN [J]. Computer Science, 2022, 49(6A): 378-383.
[3] ZHANG Lyu, ZHOU Bo-wen, WU Liang-hong. SSD Network Based on Improved Convolutional Attention Module and Residual Structure [J]. Computer Science, 2022, 49(3): 211-217.
[4] SHAO Hai-lin, JI Yi, LIU Chun-ping, XU Yun-long. Scene Text Detection Algorithm Based on Enhanced Feature Pyramid Network [J]. Computer Science, 2022, 49(2): 248-255.
[5] GU Xi-long, GONG Ning-sheng, HU Qian-sheng. Multi-label Vehicle Real-time Recognition Algorithm Based on YOLOv3 and Improved VGGNet [J]. Computer Science, 2022, 49(11A): 210600142-7.
[6] HE Xiao-hui, QIU Fang-bing, CHENG Xi-jie, TIAN Zhi-hui, ZHOU Guang-sheng. High-resolution Image Building Target Detection Based on Edge Feature Fusion [J]. Computer Science, 2021, 48(9): 140-145.
[7] WANG Dong, ZHOU Da-ke, HUANG You-da , YANG Xin. Multi-scale Multi-granularity Feature for Pedestrian Re-identification [J]. Computer Science, 2021, 48(7): 238-244.
[8] XU Ze, SHUAI Ren-jun, LIU Kai-kai, MA Li, WU Meng-lin. Generation of Realistic Image from Text Based on Feature Fusion [J]. Computer Science, 2021, 48(6): 125-130.
[9] XU Ying-kun, MA Fang-nan, YANG Xu-hua, YE Lei. Attribute Network Representation Learning Based on Global Attention [J]. Computer Science, 2021, 48(12): 188-194.
[10] YUAN Xiao-pei, CHEN Xiao-feng, LIAN Ming. Improved Multi-feature Fusion Algorithm for Target Detection Based on Haar-like and LBP [J]. Computer Science, 2021, 48(11): 219-225.
[11] JIN Yu-fang, WU Xiang, DONG Hui, YU Li, ZHANG Wen-an. Improved YOLO v4 Algorithm for Safety Helmet Wearing Detection [J]. Computer Science, 2021, 48(11): 268-275.
[12] YAO Lan, ZHAO Yong-heng, SHI Yu-qing, YU Ming-he. Highway Abnormal Event Detection Algorithm Based on Video Analysis [J]. Computer Science, 2020, 47(8): 208-212.
[13] ZHANG Man, LI Jie, DING Rong-li, CHENG Hao-tian and SHEN Ji. Remote Sensing Image ObJect Detection Technology Based on Improved YOLO-V2 Algorithm [J]. Computer Science, 2020, 47(6A): 176-180.
[14] LI Lin, ZHAO Kai-yue, ZHAO Xiao-yong, WEI Shuai-qin and ZHANG Bing. Contaminated and Shielded Number Plate Recognition Based on Convolutional Neural Network [J]. Computer Science, 2020, 47(6A): 213-219.
[15] CAI Yu-xin, TANG Zhi-wei, ZHAO Bo, YANG Ming and WU Yu-fei. Accelerated Software System Based on Embedded Multicore DSP [J]. Computer Science, 2020, 47(6A): 622-625.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!