Computer Science ›› 2023, Vol. 50 ›› Issue (11A): 221100247-9.doi: 10.11896/jsjkx.221100247

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Study on Scale Adaptive Target Detection Algorithm Based on Improved D2Det

WANG Ling, HUANG Guan, WANG Peng, BAI Yane, QIU Tianheng   

  1. School of Computer Science and Technology,Changchun University of Science and Technology,Changchun 130022,China
  • Published:2023-11-09
  • About author:WANG Ling,born in 1979,Ph.D.Her main research interests include machine learning and image processing.
  • Supported by:
    Jilin Provincial Basic Research Project of the Central Leading local Science and Technology Development Fund(202002038JC).

Abstract: Aiming at the problem that D2Det(Towards High Quality Object Detection and Instance Segmentation) has poor detection effect and large parameter quantity in the face of scale change targets and small targets,this paper proposes a scale adaptive target detection model G-SAD2Det based on D2Det.Firstly,in the data preprocessing stage,the data enhancement algorithms CutOut and Mosaic are introduced,and the model has good robustness when dealing with complex scenes.Secondly,the feature extraction network ResNet is improved,the multi-scale feature extraction structure is built in each residual block,and the target features are better extracted from the fine-grained level.At the same time,the switchable global context semantic feature extraction module is added to the network structure,and the salience features and global context semantic information are enhanced through different pooling layers.Then,the candidate frame generation module is improved,and the center area of the self-locating target is used to guide the generation of the candidate frame,so that the adaptive ability of the algorithm to the scaling target can be enhanced.Finally,replacing ordinary convolution with Ghost convolution to reduce the amount of network parameters and computation.VOC data set and COCO sub-data set are used to verify the effectiveness of the algorithm,the mAP@0.5 value of G-SAD2Det increases by 3.6% and 4.9% respectively,compared with D2Det in the two data sets.The number of model parameters reduces by 27.42% and the amount of calculation reduces by 35.96%.It is proved that the improved algorithm not only improves the accuracy,but also reduces the amount of computation.

Key words: Object detection, Scale adaptive, Multi-scale feature extraction, Residual element, Regional guidance candidate box

CLC Number: 

  • TP391.41
[1]LOU G X,SHI H Z.Face image recognition based on convolu-tional neural network[J].China Communications,2020,17(2):117-124.
[2]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarc hies for accurate object detection and semantic segment ation[C]//Proceedings of the IEEE Conference on Com Puter Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2014:580-587.
[3]HE K M,ZHANG X Y,REN S Q,et al.Deep residual learningfor image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2016:770-778.
[4]CAI Z W,VASCONCELOS N.Cascade r-cnn:Delving into high quality object detecti on[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.Piscata way,NJ:IEEE Press,2018:6154-6162.
[5]LUO H L,CHEN H K.A Survey oftarget detection based ondeep learning[J].Acta Electronica Sinica,2020,48(6):1230-1239.
[6]ZHAO Y Q,JIA J L,GONG W J,et al.Multi scale aerial image target detection algorithm based on pro-YOLOv4[J].Computer Engineering & Science,2021,38(11):3466-3471.
[7]ZHANG R M,BI L J,WANG F B,et al.Target detection algorithm based on multi-scale feature fusion and adaptive anchor frame[J].Laser & Optoelectronics Progress,2022,59(12):420-429.
[8]LIU F,HAN X.Adaptive aerial target detection based on multi-scale depth learning[J].Acta Aeronautica et Astronautica Sinica,2022,43(5):471-482.
[9]LI Y Z,LIU H Z.Object Detection Based on Neighbour Feature Fusion[J].Computer Science,2021,48(12):264-268.
[10]CAO J L,CHOLAKKAL H,ANWER R M,et al.D2det:To-wards high quality object detection and instances segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2020:11485-11494.
[11]LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Pisca-taway,NJ:IEEE Press,2017:2117-2125.
[12]ZHANG L,ZHOU B W,WU L H.SSD Network Based on Improved Convolutional Attention Module and Residual Structure[J].Computer Science,2022,49(3):211-217.
[13]YU X Y,WU S Y,LU X Q,et al.Adaptive multiscale feature for object detection[J].Neurocomputing,2021,449:146-158.
[14]ZENG N Y,WU P S,WANG Z D,et al.A smalll-sized object detection oriented multi-scale feature fusion approach with application to defect detection[J].IEEE Transactions on Instrumentation and Measurement,2022,71:1-14.
[15]DEVRIES T,TAYLOR G W.Improved regul arization of con-volutional neural networks with cutout[J].arXiv:1708.04552,2017.
[16]BOCHKOVSKIY A,WANG C Y,LIAO H Y M.Yolov4:Optimal speed and accuracy of object detection[J].arXiv:2004.10934,2020.
[17]GAO S H,CHENG M M,ZHAO K,et al.Res2net:A newmulti-scale bac kbone architecture[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,43(2):652-662.
[18]HAN K,WANG Y,TIAN Q,et al.Ghostnet:More featuresfrom cheap operations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:1580-1589.
[19]WANG J Q,CHEN K,YANG S,et al.Region proposal byguided anchoring[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2019:2965-2974.
[20]HE Y H,ZHU C H,WANG J R,et al.Bounding box regression with uncertainty for accurate object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataw ay,NJ:IEEE Press,2019:2888-2897.
[21]LU J,LIN W,CHEN P,et al.Research on Lightweight Citrus Flowering Rate Statistical Model Combined with Anchor Frame Clustering Optimization[J].Sensors,2021,21(23):7929.
[22]ZHANG H,CISSE M,DAUPHIN Y N,et al.mixup:Beyondempirical risk minimization[J].arXiv:1710.09412,2017.
[23]YUN S,HAN D,OH S J,et al.Cutmix:Regu larization strategy to train strong classifiers with localizable features[C]//Procee-dings of the IEEE/CVF International Conference on Computer Vision.2019:6023-6032.
[24]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2018:7132-7141.
[25]WOO S,PARK J,LEE J Y,et al.Cbam:Con volutional block at-tention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).Berlin:Springer,2018:3-19.
[26]BAO Y X,LU T L,DU Y H,et al.Deepfake Videos Detection Method Based on iResNet34 Model and Data Aug mentation[J].Computer Science,2021,48(7):77-85.
[27]NAJIBI M,SINGH B,DAVIS L S.Fa-rpn:Floating region prop osals for face detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:7723-7732.
[28]DAI J F,QI H Z,XIONG Y W,et al.Deformable convolutional networks[C]//Proceedings of the IEEE International Confe-rence on Computer Vision.Piscataway,NJ:IEEE Press,2017:764-773.
[29]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for denseobject detection[C]//Proceedings of the IEEE International Conference on Computer Vision.Piscataway,NJ:IEEE Press,2017:2980-2988.
[30]LIU W,ANGUELOV D,ERHAN D,et al.SSd:Single shot mul tibox detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[31]LU X,LI B Y,YUE Y X,et al.Grid r-cnn[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2019:7363-7372.
[32]PANG J M,CHEN K,SHI J P,et al.Libra r-cnn:Towards ba-lanced learning for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2019:821-830.
[33]CHEN Q,WANG Y M,YANG T,et al.You only look one-level feature[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2021:13039-13048.
[34]QIAO S Y,CHEN L C,YUILLE A.Detectors:Detecting objects with recursive feature pyramid and switchable atrous convolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE Press,2021:10213-10224.
[35]DUAN K W,BAI S,XIE L X,et al.Centernet:Keypoint triplets for object dete ction[C]//Proceedings of the IEEE/CVF Internati ONAL Conference on Computer Vision.Piscataway,NJ:IEEE Press,2019:6569-6578.
[1] YANG Yi, SHEN Sheng, DOU Zhiyang, LI Yuan, HAN Zhenjun. Tiny Person Detection for Intelligent Video Surveillance [J]. Computer Science, 2023, 50(9): 75-81.
[2] ZHU Ye, HAO Yingguang, WANG Hongyu. Deep Learning Based Salient Object Detection in Infrared Video [J]. Computer Science, 2023, 50(9): 227-234.
[3] LIU Yubo, GUO Bin, MA Ke, QIU Chen, LIU Sicong. Design of Visual Context-driven Interactive Bot System [J]. Computer Science, 2023, 50(9): 260-268.
[4] WANG Xu, WU Yanxia, ZHANG Xue, HONG Ruize, LI Guangsheng. Survey of Rotating Object Detection Research in Computer Vision [J]. Computer Science, 2023, 50(8): 79-92.
[5] HUO Weile, JING Tao, REN Shuang. Review of 3D Object Detection for Autonomous Driving [J]. Computer Science, 2023, 50(7): 107-118.
[6] QI Xuanlong, CHEN Hongyang, ZHAO Wenbing, ZHAO Di, GAO Jingyang. Study on BGA Packaging Void Rate Detection Based on Active Learning and U-Net++ Segmentation [J]. Computer Science, 2023, 50(6A): 220200092-6.
[7] WANG Guogang, WU Yan, LIU Yibo. Target Detection Algorithm Based on Compound Scaling Deep Iterative CNN by RegressionConverging and Scaling Mixture [J]. Computer Science, 2023, 50(6A): 220500230-9.
[8] WU Liuchen, ZHANG Hui, LIU Jiaxuan, ZHAO Chenyang. Defect Detection of Transmission Line Bolt Based on Region Attention Mechanism andMulti-scale Feature Fusion [J]. Computer Science, 2023, 50(6A): 220200096-7.
[9] DOU Zhi, HU Chenguang, LIANG Jingyi, ZHENG Liming, LIU Guoqi. Lightweight Target Detection Algorithm Based on Improved Yolov4-tiny [J]. Computer Science, 2023, 50(6A): 220700006-7.
[10] JIA Tianhao, PENG Li. SSD Object Detection Algorithm with Residual Learning and Cyclic Attention [J]. Computer Science, 2023, 50(5): 170-176.
[11] WU Han, NIE Jiahao, ZHANG Zhaowei, HE Zhiwei, GAO Mingyu. Deep Learning-based Visual Multiple Object Tracking:A Review [J]. Computer Science, 2023, 50(4): 77-87.
[12] ZHANG Weiliang, CHEN Xiuhong. SSD Object Detection Algorithm with Cross-layer Fusion and Receptive Field Amplification [J]. Computer Science, 2023, 50(3): 231-237.
[13] CHEN Liang, WANG Lu, LI Shengchun, LIU Changhong. Study on Visual Dashboard Generation Technology Based on Deep Learning [J]. Computer Science, 2023, 50(3): 238-245.
[14] HUA Jie, LIU Xueliang, ZHAO Ye. Few-shot Object Detection Based on Feature Fusion [J]. Computer Science, 2023, 50(2): 209-213.
[15] SHANG Di, LYU Yanfeng, QIAO Hong. Incremental Object Detection Inspired by Memory Mechanisms in Brain [J]. Computer Science, 2023, 50(2): 267-274.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!