Computer Science ›› 2021, Vol. 48 ›› Issue (10): 226-232.doi: 10.11896/jsjkx.210100058

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Camouflaged Object Detection Based on Improved YOLO v5 Algorithm

WANG Yang1, CAO Tie-yong1, YANG Ji-bin1, ZHENG Yun-fei1,2,3, FANG Zheng1, DENG Xiao-tong1, WU Jing-wei1, LIN Jia4   

  1. 1 Insitute of Command and Control Engineering,Army Engineering University of PLA,Nanjing 210007,China
    2 The Army Artillery and Defense Academy of PLA,Nanjing 211100,China
    3 The Key Laboratory of Polarization Imaging Detection Technology,Hefei 230031,China
    4 Shandong Military Region,Ji'nan 250000,China
  • Received:2021-01-20 Revised:2021-05-08 Online:2021-10-15 Published:2021-10-18
  • About author:WANG Yang,born in 1996,postgra-duate.His main research interests include object detection and image processing.
    CAO Tie-yong,born in 1970,Ph.D,professor,Ph.D supervisor.His main research interests include computer vision and image processing.
  • Supported by:
    National Science Fund for Distinguished Young Scholars of China(61801512),National Natural Science Foundation of China(62071484) and Natural Science Foundation of Jiangsu Province(BK20180080).

Abstract: Since the camouflage object is highly similar to the surrounding environment with a rather small size,the general detection algorithm is not fully applicable to the camouflaged object detection task,which makes the detection of camouflaged object more challenging than the general detection task.In order to solve this problem,the existing methods are analyzed in this paper and a detection algorithm for camouflage object is proposed based on the YOLO v5 algorithm.A new feature extraction network combined with attention mechanism is designed to highlight the feature information of the camouflage target.The original path aggregation network is improved so that the high,middle and lowly level feature map information is fully fused.The semantic information of the target is strengthened by nonlinear pool module,and the detection feature map size is increased to improve the detection recall rate of the small size target.On a public camouflage target dataset,the proposed algorithm is tested with 7 algorithms.The mAP of the proposed algorithm is 4.4% higher than that of the original algorithm,while the recall rate has improved 2.8%,which verifies the effectiveness of the algorithm for camouflaged object detection and the great advantage in accuracy compared with other algorithms.

Key words: Aggregation network, Attention mechanism, Camouflaged object, Object detection, YOLO

CLC Number: 

  • TP751
[1]BHAJANTRI N U,NAGABHUSHAN P.CAmouflage DefectIdentification:A Novel Approach[C]//9th International Confe-rence on Information Technology.Bhubaneswar:IEEE Press,2007:145-148.
[2]TANKUS A,YESHURUN Y.Convexity-Based Visual Camouflage Breaking[J].Computer Vision and Image Understanding,2001,82(3):208-237.
[3]SENGOTTUVELAN P,WAHI A,SHANMUGAM A.Performanceof Decamouflaging Through Exploratory Image Analysis[C]//2008 First International Conference on EmergingTrends in Engineering & Technology.Nagpur:IEEE Press,2008:6-10.
[4]REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[5]CAIZ W,VASCONCELOS N.Cascade R-CNN:Delving intoHigh Quality Object De-tection[C]//2018 IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.Salt Lake:IEEE Press,2018:6154-6162.
[6]PANG M J,CHEN K,SHI J P,et al.Libra R-CNN:Towards Balanced Learning for Object Detection[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE Press,2019:821-830.
[7]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single Shot MultiBox Detector[C]//2016 European Conference on Compu-ter Vision.Amsterdam:Springer,Cham,2016:21-37.
[8]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Press,2017:6517-6525.
[9]XU S K,NI C H,JI C C,et al.Image Caption of Safety Helmets Wearing in Construction Scene Based on YOLOv3[J].Computer Science,2020,47(8):233-240.
[10]LIN S Y,GOYAL P,GIRSHICK R,et al.Focal Loss for Dense Object Detection[J].IEEE Trans-actions on Pattern Analysis and Machine Intelligence,2020,42(2):318-327.
[11]TIAN Z,SHEN C H,CHEN H,et al.FCOS:Fully Convolutio-nal One-Stage Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV).Seoul:IEEE Press,2019:9626-9635.
[12]YANG Z,LIU S H,HU H,et al.RepPoints:Point Set Representation for Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV).Seoul:IEEE Press,2019:9656-9665.
[13]ZHOU X Y,WANG D Q,KRÄHENBÜHL P.Object as Points [EL/OB].https://arxiv.org/pdf/1903.07850.
[14]HU J,SHEN L,ALBANIE S,et al.Squeeze-and-Excitation Networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(8):2011-2023.
[15]MISRA D,NALAMADA T,ARASANIPALAI A U,et al.Rotate to Attend:Convolutional Triplet Attention Module [EL/OB].https://arxiv.org/pdf/2010.03045.
[16]WANG C Y,LIAO H Y,WUY H,et al.CSPNet:A New Backbone that can Enhance Learning Capability of CNN[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).Seattle:IEEE Press,2020:1571-1580.
[17]LIU S,QI L,QIN H F,et al.Path Aggregation Network for Instance Segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake:IEEE Press,2018:8759-8768.
[18]ZHENG Y F,ZHANG X W,WANG F,et al.Detection of People with Camouflage Pattern Via Dense Deconvolution Network[J].IEEE Signal Processing Letters,2019,26(1):29-33.
[19]CHEN K,WANG J Q,PANG J M,et al.MMDetection:Open MMLab Detection Tool-box and Benchmark [EL/OB].https://arxiv.org/pdf/1906.07155.
[1] ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[2] DAI Yu, XU Lin-feng. Cross-image Text Reading Method Based on Text Line Matching [J]. Computer Science, 2022, 49(9): 139-145.
[3] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[4] XIONG Li-qin, CAO Lei, LAI Jun, CHEN Xi-liang. Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization [J]. Computer Science, 2022, 49(9): 172-182.
[5] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[6] WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[7] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[8] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[9] LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[10] WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[11] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[12] YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[13] JIN Fang-yan, WANG Xiu-li. Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM [J]. Computer Science, 2022, 49(7): 179-186.
[14] XIONG Luo-geng, ZHENG Shang, ZOU Hai-tao, YU Hua-long, GAO Shang. Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism [J]. Computer Science, 2022, 49(7): 212-219.
[15] PENG Shuang, WU Jiang-jiang, CHEN Hao, DU Chun, LI Jun. Satellite Onboard Observation Task Planning Based on Attention Neural Network [J]. Computer Science, 2022, 49(7): 242-247.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!