Computer Science ›› 2021, Vol. 48 ›› Issue (12): 264-268.doi: 10.11896/jsjkx.201200196

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Object Detection Based on Neighbour Feature Fusion

LI Ya-ze, LIU Hong-zhe   

  1. Beijing Key Laboratory of Information Service Engineering,Beijing Union University,Beijing 100101,China
    College of Robotics,Beijing Union University,Beijing 100101,China
  • Received:2020-12-22 Revised:2021-06-08 Online:2021-12-15 Published:2021-11-26
  • About author:LI Ya-ze,born in 1991,postgraduate.His main research interests include computer vision and object detection.
    LIU Hong-zhe,born in 1971,Ph.D.Her main research interests include compu-ter vision,deep learning,media semantic computing,etc.
  • Supported by:
    National Natural Science Foundation of China(61871039,61906017,61802019),Beijing Municipal Commission of Education Project(KM202111417001,KM201911417001),Collaborative Innovation Center for Visual Intelligence(CYXC2011) and Academic Research Projects of Beijing Union University(ZK80202001,202011417004,202011417005).

Abstract: With the development of intelligent driving,the precision requirements for target detection are getting higher and higher,especially for small targets that are far away.In the neck of two-stage object detection network,although the feature fusion of semantic information and location information is more effective for large targets if the bottom-up fusion method is used,it will cause big information loss to small targets.To address this problem,we propose neighbor feature pyramid networks(NFPN) method of feature fusion of neighbor layers,the Double RoI(Region of Interest) method to fuse the FPN and NFPN features,and the recursive feature pyramicl(RFP) method.Using Faster RCNN 50 as the benchmark,the mean average precision(mAP) of our model in the Lisa data set has increased by 2.6% while using NFPN,Double RoI and RFP.On the VOC2007 data set,using the VOC07+12 train data set for training,VOC2007 test as the test set,and Faster RCNN101 as the baseline,the mAP of our model both used NFPN,Double RoIE and RFP has increased by 6%,and the object detect accuracy of large,medium and small targets is improved at the same time.

Key words: Autonomous driving, Computer vision, Deep learning, Feature fusion, Object detection

CLC Number: 

  • TP183
[1]MOGELMOSE A,TRIVEDI M M,MOESLUND T B.Vision-based traffic sign detection and analysis for intelligent driver assistance systems:Perspectives and survey[J].IEEE Transactions on Intelligent Transportation Systems,2012,13(4):1484-1497.
[2]HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[3]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems.2015:91-99.
[4]LIU S,QI L,QIN H,et al.Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8759-8768.
[5]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[6]JOHN M E.The PASCAL Visual Object Classes Challenge 2007(VOC2007) Development Kit[J].International Journal of Computer Vision,2006,111(1):98-136.
[7]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shot multibox detector[C]//European Conference on Computer Vision.2016:21-37.
[8]TAN M,PANG R,LE Q V.Efficientdet:Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10781-10790.
[9]PANG J,CHEN K,SHI J,et al.Libra r-cnn:Towards balanced learning for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:821-830.
[10]HE K,GKIOXARI G,DOLLÁR P,et al.Mask r-cnn[C]//Pro- ceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[11]REDMON J,FARHADI A.Yolov3:An incremental improve- ment[J].arXiv:1804.02767,2018.
[12]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[13]CAI Z,VASCONCELOS N.Cascade r-cnn:Delving into high quality object detection[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2018:6154-6162.
[14]QIAO S,CHEN L C,YUILLE A.DetectoRS:Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution[J].arXiv:2006.02334,2020.
[15]LIU Y,WANG Y,WANG S,et al.CBNet:A Novel Composite Backbone Network Architecture for Object Detection[C]//AAAI.2020:11653-11660.
[16]SHRIVASTAVA A,GUPTA A,GIRSHICK R.Training region-based object detectors with online hard example mining[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2016:761-769.
[17]CHEN K,WANG J,PANG J,et al.Mmdetection:Open mmlab detection toolbox and benchmark[J].arXiv:1906.07155,2019.
[18]GAO S,CHENG M M,ZHAO K,et al.Res2Net:A New Multi-scale Backbone Architecture[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2021,43(2):652-662,1.
[19]WANG T,YUAN L,ZHANG X,et al.Distilling object detectors with fine-grained feature imitation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:4933-4942.
[1] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[2] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[3] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[4] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[5] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[6] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[7] LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[8] WANG Can, LIU Yong-jian, XIE Qing, MA Yan-chun. Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization [J]. Computer Science, 2022, 49(8): 157-164.
[9] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[10] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[11] ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.
[12] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[13] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[14] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[15] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!