计算机科学 ›› 2019, Vol. 46 ›› Issue (2): 249-254.doi: 10.11896/j.issn.1002-137X.2019.02.038

• 图形图像与模式识别 • 上一篇    下一篇

基于多源特征后融合的分层目标检测算法

盛雷, 卫志华, 张鹏宇   

  1. 同济大学计算机科学与技术系 上海201804
    嵌入式系统与服务计算教育部重点实验室(同济大学) 上海201804
  • 收稿日期:2018-07-13 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 卫志华(1979-),女,副教授,博士生导师,CCF会员,主要研究方向为机器学习、数据挖掘、视频内容理解等,E-mail:zhihua_wei@tongji.edu.cn
  • 作者简介:盛 雷(1993-),男,硕士,主要研究方向为目标检测、图像处理;张鹏宇(1993-),男,硕士,主要研究方向为目标检测、目标跟踪等。
  • 基金资助:
    本文受国家自然科学基金项目(61573259),公安部重大专项(20170004),国家重点研发计划项目(2017YFC0821300)资助。

Multi-layer Object Detection Algorithm Based on Multi-source Feature Late Fusion

SHENG Lei, WEI Zhi-hua, ZHANG Peng-yu   

  1. Department of Computer Science and Technology,Tongji University,Shanghai 201804,China
    Key Laboratory of Embedded Systems and Service Computing,Tongji University,Shanghai 201804,China
  • Received:2018-07-13 Online:2019-02-25 Published:2019-02-25

摘要: 目标检测是计算机视觉领域的热门研究课题,是视频内容分析的基础。文中提出了一种基于图像多源特征后融合的分层目标检测算法。在该算法中,使用多级决策的思想对目标检测任务进行粗细两个粒度的划分。在粗粒度层面,先使用HOG特征对图像进行分类,根据分类器的置信度分数,将测试图像分为正例、负例和不确定例。在细粒度层面,使用多种视觉特征以及多种核函数后融合的方法对不确定域中的图像做进一步分类。在同一数据集上设置了3组对比实验。实验结果表明,所提算法在各个评价指标上都有出色的表现,且在实际视频的目标检测中的效果优于Faster-RCNN。

关键词: 多级决策, 后融合, 计算机视觉, 目标检测, 特征提取

Abstract: Object detection is a hot topic in computer vision and it is the foundation of video caption.This paper proposed amulti-layer object detection algorithm based on multi-source feature late fusion,and used ways of multi-level decisions to divide the object detection task into two granularities.At the coarse level,the HOG feature was used to classify the images.According to the confidence scores of the classifier,the test images were categorized into positive,negative and uncertain examples.At the fine level,this paper proposed a multi-source feature late fusion method to classify the examples which are in the uncertain field.This paper conducted several comparative experiments on the same data set.Experimental results demonstrate that the proposed algorithm can obtain excellent results in all evaluation metrics,and achieve a better detection result than Faster-RCNN.

Key words: Computer vision, Feature extraction, Late fusion, Multi-level decision, Object detection

中图分类号: 

  • TP301.6
[1]WANG S.A Review of Gradient-Based and Edge-Based Feature Extraction Methods for Object Detection[C]∥International Conference on Computer and Information Technology.IEEE,2011:277-282.
[2]OLSON C F,HUTTENLOCHER D P.Automatic target recog- nition by matching oriented edge pixels[J].IEEE Transactions on Image Processing,1997,6(1):103-113.
[3]GAVRILA D M,PHILOMIN V.Real-time object detection for smart vehicles[C]∥The Proceedings of the Seventh IEEE International Conference on Computer Vision,1999.IEEE,1999:87-93.
[4]CANNY J.A computational approach to edge detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,1986(6):679-698.
[5]HINTERSTOISSER S,LEPETIT V,ILIC S,et al.Dominant orientation templates for real-time detection of texture-less objects[C]∥2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2010:2257-2264.
[6]HINTERSTOISSER S,CAGNIART C,ILIC S,et al.Gradient response maps for real-time detection of textureless objects[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,34(5):876-888.
[7]HSIAO E.Addressing ambiguity in object instance detection [D].Pittsburgh:Carnegie Mellon University,2013.
[8]LOWE D G.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision,2004,60(2):91-110.
[9]KE Y,SUKTHANKAR R.PCA-SIFT:a more distinctive representation for local image descriptors[C]∥IEEE Computer So-ciety Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2004:506-513.
[10]BAY H,TUYTELAARS T,GOOL L V.SURF:Speeded Up Robust Features[C]∥European Conference on Computer Vision.Springer-Verlag,2006:404-417.
[11]WANG X,HAN T X,YAN S.An HOG-LBP human detector with partial occlusion handling[C]∥International Conference on Computer Vision.IEEE,2010:32-39.
[12]LIENHART R,MAYDT J.An extended set of Haar-like features for rapid object detection[C]∥International Conference on Image Processing.2002:900-903.
[13]VIOLA P,JONES M.Fast and robust classification using asymmetric adaboost and a detector cascade[C]∥Advances in Neural Information Processing Systems.2002:1311-1318.
[14]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[C]∥Advances in Neural Information Processing Systems.2012:1097-1105.
[15]SZEGEDY C,TOSHEV A,ERHAN D.Deep neural networks for object detection[C]∥Advances in Neural Information Processing Systems.2013:2553-2561.
[16]ERHAN D,SZEGEDY C,TOSHEV A,et al.Scalable object de- tection using deep neural networks[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:2147-2154.
[17]SERMANET P,EIGEN D,ZHANG X,et al.Overfeat:Integra- ted recognition,localization and detection using convolutional networks[J].arXiv:1312.6229,2013.
[18]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]∥IEEE Conference on Computer Vision and Pattern Recognition.IEEE Computer Society,2014:580-587.
[19]GIRSHICK R.Fast R-CNN[C]∥IEEE International Con-ference on Computer Vision.IEEE Computer Society,2015:1440-1448.
[20]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[C]∥International Conference on Neural Information Processing Systems.MIT Press,2015:91-99.
[21]HE K,GKIOXARI G,DOLLAR P,et al.Mask R-CNN[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,PP(99):1.
[22]MUNDER S,GAVRILA D M.An Experimental Study on Pedestrian Classification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28(11):1863-1868.
[1] 刘冬梅, 徐洋, 吴泽彬, 刘倩, 宋斌, 韦志辉.
基于边框距离度量的增量目标检测方法
Incremental Object Detection Method Based on Border Distance Measurement
计算机科学, 2022, 49(8): 136-142. https://doi.org/10.11896/jsjkx.220100132
[2] 王灿, 刘永坚, 解庆, 马艳春.
基于软标签和样本权重优化的Anchor Free目标检测算法
Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization
计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240
[3] 张源, 康乐, 宫朝辉, 张志鸿.
基于Bi-LSTM的期货市场关联交易行为检测方法
Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM
计算机科学, 2022, 49(7): 31-39. https://doi.org/10.11896/jsjkx.210400304
[4] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[5] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[6] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
[7] 马宾, 付永康, 王春鹏, 李健, 王玉立.
基于GDIoU损失函数的YOLOv4绝缘子高效定位算法
High Performance Insulators Location Scheme Based on YOLOv4 with GDIoU Loss Function
计算机科学, 2022, 49(6A): 412-417. https://doi.org/10.11896/jsjkx.210600089
[8] 陈永平, 朱建清, 谢懿, 吴含笑, 曾焕强.
基于外接圆半径差损失的实时安全帽检测算法
Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss
计算机科学, 2022, 49(6A): 424-428. https://doi.org/10.11896/jsjkx.220100252
[9] 刘伟业, 鲁慧民, 李玉鹏, 马宁.
指静脉识别技术研究综述
Survey on Finger Vein Recognition Research
计算机科学, 2022, 49(6A): 1-11. https://doi.org/10.11896/jsjkx.210400056
[10] 陈佳舟, 赵熠波, 徐阳辉, 马骥, 金灵枫, 秦绪佳.
三维城市场景中的小物体检测
Small Object Detection in 3D Urban Scenes
计算机科学, 2022, 49(6): 238-244. https://doi.org/10.11896/jsjkx.210400174
[11] 高元浩, 罗晓清, 张战成.
基于特征分离的红外与可见光图像融合算法
Infrared and Visible Image Fusion Based on Feature Separation
计算机科学, 2022, 49(5): 58-63. https://doi.org/10.11896/jsjkx.210200148
[12] 胡伏原, 万新军, 沈鸣飞, 徐江浪, 姚睿, 陶重犇.
深度卷积神经网络图像实例分割方法研究进展
Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network
计算机科学, 2022, 49(5): 10-24. https://doi.org/10.11896/jsjkx.210200038
[13] 徐涛, 陈奕仁, 吕宗磊.
基于改进YOLOv3的机坪工作人员反光背心检测研究
Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm
计算机科学, 2022, 49(4): 239-246. https://doi.org/10.11896/jsjkx.210200119
[14] 张继凯, 李琦, 王月明, 吕晓琪.
基于单目RGB图像的三维手势跟踪算法综述
Survey of 3D Gesture Tracking Algorithms Based on Monocular RGB Images
计算机科学, 2022, 49(4): 174-187. https://doi.org/10.11896/jsjkx.210700084
[15] 左杰格, 柳晓鸣, 蔡兵.
基于图像分块与特征融合的户外图像天气识别
Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion
计算机科学, 2022, 49(3): 197-203. https://doi.org/10.11896/jsjkx.201200263
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!