计算机科学 ›› 2017, Vol. 44 ›› Issue (Z11): 221-224.doi: 10.11896/j.issn.1002-137X.2017.11A.046

• 模式识别与图像处理 • 上一篇    下一篇

基于DCT变换的多通道特征的行人检测

刘春阳,吴泽民,胡磊,刘熹   

  1. 解放军理工大学通信工程学院 南京210007,解放军理工大学通信工程学院 南京210007,解放军理工大学通信工程学院 南京210007,解放军理工大学通信工程学院 南京210007
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(61501509)资助

Pedestrian Detection Based on DCT of Multi-channel Feature

LIU Chun-yang, WU Ze-min, HU Lei and LIU Xi   

  • Online:2018-12-01 Published:2018-12-01

摘要: 在行人检测中,针对目前多通道检测算法特征利用不充分的问题,提出一种基于DCT变换的多通道特征级联的行人检测算法。通过一种2层卷积网络模型将图像信息DCT变换后的数据进行整理,形成新的频域通道特征,该通道能描述行人的复杂纹理特征。结合梯度方向直方图特征、颜色空间特征和DCT频域特征,基于Adaboost算法训练了低开销的多通道特征行人检测器。在典型的公开行人库上的实验结果表明,该方法能提高检测的性能,在较低误检率时效果更加显著。

关键词: 行人检测,DCT变换,多通道特征

Abstract: In pedestrian detection,multi-channel feature detection has the defect of incomplete using features.In this paper,an algorithm of multi-channel feature detection based on discrete cosine transform (DCT) was proposed.We used a two-layer convolution network for arranging the image information after DCT to build a new channel in the frequency domain.This channel can describe complex textures about pedestrian.Combined with the features of the histogram of gradient and the color space as well as the DCT frequency domain,a low cost multi-channel pedestrian detector has been trained based on the Adaboost algorithm.Experiments on popular pedestrian databases show that the proposed method improves the accuracy of detection,and the effect is remarkable in low false positive per image.

Key words: Pedestrian detection,DCT,Multi-channel feature

[1] IKEUCHI K.Computer Vision:A Reference Guide[M].Sprin-ger Publishing Company,Incorporated,2014.
[2] 苏松志,李绍滋,陈淑媛,等.行人检测技术综述[J].电子学报,2012,40(4):814-820.
[3] CAO J,PANG Y,LI X.Pedestrian Detection Inspired by Appearance Constancy and Shape Symmetry[J].IEEE Transactions on Image Processing a Publication of the IEEE Signal Processing Society,2016,5(12):5538.
[4] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNetClassification with Deep Convolutional Neural Networks[J].Advances in Neural Information Processing Systems,2012,25(2):2012.
[5] GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]∥CVPR’14.2014:580-587.
[6] REDMON J,DIiVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-Time Object Detection[C]∥CVPR’16.2016:779-788.
[7] DALAL N,TRIGGS B.Histograms of oriented gradients forhuman detection[C]∥IEEE Conference on Computer Vision & Pattern Recognition.2005:886-893.
[8] ZHANG S,BAUCKHAGE C,CREMERS A B.Informed Haar-Like Features Improve Pedestrian Detection[C]∥IEEE Confe-rence on Computer Vision and Pattern Recognition.2014:947-954.
[9] WANG X G.Deep learning in image recognition[J].Communications of the CCF,2015,11(8):15-23.
[10] DOLLR P,TU Z,PERONA P,et al.Integral Channel Features[C]∥British Machine Vision Conference,BMVC 2009.London,UK,2009.
[11] ZHANG S,BENESON R,OMRAN M,et al.How Far are We from Solving Pedestrian Detection?[C]∥ CVPR’16.2016:1259-1267.
[12] DOLLAR P,APPEL R,BELONGIE S,et al.Fast Feature Pyramids for Object Detection[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2014,36(8):1532-1545.
[13] FARINELLA G M,RAV D,et al.Representing scenes for real-time context classification on mobile devices[J].Pattern Recognition,2015,48(4):1086-1100.
[14] WALLACE G K.The JPEG still picture compression standard[J].Communications of the ACM,1991,34(4):30-44.
[15] STURGESS P,ALAHARI K,LADICKY L,et al.CombiningAppearance and Structure from Motion Features for Road Scene Understanding[C]∥British Machine Vision Conference,BMVC 2009.London,UK,2009.
[16] BATTIATO S,MANCUSO M,BOSCO A,et al.Psychovisualand Statistical Optimization of Quantization Tables for DCT Compression Engines[C]∥International Conference on Image Analysis and Processing.IEEE Computer Society,2001:602-606.
[17] PETERSON H A,PENG H,MORGAN J H,et al.Quantization of color image components in the DCT domain[C]∥Proceedings of SPIE-The International Society for Optical Engineering.1991:210-222.
[18] LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-based lear-ning applied to document recognition[J].Proceedings of the IEEE,1998,86(11):2278-2232.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!