计算机科学 ›› 2018, Vol. 45 ›› Issue (6A): 230-233.

• 模式识别与图像处理 • 上一篇    下一篇

基于超分辨率和深度神经网络的车型识别

雷倩,郝存明,张伟平   

  1. 河北省科学院应用数学研究所 石家庄050081
  • 出版日期:2018-06-20 发布日期:2018-08-03
  • 作者简介:雷 倩(1989-),女,硕士,CCF会员,主要研究方向为人工智能,E-mail:943221957@qq.com;郝存明(1981-),男,硕士,副研究员,主要研究方向为计算机视觉;张伟平(1989-),女,硕士,主要研究方向为图像处理。
  • 基金资助:
    河北省科技计划基金项目(17395602D)资助

Vehicle Recognition Based on Super-resolution and Deep Neural Networks

LEI Qian, HAO Cun-ming,ZHANG Wei-ping   

  1. Institute of Applied Mathematics,Hebei Academy of Sciences,Shijiazhuang 050081,China
  • Online:2018-06-20 Published:2018-08-03

摘要: 车型识别在视频监控系统中起着关键作用,文中利用深度神经网络和超分辨率来实现交通监控中的车型识别。利用深度卷积神经网络CaffeNet,并采用先进的深度学习框架CAFFE和具有强大计算能力的GPU来完成对车辆的车型识别。在图像预处理阶段,采用一种基于深度学习和稀疏表示的图像超分辨率(SR)重构算法,来增强图像的细节信息。其中首先基于深度学习模型自编码器,提出一种改进模型非负稀疏去噪自编码器(Nonnegative Sparse Denoising Auto-Encoders,NSDAE)来实现字典的联合学习,然后基于稀疏表示实现车辆图像的超分辨率重构。经实验验证,在加入超分辨率处理之后,车型识别效果在精确度上得到了明显的提升。

关键词: 超分辨率, 车型识别, 去噪自编码器, 深度卷积神经网络, 深度学习

Abstract: Vehicle recognition plays a key role in traffic video surveillance system.In this paper,deep neural network and super-resolution were uses to realize vehicle recognition in traffic surveillance.It uses deep convolution neural network CaffeNet to complete vehicle recognition with advanced deep learning framework CAFFE and computationally powerful GPU.In the image preprocessing stage,an image super-resolution reconstruction algorithm based on deep learning and sparse representation was used to enhance the detail information of the image.First,based on auto-encoders,an improved model of nonnegative sparse denoising auto-encoders (NSDAE) was proposed to realize the dictio-nary joint learning.Then,the sparse representation was used to realize super-resolution reconstruction of vehicle image.Experimental results show that the accuracy of vehicle recognition is improved obviously after adding the super resolution processing.

Key words: Deep convolutional neural networks, Deep learning, Denoising auto-encoders, Super-resolution, Vehicle recognition

中图分类号: 

  • TP391.4
[1]BUCH N,VELASTIN S A,ORWELL J.A review of computer vision techniques for the analysis of urban traffic[J].IEEE Transactions on Intelligent Transportation Systems,2011,12(3):920-939.
[2]DALAL N,TRIGGS B,SCHMID C.Human detection using orien- ted histograms of flow and appearance[C]∥European Confe-rence on Computer Vision.2006:428-441.
[3]LOWE D G.Distinctive image features from scale-invariant keypoints[J].Kluwer Academic Publishers,2004,60(2):91-110.
[4]MA X,GRIMSON W.Edge-based rich representation for vehicle classification[C]∥Tenth IEEE International Conference on Computer Vision.2005:1185-1192.
[5]OJALA T,PIETIK,INEN M,et al.Multiresolution gray-scale and rotation invariant texture classification with local binary patterns[C]∥IEEE Transactions on Pattern Analysis and Machine Intelligence.2002:971-987.
[6]ZHANG L,LI S Z,YUAN X,et al.Real-time object classification in video surveillance based on appearance learning[C]∥IEEE Conference on Computer Vision & Pattern Recognition.2007:1-8.
[7]孙志军,薛磊,许阳明,等.深度学习研究综述[J].计算机研究应用,2012,29(8):2805-2810.
[8]HINTON G E,SALAKHUTDINOV P R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504-507.
[9]沈焕锋,李平湘,张良培,等.图像超分辨率重建技术与方法综述[J].Optical Technique,2009,35(2):196-203.
[10]YANG J C,WRIGHT J,HUANG T,et al.Image super-resolution as sparse representation of raw image patches[C]∥IEEE Conference of Computer Vision and Pattern Recognition (CVPR).2008:1-8.
[11]CHO K.Simple Sparsification Improves Sparse Denoising Autoencoders in Denoising Highly Noisy Images[C]∥Proceedings of the 30th International Conference on Machine Learning (ICML-13).2013:432-440.
[12]ALAIN D,OLIVIER S.Gated Autoencoders with Tied Input Weights[C]∥International Conference on Machine Learning.2013:154-162.
[13]VINCENT P,LAROCHELLE H,BENGIO Y,et al.Extracting and composing robust features with denoising autoencoders[C]∥International Conference on Machine Learning.2008:1096-1103.
[14]TRIESCH J.A gradient rule for the plasticity of a neuron’s intrinsic excitability[C]∥International Conference on Artificial Neural Networks:Biological Inspirations.Springer-Verlag,2005:65-70.
[15]YANG J C,WANG Z W,LIN Z,et al.Coupled dictionary trai- ning for image super-resolution[C]∥IEEE Transactions on Image Processing.2012:2359-2390.
[16]JIA Y,SHELHAMER E,DONAHUE J,et al.Caffe:Convolutional Architecture for Fast Feature Embedding[J].Eprint Ar-xiv,2014:675-678.
[9]LU Y,WANG Z.Detecting Driver Yawning in Successive Images[C]∥International Conference on Bioinformatics and Biomedical Engineering.IEEE Xplore,2007:581-583.
[10]IBRAHIM M M,SORAGHAN J J,PETROPOULAKIS L,et al.Yawn analysis with mouth occlusion detection[J].Biomedical Signal Processing & Control,2015,18:360-369.
[11]王忠民,曹洪江,范琳.一种基于卷积神经网络深度学习的人体行为识别方法[J].计算机科学,2016,43(Z11):56-58.
[12]CUN Y L,BOSER B,DENKER J,et al.Handwritten digit re- cognition with a backpropogation network[J].World of Computer Science & Information Technology Journal,1990,2(2):396-404.
[13]ABTAHI S,OMIDYEGANEH M,SHIRMOHAMMADI S,et al.YawDD:A Yawning Detection DataSet[C]∥Proceedings of the 5th ACM Multimedia Systems Conference.New York:ACM Press,2014:24-28.
[14]曹莹,苗启广,刘家辰,等.AdaBoost算法研究进展与展望[J].自动化学报,2013,39(6):745-758.
[15]王平,全吉成,赵柏宇.基于双线性插值的图像缩放在GPU上的实现[J].微电子学与计算机,2016,33(11):129-132.
[16]刘春晓,朱臻阳,伍敏,等.基于主色检测与灰度传播的彩色图像灰度化方法[J].计算机辅助设计与图形学学报,2016,28(3):433-442.
[17]VOLOSHYN D.Application of deep learning and computer vision frameworks for solving video context prediction problem [C]∥Proceedings of the 10th International Conference of Programming.2016:164-169.
[18]ZHANG W,YI L M,WANG T,et al.Driver yawning detection based on deep convolutional neural learning and robust nose tracking[C]∥International Joint Conference on Neural Networks.IEEE,2015:1-8.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[3] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[8] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[9] 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮.
基于DNGAN的磁共振图像超分辨率重建算法
Super-resolution Reconstruction of MRI Based on DNGAN
计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105
[10] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[11] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[12] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[13] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[14] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
[15] 王建明, 陈响育, 杨自忠, 史晨阳, 张宇航, 钱正坤.
不同数据增强方法对模型识别精度的影响
Influence of Different Data Augmentation Methods on Model Recognition Accuracy
计算机科学, 2022, 49(6A): 418-423. https://doi.org/10.11896/jsjkx.210700210
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!