计算机科学 ›› 2020, Vol. 47 ›› Issue (11A): 219-223.doi: 10.11896/jsjkx.200100087

• 计算机图形学&多媒体 • 上一篇    下一篇

基于卷积神经网络的煤炭运载车辆识别

马传香1,2, 汪炀杰1, 王旭1   

  1. 1 湖北大学计算机与信息工程学院 武汉 430062
    2 湖北省教育信息化工程研究中心 武汉 430062
  • 出版日期:2020-11-15 发布日期:2020-11-17
  • 通讯作者: 汪炀杰(wyj_zizy0310@163.com)
  • 作者简介:mcx838@hubu.edu.cn
  • 基金资助:
    湖北省自然科学基金(2019CFB757)

Identification of Coal Vehicles Based on Convolutional Neural Network

MA Chuan-xiang1,2, WANG Yang-jie1, WANG Xu1   

  1. 1 School of Computer and Information Engineering,Hubei University,Wuhan 430062,China
    2 Hubei Engineering Research Center for Educational Informationization,Wuhan 430062,China
  • Online:2020-11-15 Published:2020-11-17
  • About author:MA Chuan-xiang,born in 1971,professor,postgraduate superviser.Her research interests include data mining and machine learning.
    WANG Yang-jie,born in 1994,master.His research interests include deep learning and image recognition.
  • Supported by:
    This work was supported by the Natural Science Foundation of Hubei Province,China(2019CFB757).

摘要: 为了杜绝或避免矿产品资源如煤炭、砂石矿等行业因不开票而导致偷税漏税现象的发生,利用深度卷积神经网络自动识别空车重车是一种有效途径。本文在AlexNet模型基础上,针对空车重车图像的差异性,提出5种改进思路,最终得到一种基于maxout+dropout的6层卷积神经网络的结构。对34 220张空车重车图片的测试结果表明,模型在准确度、敏感度、特异性、精度等方面都取得了良好的效果。此外,模型还具有高度的鲁棒性,可以成功识别大量不同角度和不同场景的空车重车图像。

关键词: AlexNet, maxout, 卷积神经网络, 空车重车识别, 深度学习

Abstract: In order to prevent or avoid the occurrence of tax evasion and taxation caused by non-invoicing of mineral resources such as coal,sand and gravel,it is an effective way to use the deep convolutional neural network to automatically identify empty vehicles.Based on the AlexNet model,this paper proposes5 kinds of improvement ideas for the difference of empty car and heavy vehicle images,and finally obtains a structure of 6-layer convolutional neural network based on maxout+dropout.The test results of the picture of the 34 220 empty cars and loaded cars show that the model has achieved good results in terms of accuracy,sensitivity,specificity and precision.In addition,the model is highly robust and can successfully identify a large number of empty car images with different angles and different scenes.

Key words: AlexNet, CNN, Deep learning, Empty car and loaded car identification, maxout

中图分类号: 

  • TP391
[1] CHEN C.Design of Empty Car Recognition System for Undetected Tipper[J].Lifting and transporting machinery,2018,11:107-109.
[2] KRIZHEVSKY A,SUTSKEVER I,HINTON G.ImageNetClassification with Deep Convolutional Neural Networks[C]//International Conference on Neural Information Processing Systems.Curran Associates Inc,2012:1097-1105.
[3] ZHONG Z Q,YUAN J,TANG X Y.Left-vs-Right Eye Discrim-ination Based on Convolution Neural Network[J].Journal of Computer Research and Development,2018,55(8):1667-1673.
[4] LV H M,ZHAO D,CHI X B.Deep Learning for Early Diagnosis of Alzheimer's Disease Based on Intensive AlexNet[J].Computer Science,2017,44(S1):50-60.
[5] CHEN S W,LIU Y J,LIU D,et al.AlexNet Model and Adaptive Contrast Enhancement Based Ultrasound Imaging Classification[J].Computer Science,2019,46(S1):146-152.
[6] LI N,WANG Y X,XU S,et al.Recognition of Floating Objects on Water Surface With Small Sample Based on AlexNet[J].Computer Aplications and Software,2019,36(2):245-251.
[7] GOODFELLOW I J,WARDE-FARLEY D,MIRZA M,et al.Maxout networks[EB/OL].[2017-0912].http://www-etud.iro.umontreal.ca/~goodfeli/maxout.pdf.
[8] SRIVASTAVA N,HINTON G,KRIZHEVSKY A,et al.Dropout:a simple way to prevent neural networks from overfitting[J].Journal of Machine Learning Research,2014,15(1):1929-1958.
[9] LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-BasedLearning Applied to Document Recognition[J].Proceedings of the IEEE,1998,86(11):2278-2324.
[10] SHUIWANG J,MING Y,KAI Y.3D Convolutional NeuralNetworks for Human Action Recognition[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2013,35(1):221-231.
[11] HUBEL D H,WIESEL T N.Receptive field,binocular interaction and functional architecture in the cat's visual cortex[J].The Journal of Physiology,1962,160(1):106-154.
[12] WANG X H.Tensorlow deep learning application practice[M].Beijin:Tsinghua University Press,2018.
[13] HINTON G,DENG L,YU D,et al.Deep Neural Networks for Acoustic Modeling in Speech Recognition[J].IEEE Signal Processing Magazine,2012,29(6):82-97.
[14] ZHAO H Z,LIU F X,LI L Y,et al.Improving deep convolutional neural networks with mixed maxout units[J].Journal of Communications,2017,38(7):107-114.
[15] BAI C,HUANG L,CHEN J N,et al.Optimization of Deep Convolutional Neural Network for Large Scale Image Classificaion.Journal of Software[J].2018,29(4):1029-1038.
[16] LIU F,SHEN C.Learning Deep Convolutional Features forMRI Based Alzheimer's Disease Classification[OL].http://ar-xiv-web.arxiv.org/pdf/1404.3366v1.
[17] SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large Scale Image Recognition.[OL].http://arxiv.org/abs/1409.1556.2014.
[18] FAN M,MENG X F.Data mining concepts and techniques[M].Beijing:China Machine Press.2012.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[5] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[6] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[7] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[8] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[9] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[10] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[11] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[12] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[13] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[14] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[15] 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮.
基于DNGAN的磁共振图像超分辨率重建算法
Super-resolution Reconstruction of MRI Based on DNGAN
计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!