计算机科学 ›› 2018, Vol. 45 ›› Issue (8): 229-235.doi: 10.11896/j.issn.1002-137X.2018.08.041

• 图形图像与模式识别 • 上一篇    下一篇

基于自适应叠合分割与深度神经网络的人数统计方法

郭文生, 包灵, 钱智成, 曹万里   

  1. 电子科技大学信息与软件工程学院 成都610054
  • 收稿日期:2017-06-06 出版日期:2018-08-29 发布日期:2018-08-29
  • 作者简介:郭文生(1976-),男,博士,副教授,CCF会员,主要研究方向为人工智能、嵌入式系统及应用、软件测试、形式化方法等,E-mail:gws@uestc.edu.cn(通信作者); 包 灵(1993-),男,硕士生,主要研究方向为深度学习、计算机视觉; 钱智成(1995-),男,主要研究方向为深度学习; 曹万里(1995-),男,主要研究方向为深度学习。
  • 基金资助:
    本文受国家自然科学基金(61272175,61572109),中央高校基本业务费(ZYGX2015J066)资助。

People Counting Method Based on Adaptive Overlapping Segmentation and Deep Neural Network

GUO Wen-sheng, BAO Ling, QIAN Zhi-cheng, CAO Wan-li   

  1. School of Information and Software Engineering,University of Electronic Science and Technology of China,Chengdu 610054,China
  • Received:2017-06-06 Online:2018-08-29 Published:2018-08-29

摘要: 基于监控视频的人数(人群)统计是人群行为的分析、资源的优化配置、现代安防、商业信息的采集以及智能管理等重要任务的基础,具有较高的研究意义与应用价值。近年来,数字图像处理技术以及深度学习理论的不断完善和发展,极大地促进了基于监控视频的人数统计的研究,但仍然无法很好地解决监控场景中人数统计准确率较低、高清图片耗时的问题。针对在待检对象尺度变化较大的情况下,基于对象检测的人数统计方法的准确率大幅下降的问题,提出一种基于自适应叠合分割与深度神经网络的人数统计方法。该方法的思想来源于注意力机制,同时充分利用了叠合分割块内人头对象的尺度信息和人数信息。实验结果表明,自适应叠合分割算法能够与现有深度神经网络对象检测模型相结合,并且相较于直接利用深度神经网络对象检测模型进行人数统计的方法,该结合方法可以大幅提高人数统计的准确率。

关键词: 对象检测, 非最大值抑制, 人数统计, 深度神经网络, 自适应叠合分割

Abstract: People counting based on surveillance camera is fundamentalfor analyzing behavior of counting,resource optimization and resource allocation,modern security and protection,collecting commerce information as well as intelligent management.Therefore,it has significant meaning of study and application value.Recently,technology of digital image processing and theory of deep learning are constantly improved and developed,extremely promoting the study of people counting based on surveillance camera.However,there exist some problems,such as low accuracy of people counting and time-consuming of high definition,which are unable to be solved.In the wide range of object scale,accuracy of people counting method based on object detection decreases significantly.Aiming at this problem,this paper proposed a people counting method based onadaptive overlapping segmentation and deep neural network.The idea of this method comes from attention mechanism,and makes full use of information of the scales and numbers of head object in overlapping segmentation.The experimental results show that the adaptive overlapping segmentation algorithm can combine existing object detection model based on neural network.What’s more,compared with the method of counting people by directly using object detection model based on neural network,the combination algorithm of adaptive overlapping segmentation and deep neural network can greatly improve the accuracy of people counting.

Key words: Adaptive overlapping segmentation, Deep neural network, NMS, Object detection, People counting

中图分类号: 

  • TP391.41
[1]KOWCIKA A.A Literature Study on Crowd (People) Counting With the Help of Surveillance Videos.International Journal of Information Technology & Decision Making,2015,3(4):2353-2361.
[2]SUBBURAMAN V B,DESCAMPS A,CARINCOTTE C.Countingpeople in the crowd using a generic head detector[C]∥2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance (AVSS).IEEE,2012:470-475.
[3]VU T H,OSOKIN A,LAPTEV I.Context-Aware CNNs forPerson Head Detection[C]∥IEEE International Conference on Computer Vision.IEEE,2015.
[4]ZHANG Y,ZHOU D,CHEN S,et al.Single-Image CrowdCounting via Multi-Column Convolutional Neural Network[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:589-597.
[5]LI B,ZHANG J,ZHANG Z,et al.A people counting method based on head detection and tracking[C]∥2014 International Conference on Smart Computing (SMARTCOMP).IEEE,2014:136-141.
[6]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks.IEEETransactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[7]ZENG X,OUYANG W,YAN J,et al.Crafting gbd-net for object detection.IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,PP(99):1-9.
[8]REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-Time Object Detection∥Computer Vision and Pattern Recognition.IEEE,2016:779-788.
[9]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single shotmultibox detector∥European Conference on Computer Vision.Springer,Cham,2016:21-37.
[10]SERMANET P,EIGEN D,ZHANG X,et al.Overfeat:Integra-ted recognition,localization and detection using convolutional networks.arXiv preprint arXiv:1312.6229,2013.
[11]HE K,ZHANG X,REN S,et al.Spatial Pyramid Pooling inDeep Convolutional Networks for Visual Recognition[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2014,37(9):1904-1916.
[12]UIJLINGS J R R,VAN DE SANDE K E A,GEVERS T,et al.Selective search for object recognition.International Journal of Computer Vision,2013,104(2):154-171.
[13]ZITNICK C L,DOLLÁR P.Edge boxes:Locating object propo-sals from edges∥European Conference on Computer Vision.Springer,Cham,2014:391-405.
[14]LU Y,JAVIDI T,LAZEBNIK S.Adaptive object detectionusing adjacency and zoom prediction∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:2351-2359.
[15]XIA F,WANG P,CHEN L C,et al.Zoom better to see clearer:Human and object parsing with hierarchical auto-zoom net∥European Conference on Computer Vision.Springer,Cham,2016:648-663.
[1] 焦翔, 魏祥麟, 薛羽, 王超, 段强.
基于深度学习的自动调制识别研究
Automatic Modulation Recognition Based on Deep Learning
计算机科学, 2022, 49(5): 266-278. https://doi.org/10.11896/jsjkx.211000085
[2] 高捷, 刘沙, 黄则强, 郑天宇, 刘鑫, 漆锋滨.
基于国产众核处理器的深度神经网络算子加速库优化
Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor
计算机科学, 2022, 49(5): 355-362. https://doi.org/10.11896/jsjkx.210500226
[3] 范红杰, 李雪冬, 叶松涛.
面向电子病历语义解析的疾病辅助诊断方法
Aided Disease Diagnosis Method for EMR Semantic Analysis
计算机科学, 2022, 49(1): 153-158. https://doi.org/10.11896/jsjkx.201100125
[4] 周欣, 刘硕迪, 潘薇, 陈媛媛.
自然交通场景中的车辆颜色识别
Vehicle Color Recognition in Natural Traffic Scene
计算机科学, 2021, 48(6A): 15-20. https://doi.org/10.11896/jsjkx.200800078
[5] 刘东, 王叶斐, 林建平, 马海川, 杨闰宇.
端到端优化的图像压缩技术进展
Advances in End-to-End Optimized Image Compression Technologies
计算机科学, 2021, 48(3): 1-8. https://doi.org/10.11896/jsjkx.201100134
[6] 马琳, 王云霄, 赵丽娜, 韩兴旺, 倪金超, 张婕.
基于多模型判别的网络入侵检测系统
Network Intrusion Detection System Based on Multi-model Ensemble
计算机科学, 2021, 48(11A): 592-596. https://doi.org/10.11896/jsjkx.201100170
[7] 潘雨, 邹军华, 王帅辉, 胡谷雨, 潘志松.
基于网络表示学习的深度社团发现方法
Deep Community Detection Algorithm Based on Network Representation Learning
计算机科学, 2021, 48(11A): 198-203. https://doi.org/10.11896/jsjkx.210200113
[8] 刘天星, 李伟, 许铮, 张立华, 戚骁亚, 甘中学.
面向高维连续行动空间的蒙特卡罗树搜索算法
Monte Carlo Tree Search for High-dimensional Continuous Control Space
计算机科学, 2021, 48(10): 30-36. https://doi.org/10.11896/jsjkx.201000129
[9] 张艳梅, 楼胤成.
基于深度神经网络的庞氏骗局合约检测方法
Deep Neural Network Based Ponzi Scheme Contract Detection Method
计算机科学, 2021, 48(1): 273-279. https://doi.org/10.11896/jsjkx.191100020
[10] 丁子昂, 乐曹伟, 吴玲玲, 付明磊.
基于CEEMD-Pearson和深度LSTM混合模型的PM2.5浓度预测方法
PM2.5 Concentration Prediction Method Based on CEEMD-Pearson and Deep LSTM Hybrid Model
计算机科学, 2020, 47(6A): 444-449. https://doi.org/10.11896/JsJkx.190700158
[11] 尚骏远, 杨乐涵, 何琨.
基于特征可视化分析深度神经网络的内部表征
Analyzing Latent Representation of Deep Neural Networks Based on Feature Visualization
计算机科学, 2020, 47(5): 190-197. https://doi.org/10.11896/jsjkx.190700128
[12] 唐国强,高大启,阮彤,叶琪,王祺.
融入语言模型和注意力机制的临床电子病历命名实体识别
Clinical Electronic Medical Record Named Entity Recognition Incorporating Language Model and Attention Mechanism
计算机科学, 2020, 47(3): 211-216. https://doi.org/10.11896/jsjkx.190200259
[13] 樊玮, 刘挺, 黄睿, 郭青, 张宝.
卷积神经网络低层特征辅助的图像实例分割方法
Low-level CNN Feature Aided Image Instance Segmentation
计算机科学, 2020, 47(11): 186-191. https://doi.org/10.11896/jsjkx.191200063
[14] 孔繁钰, 周愉峰, 陈纲.
基于时空特征挖掘的交通流量预测方法
Traffic Flow Prediction Method Based on Spatio-Temporal Feature Mining
计算机科学, 2019, 46(7): 322-326. https://doi.org/10.11896/j.issn.1002-137X.2019.07.049
[15] 彭金喜, 苏远歧, 薛笑荣.
基于深度学习和同生矩阵的SAR图像纹理特征检索方法
SAR Image Feature Retrieval Method Based on Deep Learning and Synchronic Matrix
计算机科学, 2019, 46(6A): 196-199.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!