计算机科学 ›› 2019, Vol. 46 ›› Issue (6): 305-310.doi: 10.11896/j.issn.1002-137X.2019.06.046

• 图形图像与模式识别 • 上一篇    下一篇

基于复合特征及深度学习的人群行为识别算法

袁亚军, 李菲菲, 陈虬   

  1. (上海理工大学光电信息与计算机工程学院 上海200093)
  • 收稿日期:2018-05-12 发布日期:2019-06-24
  • 通讯作者: 李菲菲(1970-),女,博士,教授,主要研究方向为多媒体信息处理、图像处理与模式识别、信息检索,E-mail:feifeilee1701@163.com
  • 作者简介:袁亚军(1993-),男,硕士生,主要研究方向为计算机视觉与模式识别;陈 虬(1972-),男,博士,教授,主要研究方向为图像处理与模式识别、计算机视觉、信息检索。
  • 基金资助:
    上海市高校特聘教授(东方学者)岗位计划(ES2015XX)资助。

Crowd Behavior Recognition Algorithm Based on Combined Features and Deep Learning

YUAN Ya-jun, LEE Fei-fei, CHEN Qiu   

  1. (School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China)
  • Received:2018-05-12 Published:2019-06-24

摘要: 分析人群行为的目的是更好地分析与管理人群运动的状态与趋势。针对人群行为的两种特征信息,提出了一种基于深度学习的人群行为识别方法。先将人群作为主要对象,通过前景提取方法来提取人群静态信息,利用人群运动的变化获取人群动态信息,借助卷积神经网络(CNN)模型学习这两种不同的人群行为特征,再综合这两种特征来分析常见的人群行为。同时,人群数据提取位置与间隔是影响人群行为分析的重要因素。实验结果表明,这两种人群特征能更好地描述空间维度上的人群状态和时间维度上的人群变化,合理的数据位置与数据间隔可以有效地提高人群信息的表达能力。最后将提出的方法与其他人群行为分析方法进行比较,定量与定性的实验结果验证了所提方法的有效性,同时也表明了所提方法能得到更优的混淆矩阵和更高的准确度。

关键词: CNN, 动态特征, 静态特征, 人群行为识别, 数据提取

Abstract: The target of analyzing crowd behavior is to better analyze and manage the state and tendency of crowd movement.This paper proposed a novel deep learning based crowd behavior recognition method by using two types of crowd behavior features.Firstly,the crowd is regarded as the main object,a foreground extraction method is used to extract the static information of crowd,and the dynamic information of crowd is obtained by the change of the crowd movement.Then two different crowd behavior characteristics are learned by using convolution neural network (CNN) model,so as to analyze crowd behaviors in the end.Additionally,the extraction location and interval of crowd data are crucial factors in the crowd behavior recognition.Experimental results show that two crowd characteristics can better describe crowd states on the spatial dimension and crowd changes on the temporal dimension.The rational data location and data interval can effectively improve the expression ability of crowd information.At last,this method was compared with other crowd behavior recognition algorithms.The quantitative and qualitative experimental results demonstrate the validity of the proposed method.Besides,better confusion matrix and higher precision can be obtained by this method.

Key words: CNN, Crowd behavior recognition, Data extraction, Dynamic characteristic, Static characteristic

中图分类号: 

  • TP391
[1]SHAO J,KANG K,CHEN C L,et al.Deeply learned attributes for crowded scene understanding[C]∥International Conference on Computer Vision and Pattern Recognition.2015:4657-4666.
[2]PALANISAMY G,MANIKANDAN T T.Group Behaviour Profiling for Detection of Anomaly in Crowd[C]∥International Conference on Technical Advancements in Computers and Communications.2017:11-15.
[3]RODRIGUES F,LOURENCO M,RIBEIRO B,et al.Learning Supervised Topic Models for Classification and Regression from Crowds[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,PP(99):1-1.
[4]YI S,LI H,WANG X.Pedestrian Behavior Modeling from Stationary Crowds with Applications to Intelligent Surveillance[J].IEEE Transactions on Image Processing,2016,25(9):4354-4368.
[5]SENGUPTA S,WANG H,BLACKBURN W,et al.Spatial information in classification of activity videos[C]∥2015 Federated Conference on Computer Science and Information Systems.2015:145-153.
[6]SHAO J,CHEN C L,WANG X.Learning Scene-Independent Group Descriptors for Crowd Understanding[J].IEEE Transactions on Circuits & Systems for Video Technology,2017,27(6):1290-1303.
[7]ZHANG C,KANG K,LI H,et al.Data-Driven Crowd Understanding:A Baseline for a Large-Scale Crowd Dataset[J].IEEE Transactions on Multimedia,2016,18(6):1048-1061.
[8]JI S,YANG M,YU K.3D convolutional neural networks for human action recognition [J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2013,35(1):221-231.
[9]SIMONYAN K,ZISSERMAN A.Two-Stream Convolutional Networks for Action Recognition in Videos[J].Advances in Neural Information Processing Systems,2014,1(4):568-576.
[10]SHAO J,CHEN C L,KANG K,et al.Slicing Convolutional Neural Network for Crowd Video Understanding[C]∥International Conference Conference on Computer Vision and Pattern Recognition.2016:5620-5628.
[11]JING S,CHEN C L,KAI K,et al.Crowded Scene Understan-ding by Deeply Learned Volumetric Slices[J].IEEE Transactions on Circuits & Systems for Video Technology,2017,27(3):613-623.
[12]KARPATHY A,TODERICI G,SHETTY S,et al.Large-Scale Video Classification with Convolutional Neural Networks[C]∥International Conference Computer Vision and Pattern Recognition.2014:1725-1732.
[13]SHAO J,LOY C C,WANG X.Scene-Independent Group Profiling in Crowd[C]∥International Conference on Computer Vision and Pattern Recognition.2014:2227-2234.
[14]BURNEY A,SYED T Q.Crowd Video Classification Using Convolutional Neural Networks[C]∥International Conference on Frontiers of Information Technology.2017:1255-1259.
[15]YI S,WANG X.Profiling stationary crowd groups[C]∥International Conference on Multimedia and Expo.2014:1-6.
[16]YI S,LI H,WANG X.Understanding pedestrian behaviors from stationary crowd groups[C]∥International Conference on Computer Vision and Pattern Recognition.2015:3488-3496.
[17]SENST T,EISELEIN V,SIKORA T.A local feature based on lagrangian measures for violent video classification[C]∥International Conference on Imaging for Crime Prevention and Detection.2015:1-6.
[18]FEICHTENHOFER C,PINZ A,ZISSERMAN A.Convolutional Two-Stream Network Fusion for Video Action Recognition[C]∥International Conference on Computer Vision and PatternRe-cognition.2016:1933-1941.
[19]GOMEZ-DONOSO F,GARCIA-GARCIA A,GARCIA-RO-DRIGUEZ J,et al.LonchaNet:A sliced-based CNN architecture for real-time 3D object recognition[C]∥International Joint Conference on Neural Networks.2017:412-418.
[20]MARSDEN M,MCGUINNESS K,LITTLE S,et al.Fully Convolutional Crowd Counting On Highly Congested Scenes[J].arXiv preprint arXiv:1612.00220.
[21]SHI Y,TIAN Y,WANG Y,et al.Sequential Deep Trajectory Descriptor for Action Recognition with Three-Stream CNN[J].IEEE Transactions on Multimedia,2017,19(7):1510-1520.
[1] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
[2] 岳晴, 尹健宇, 王生生.
基于改进CNN的低剂量CT图像肺结节自动检测
Automatic Detection of Pulmonary Nodules in Low-dose CT Images Based on Improved CNN
计算机科学, 2022, 49(6A): 54-59. https://doi.org/10.11896/jsjkx.210400211
[3] 余本功, 张子薇, 王惠灵.
一种融合多层次情感和主题信息的TS-AC-EWM在线商品排序方法
TS-AC-EWM Online Product Ranking Method Based on Multi-level Emotion and Topic Information
计算机科学, 2022, 49(6A): 165-171. https://doi.org/10.11896/jsjkx.210400238
[4] 王杉, 徐楚怡, 师春香, 张瑛.
基于CNN-LSTM的卫星云图云分类方法研究
Study on Cloud Classification Method of Satellite Cloud Images Based on CNN-LSTM
计算机科学, 2022, 49(6A): 675-679. https://doi.org/10.11896/jsjkx.210300177
[5] 赵征鹏, 李俊钢, 普园媛.
基于卷积神经网络的Retinex低照度图像增强
Low-light Image Enhancement Based on Retinex Theory by Convolutional Neural Network
计算机科学, 2022, 49(6): 199-209. https://doi.org/10.11896/jsjkx.210400092
[6] 黄晓生, 徐静.
基于PCANet的非下采样剪切波域多聚焦图像融合
Multi-focus Image Fusion Method Based on PCANet in NSST Domain
计算机科学, 2021, 48(9): 181-186. https://doi.org/10.11896/jsjkx.200800064
[7] 崔雯昊, 蒋慕蓉, 杨磊, 傅鹏铭, 朱凌霄.
结合MCycleGAN与RFCNN实现太阳斑点图高分辨重建
Combining MCycleGAN and RFCNN to Realize High Resolution Reconstruction of Solar Speckle Image
计算机科学, 2021, 48(6A): 38-42. https://doi.org/10.11896/jsjkx.201000160
[8] 刘汉卿, 康晓东, 李博, 张华丽, 冯继超, 韩俊玲.
利用深度学习网络对医学影像分类识别的比较研究
Comparative Study on Classification and Recognition of Medical Images Using Deep Learning Network
计算机科学, 2021, 48(6A): 89-94. https://doi.org/10.11896/jsjkx.201000116
[9] 俞建业, 戚湧, 王宝茁.
基于Spark的车联网分布式组合深度学习入侵检测方法
Distributed Combination Deep Learning Intrusion Detection Method for Internet of Vehicles Based on Spark
计算机科学, 2021, 48(6A): 518-523. https://doi.org/10.11896/jsjkx.200700129
[10] 束鑫, 常锋, 张歆, 杜睿, 余转.
基于BCNN的胎儿颅脑超声横切面识别算法
Transverse Section Recognition Algorithm Based on BCNN for Fetal Craniocerebral Ultrasound
计算机科学, 2021, 48(4): 151-156. https://doi.org/10.11896/jsjkx.200500049
[11] 齐延荣, 周夏冰, 李斌, 周清雷.
基于FPGA的CNN图像识别加速与优化
FPGA-based CNN Image Recognition Acceleration and Optimization
计算机科学, 2021, 48(4): 205-212. https://doi.org/10.11896/jsjkx.200600089
[12] 陈明豪, 祝跃飞, 芦斌, 翟懿, 李玎.
基于Attention-CNN的加密流量应用类型识别
Classification of Application Type of Encrypted Traffic Based on Attention-CNN
计算机科学, 2021, 48(4): 325-332. https://doi.org/10.11896/jsjkx.200900155
[13] 李冰荣, 皮德常, 候梦如.
基于CNN和LSTM的移动对象目的地预测
Destination Prediction of Moving Objects Based on Convolutional Neural Networks and Long-Short Term Memory
计算机科学, 2021, 48(4): 70-77. https://doi.org/10.11896/jsjkx.200200024
[14] 王丽芳, 王蕊芳, 蔺素珍, 秦品乐, 高媛, 张晋.
基于双残差超密集网络的多模态医学图像融合
Multimodal Medical Image Fusion Based on Dual Residual Hyper Densely Networks
计算机科学, 2021, 48(2): 160-166. https://doi.org/10.11896/jsjkx.200400095
[15] 刘肖, 袁冠, 张艳梅, 闫秋艳, 王志晓.
基于自适应多分类器融合的手势识别
Hand Gesture Recognition Based on Self-adaptive Multi-classifiers Fusion
计算机科学, 2020, 47(7): 103-110. https://doi.org/10.11896/jsjkx.200100073
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!