Computer Science ›› 2018, Vol. 45 ›› Issue (8): 22-27.doi: 10.11896/j.issn.1002-137X.2018.08.005

• ChinaMM 2017 • Previous Articles     Next Articles

Human Action Recognition Framework with RGB-D Features Fusion

MAO Xia1, WANG Lan1, LI Jian-jun1,2   

  1. School of Electronic and Information Engineering,Beihang University,Beijing 100191,China1
    School of Information Engineering,Inner Mongolia University of Science and Technology,Baotou,Inner Mongolia 014010,China2
  • Received:2017-10-24 Online:2018-08-29 Published:2018-08-29

Abstract: Human action recognition is an important research direction in the field of computer vision and pattern recognition.The complexity of human behavior and the variety of action performing make behavior recognition still as a challenging subject.With the new generation of sensing technology,RGB-D cameras can simultaneously record RGB images,depth images,and extract skeleton information from depth images in real time.How to take advantages of above information has become the new hotspot and breakthrough point of behavior recognition research.This paper presented a new feature extraction method based on Gaussian weighted pyramid histograms of orientation gradients for RGB images,and built an action recognition framework fusing multiple features.The feature extraction method and the framework proposed in this paper were researched on three databases:UTKinect-Action3D,MSR-Action 3D and Florence 3D Actions.The results indicate that the proposed action recognition framework achieves the accuracy of 97.5%,93.1%,91.7% respectively.It shows the effectiveness of the proposed action recognition framework.

Key words: Action recognition, Feature fusion, Gaussian weighted, Histogram of orientation gradients, Sparse representation classifier

CLC Number: 

  • TP391
[1]XIA L,CHEN C C,Aggarwal J K.View Invariant Human Action Recognition Using Histograms of 3D Joints[C]∥Procee-dings of IEEE Conference on Computer Vision and Pattern Re-cognition.Providence:IEEE Press,2012:20-27.
[2]DALAL N,TRIGGS B.Histograms of Oriented Gradients for Human Detection[C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.San Diego:IEEE Press,2005:886-893.
[3]BOSCH A,ZISSERMAN A,MUMOZ X.Representing Shapewith a Spatial Pyramid Kernel[C]∥Proceedings of the Sixth ACM International Conference on Image and Video Retrieval.New York:ACM,2007:401-408.
[4]LUVIZON D C,TABIA H,PICARD D.Learning FeaturesCombination for Human Recognition from Skeleton Sequences[J/OL].(2017-02-02) [2017-07-06].
[5]YE J,LI K,QI G,et al.Temporal Order-Preserving DynamicQuantization for Human Action Recognition from Multimodal Sensor Streams[C]∥Proceedings of the Annual ACM International Conference on Multimedia Retrieval.Shanghai:ACM,2015:99-106.
[6]ZHANG H L,ZHONG P,HE J L,et al.Combining Depth-Ske-leton Feature with Sparse Coding for Action Recognition[J].Neurocomputing,2016,230(C):417-426.
[7]AHARON M,ELAD M,BRUCKSTEIN A.K-SVD:An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation[J].IEEE Transactions on Signal Processing,2006,54(11):4311-4322.
[8]JEGOU H,DOUZE M,SCHMID C,et al.Aggregating LocalDescriptors into a Compact Image Representation[C]∥Proceedings of the 23th IEEE Conference on Computer Vision and Pattern Recognition.California:IEEE Press,2010:3304-3311.
[9]WRIGHT J,YANG A Y,GANESH A,et al.Robust FaceRe-cognition via Sparse Representation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2009,31(2):210-227.
[10]LI W,ZHANG Z,LIU Z.Action Recognition Based on a Bag of 3D Points[C]∥Proceedings of the 23th IEEE Conference on Computer Vision and Pattern Recognition.San Francisco:IEEE Press,2010:9-14.
[11]SEUDEBARU L,VARANO V,BERRETTI S,et al.Recogni-zing Actions from Depth Cameras as Weakly Aligned Multi-part Bag-of-Poses[C]∥Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition.Portland:IEEE Press,2013:479-485.
[12]WANG J,LIU Z,WU Y,et al.Mining Actionlet Ensemble for Action Recognition with Depth Cameras[C]∥Proceedings of the 25th IEEE Conference on Computer Vision and Pattern Re-cognition.Providence:IEEE Press,2012:1290-1297.
[13]VEMULAPALLI R,ARRATE F,CHELLAPPA R,et al.Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group[C]∥Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE Press,2014:588-595.
[14]DEVANNE M,WANNOUS H,BERRETTI S,et al.3-D Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold[J].IEEE Transactions on Systems,Man,and Cybernetics,2015,45(7):1340-1352.
[15]YIN F,JIAO L C,SHANG F,et al.Sparse Regularization Discriminant Analysis for Face Recognition[J].Neurocomputing,2014,128(5):341-362.
[1] HE Xiao-hui, QIU Fang-bing, CHENG Xi-jie, TIAN Zhi-hui, ZHOU Guang-sheng. High-resolution Image Building Target Detection Based on Edge Feature Fusion [J]. Computer Science, 2021, 48(9): 140-145.
[2] ZHANG Xin-feng, SONG Bo. A Person Re-identification Method Based on Improved Triple Loss and Feature Fusion [J]. Computer Science, 2021, 48(9): 146-152.
[3] YE Zhong-yu, WU Meng-lin. Choroidal Neovascularization Segmentation Combining Temporal Supervision and Attention Mechanism [J]. Computer Science, 2021, 48(8): 118-124.
[4] WANG Shi-yun, YANG Fan. Remote Sensing Image Semantic Segmentation Method Based on U-Net Feature Fusion Optimization Strategy [J]. Computer Science, 2021, 48(8): 162-168.
[5] ZHANG Man, LI Jie, ZHU Xin-zhong, SHEN Ji, CHENG Hao-tian. Augmentation Technology of Remote Sensing Dataset Based on Improved DCGAN Algorithm [J]. Computer Science, 2021, 48(6A): 80-84.
[6] LIU Meng-yang, WU Li-juan, LIANG Hui, DUAN Xu-lei, LIU Shang-qing, GAO Yi-bo. A Kind of High-precision LSTM-FC Atmospheric Contaminant Concentrations Forecasting Model [J]. Computer Science, 2021, 48(6A): 184-189.
[7] PAN Ming-yuan, SONG Hui-hui, ZHANG Kai-hua, LIU Qing-shan. Learning Global Guided Progressive Feature Aggregation Lightweight Network for Salient Object Detection [J]. Computer Science, 2021, 48(6): 103-109.
[8] XU Ze, SHUAI Ren-jun, LIU Kai-kai, MA Li, WU Meng-lin. Generation of Realistic Image from Text Based on Feature Fusion [J]. Computer Science, 2021, 48(6): 125-130.
[9] ZHONG Yue, FANG Hu-sheng, ZHANG Guo-yu, WANG Zhao, ZHU Jing-wei. Method of CNN Flag Movement Recognition Based on 9-axis Attitude Sensor [J]. Computer Science, 2021, 48(6): 153-158.
[10] LYU Jin-na, XING Chun-yu , LI Li. Video Character Relation Extraction Based on Multi-feature Fusion and Fine-granularity Analysis [J]. Computer Science, 2021, 48(4): 117-122.
[11] WANG Yi-hao, DING Hong-wei, LI Bo, BAO Li-yong, ZHANG Ying-jie. Prediction of Protein Subcellular Localization Based on Clustering and Feature Fusion [J]. Computer Science, 2021, 48(3): 206-213.
[12] ZHAN Rui, LEI Yin-jie, CHEN Xun-min, YE Shu-han. Street Scene Change Detection Based on Multiple Difference Features Network [J]. Computer Science, 2021, 48(2): 142-147.
[13] ZHANG Fan, HE Wen-qi, JI Hong-bing, LI Dan-ping, WANG Lei. Multi-view Dictionary-pair Learning Based on Block-diagonal Representation [J]. Computer Science, 2021, 48(1): 233-240.
[14] HE Xin, XU Juan, JIN Ying-ying. Action-related Network:Towards Modeling Complete Changeable Action [J]. Computer Science, 2020, 47(9): 123-128.
[15] HE Lei, SHAO Zhan-peng, ZHANG Jian-hua and ZHOU Xiao-long. Review of Deep Learning-based Action Recognition Algorithms [J]. Computer Science, 2020, 47(6A): 139-147.
Full text



[1] . [J]. Computer Science, 2018, 1(1): 1 .
[2] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75 .
[3] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[4] ZHANG Jia-nan and XIAO Ming-yu. Approximation Algorithm for Weighted Mixed Domination Problem[J]. Computer Science, 2018, 45(4): 83 -88 .
[5] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[6] SHI Wen-jun, WU Ji-gang and LUO Yu-chun. Fast and Efficient Scheduling Algorithms for Mobile Cloud Offloading[J]. Computer Science, 2018, 45(4): 94 -99 .
[7] ZHOU Yan-ping and YE Qiao-lin. L1-norm Distance Based Least Squares Twin Support Vector Machine[J]. Computer Science, 2018, 45(4): 100 -105 .
[8] LIU Bo-yi, TANG Xiang-yan and CHENG Jie-ren. Recognition Method for Corn Borer Based on Templates Matching in Muliple Growth Periods[J]. Computer Science, 2018, 45(4): 106 -111 .
[9] GENG Hai-jun, SHI Xin-gang, WANG Zhi-liang, YIN Xia and YIN Shao-ping. Energy-efficient Intra-domain Routing Algorithm Based on Directed Acyclic Graph[J]. Computer Science, 2018, 45(4): 112 -116 .
[10] CUI Qiong, LI Jian-hua, WANG Hong and NAN Ming-li. Resilience Analysis Model of Networked Command Information System Based on Node Repairability[J]. Computer Science, 2018, 45(4): 117 -121 .