Computer Science ›› 2018, Vol. 45 ›› Issue (8): 22-27.doi: 10.11896/j.issn.1002-137X.2018.08.005

• ChinaMM 2017 • Previous Articles     Next Articles

Human Action Recognition Framework with RGB-D Features Fusion

MAO Xia1, WANG Lan1, LI Jian-jun1,2   

  1. School of Electronic and Information Engineering,Beihang University,Beijing 100191,China1
    School of Information Engineering,Inner Mongolia University of Science and Technology,Baotou,Inner Mongolia 014010,China2
  • Received:2017-10-24 Online:2018-08-29 Published:2018-08-29

Abstract: Human action recognition is an important research direction in the field of computer vision and pattern recognition.The complexity of human behavior and the variety of action performing make behavior recognition still as a challenging subject.With the new generation of sensing technology,RGB-D cameras can simultaneously record RGB images,depth images,and extract skeleton information from depth images in real time.How to take advantages of above information has become the new hotspot and breakthrough point of behavior recognition research.This paper presented a new feature extraction method based on Gaussian weighted pyramid histograms of orientation gradients for RGB images,and built an action recognition framework fusing multiple features.The feature extraction method and the framework proposed in this paper were researched on three databases:UTKinect-Action3D,MSR-Action 3D and Florence 3D Actions.The results indicate that the proposed action recognition framework achieves the accuracy of 97.5%,93.1%,91.7% respectively.It shows the effectiveness of the proposed action recognition framework.

Key words: Action recognition, Feature fusion, Gaussian weighted, Histogram of orientation gradients, Sparse representation classifier

CLC Number: 

  • TP391
[1]XIA L,CHEN C C,Aggarwal J K.View Invariant Human Action Recognition Using Histograms of 3D Joints[C]∥Procee-dings of IEEE Conference on Computer Vision and Pattern Re-cognition.Providence:IEEE Press,2012:20-27.
[2]DALAL N,TRIGGS B.Histograms of Oriented Gradients for Human Detection[C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.San Diego:IEEE Press,2005:886-893.
[3]BOSCH A,ZISSERMAN A,MUMOZ X.Representing Shapewith a Spatial Pyramid Kernel[C]∥Proceedings of the Sixth ACM International Conference on Image and Video Retrieval.New York:ACM,2007:401-408.
[4]LUVIZON D C,TABIA H,PICARD D.Learning FeaturesCombination for Human Recognition from Skeleton Sequences[J/OL].(2017-02-02) [2017-07-06].
[5]YE J,LI K,QI G,et al.Temporal Order-Preserving DynamicQuantization for Human Action Recognition from Multimodal Sensor Streams[C]∥Proceedings of the Annual ACM International Conference on Multimedia Retrieval.Shanghai:ACM,2015:99-106.
[6]ZHANG H L,ZHONG P,HE J L,et al.Combining Depth-Ske-leton Feature with Sparse Coding for Action Recognition[J].Neurocomputing,2016,230(C):417-426.
[7]AHARON M,ELAD M,BRUCKSTEIN A.K-SVD:An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation[J].IEEE Transactions on Signal Processing,2006,54(11):4311-4322.
[8]JEGOU H,DOUZE M,SCHMID C,et al.Aggregating LocalDescriptors into a Compact Image Representation[C]∥Proceedings of the 23th IEEE Conference on Computer Vision and Pattern Recognition.California:IEEE Press,2010:3304-3311.
[9]WRIGHT J,YANG A Y,GANESH A,et al.Robust FaceRe-cognition via Sparse Representation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2009,31(2):210-227.
[10]LI W,ZHANG Z,LIU Z.Action Recognition Based on a Bag of 3D Points[C]∥Proceedings of the 23th IEEE Conference on Computer Vision and Pattern Recognition.San Francisco:IEEE Press,2010:9-14.
[11]SEUDEBARU L,VARANO V,BERRETTI S,et al.Recogni-zing Actions from Depth Cameras as Weakly Aligned Multi-part Bag-of-Poses[C]∥Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition.Portland:IEEE Press,2013:479-485.
[12]WANG J,LIU Z,WU Y,et al.Mining Actionlet Ensemble for Action Recognition with Depth Cameras[C]∥Proceedings of the 25th IEEE Conference on Computer Vision and Pattern Re-cognition.Providence:IEEE Press,2012:1290-1297.
[13]VEMULAPALLI R,ARRATE F,CHELLAPPA R,et al.Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group[C]∥Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE Press,2014:588-595.
[14]DEVANNE M,WANNOUS H,BERRETTI S,et al.3-D Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold[J].IEEE Transactions on Systems,Man,and Cybernetics,2015,45(7):1340-1352.
[15]YIN F,JIAO L C,SHANG F,et al.Sparse Regularization Discriminant Analysis for Face Recognition[J].Neurocomputing,2014,128(5):341-362.
[1] ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.
[2] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[3] CHEN Yong-ping, ZHU Jian-qing, XIE Yi, WU Han-xiao, ZENG Huan-qiang. Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss [J]. Computer Science, 2022, 49(6A): 424-428.
[4] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[5] YU Shu-hao, ZHOU Hui, YE Chun-yang, WANG Tai-zheng. SDFA:Study on Ship Trajectory Clustering Method Based on Multi-feature Fusion [J]. Computer Science, 2022, 49(6A): 256-260.
[6] YANG Yue, FENG Tao, LIANG Hong, YANG Yang. Image Arbitrary Style Transfer via Criss-cross Attention [J]. Computer Science, 2022, 49(6A): 345-352.
[7] LAN Ling-xiang, CHI Ming-min. Remote Sensing Change Detection Based on Feature Fusion and Attention Network [J]. Computer Science, 2022, 49(6): 193-198.
[8] FAN Xin-nan, ZHAO Zhong-xin, YAN Wei, YAN Xi-jun, SHI Peng-fei. Multi-scale Feature Fusion Image Dehazing Algorithm Combined with Attention Mechanism [J]. Computer Science, 2022, 49(5): 50-57.
[9] LI Fa-guang, YILIHAMU·Yaermaimaiti. Real-time Detection Model of Insulator Defect Based on Improved CenterNet [J]. Computer Science, 2022, 49(5): 84-91.
[10] DONG Qi-da, WANG Zhe, WU Song-yang. Feature Fusion Framework Combining Attention Mechanism and Geometric Information [J]. Computer Science, 2022, 49(5): 129-134.
[11] LI Peng-zu, LI Yao, Ibegbu Nnamdi JULIAN, SUN Chao, GUO Hao, CHEN Jun-jie. Construction and Classification of Brain Function Hypernetwork Based on Overlapping Group Lasso with Multi-feature Fusion [J]. Computer Science, 2022, 49(5): 206-211.
[12] XU Hua-jie, QIN Yuan-zhuo, YANG Yang. Scene Recognition Method Based on Multi-level Feature Fusion and Attention Module [J]. Computer Science, 2022, 49(4): 209-214.
[13] GAO Xin-yue, TIAN Han-min. Droplet Segmentation Method Based on Improved U-Net Network [J]. Computer Science, 2022, 49(4): 227-232.
[14] XU Tao, CHEN Yi-ren, LYU Zong-lei. Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm [J]. Computer Science, 2022, 49(4): 239-246.
[15] YANG Xiao-yu, YIN Kang-ning, HOU Shao-qi, DU Wen-yi, YIN Guang-qiang. Person Re-identification Based on Feature Location and Fusion [J]. Computer Science, 2022, 49(3): 170-178.
Full text



No Suggested Reading articles found!