Computer Science ›› 2020, Vol. 47 ›› Issue (6): 180-183.doi: 10.11896/jsjkx.200200030

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Gesture Recognition Algorithm Based on Improved Multiscale Deep Convolutional Neural Network

JING Yu, QI Rui-hua, LIU Jian-xin, LIU Zhao-xia   

  1. School of Software,Dalian University of Foreign Languages,Dalian,Liaoning 116044,China
  • Received:2020-02-05 Online:2020-06-15 Published:2020-06-10
  • About author:JING Yu,born in 1982,Ph.D,associate professor,is a member of China Computer Federation.Her main research interests include image processing,pattern recognition and computer vision.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China(61501082),Scientific Research Fund Project of Dalian University of Foreign Languages (2018XJYB27),Scientific Research Project of Liaoning Province Educational Department (2019JYT01,2019JYT07),Natural Science Foundation Project of Liaoning Province(20180550018) and Doctoral Research Start Fund Project of Liaoning Province (2019BS061)

Abstract: Since the traditional shallow learning networks rely too much on manual selection of gesture features,they cannot adapt to complex and varied natural scenes in real time.Based on the convolutional neural network architecture,this paper proposes an improved multi-scale deep network gesture recognition model,which makes it possible to overcome the drawbacks of ma-nual extraction features by using the convolutional layer to automatically learn gesture features.In this method,the adaptive multi-scale features are introduced to realize that convolution kernels with different sizes at the same convolutional layer to gene-rate different scale features,and achieves feature map fusion with different levels by cascading shallow and deep features.In addition,in order to enhance the generalization ability of the model,this paper proposes a loss function based on regularization constraints.The experimental results show that the recognition accuracy of the proposed network model is higher than that of the ordinary single -scale convolutional neural network,and the shortcomings of imprecise and incomprehensive extraction as well as poor stability are overcome,and the time required for network training is not greatly increased.

Key words: Convolution feature, Deep learning, Gesture recognition, Loss function, Multi-scale, Regularization

CLC Number: 

  • TP391
[1]MITRA S,ACHARYA T.Gesture Recognition:A Survey[J].IEEE Transactions on Systems,Man and Cybernetics,Part C,(Applications and Reviews),2007,37(3):311-324.
[2] WU Y,HUANG T S,MATHEWS N.Vision-Based Gesture Recognition:A Review[C]//International Gesture Workshop on Gesture-based Communication in Human-computer Interaction.Springer-Verlag,1999.
[3]WANG S B,QUATTONI A,MORENCY L P,et al.HiddenConditional Random Fields for Gesture Recognition[C]//2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006).New York:IEEE,2006.
[4]LEE H K,KIM J H.An HMM-Based Threshold Model Approach for Gesture Recognition[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,1999,21(10):961-973.
[5]BRETZNER L,LAPTEV I,LINDEBERG T.Hand Gesture Recognition using Multi-Scale Colour Features,Hierarchical Models and Particle Filtering[C]//Face and Gesture FG’02.IEEE Computer Society,2002.
[6]SUN Y,LI C,LI G,et al.Gesture Recognition Based on Kinect and sEMG Signal Fusion[J].Mobile Networks and Applications,2018,23(4):797-805.
[7]YANG L,HU G M,HUANG D F,et al.Static Gesture Recognition Algorithm of Combinating Skin Color Segmentation with ELM[J].Journal of Guangxi University(Nat Sci Ed),2015,40(2):444-450.
[8]ZHANG Q Y,WANG D D,ZHANG M Y,et al.Hand Gesture Recognition Based on Bag of Features and Support Vector Machine[J].Journal of Computer Applications,2012,32(12):3392-3396.
[9]LATORRECARMONA P,PLA F.3D human gesture recognition using integral imaging[J].Spienewsroom,2018,12(1):15-20.
[10]CORNEANU C,NOROOZI F,KAMINSKA D,et al.Survey on Emotional Body Gesture Recognition[J/OL].IEEE Transactions on Affective Computing.https://ieeexploreieee.org/document/8493586.
[11]HAO Y,BAI Y P,ZHANG X F,et al.Application of Convolution Neural Network in SAR Target Recognition[J].Journal of Chongqing University of Technology(Natural Science),2018,32(5):204-209.
[12]HUANG G,LIU Z,VAN DER MAATEN L,et al.DenselyConnected Convolutional Networks[C]//IEEE.The IEEE Conference on Computer Vision and Pattern Recognition (CPVR).Las Vegas:IEEE,2017:4700-4708.
[13]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single ShotMultiBox Detector[C]//European Conference on Computer Vision.2016:21-37.
[14]XIE S,GIRSHICK R B,DOLLAR P,et al.Aggregated Residual Transformations for Deep Neural Networks[C]//Computer Vision and Pattern Recognition.2017:5987-5995.
[15]WU,XIAO Y.A Hand Gesture Recognition Algorithm Based on DC-CNN[J].Multimedia Tools and Applications,2019:1-13.
[16]HE Y,LI G,LIAO Y,et al.Gesture Recognition Based on An Improved Local Sparse Representation Classification Algorithm[J].Cluster Computing,2019,22(10):935-946.
[17]PI S Y,TANG H,XIAO N F.Fully Convolutional Deep Lear-ning Model Based Graspable Object etection[J].Journal of Chongqing University of Technology(Natural Science),2018,32(2):166-173.
[18]LATORRECARMONA P,PLA F.3D Human Gesture Recognition Using Integral Imaging[J].Spienewsroom,2018,12(1):15-20.
[19]GUYON I,ATHITSOS V,JANGYODSUK P,et al.ChaLearn Gesture Challenge:Design and First Results[C]//Computer Vision & Pattern Recognition Workshops.2012.
[20]XIN S,HEE-SEUNG K,KOMATSU S,et al.Spatial-temporal Human Gesture Recognition Under Degraded Conditions Using Three-dimensional Integral Imaging[J].Optics Express,2018,26(11):13938.
[1] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[2] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] WEI Kai-xuan, FU Ying. Re-parameterized Multi-scale Fusion Network for Efficient Extreme Low-light Raw Denoising [J]. Computer Science, 2022, 49(8): 120-126.
[5] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[6] LI Yao, LI Tao, LI Qi-fan, LIANG Jia-rui, Ibegbu Nnamdi JULIAN, CHEN Jun-jie, GUO Hao. Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network [J]. Computer Science, 2022, 49(8): 257-266.
[7] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[8] WANG Xin-tong, WANG Xuan, SUN Zhi-xin. Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network [J]. Computer Science, 2022, 49(8): 314-322.
[9] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[10] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[11] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[12] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[13] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[14] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[15] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!