多流融合的轻量级图卷积行为识别算法

doi:10.11896/jsjkx.220800147

Abstract

Abstract: Traditional action recognition based on RGB-based methods is easy to be affected by problems such as light intensity and viewing angle.Skeleton-based action recognition is less affected by these problems and has become one of the mainstream methods.However,the current skeleton-based action recognition methods have a large number of parameters and slow operation speed.In order to solve these problems,a multi-stream fusion lightweight graph convolution action recognition framework is proposed.Firstly,the data fused with various information of joint,bone,joint motion and bone motion are input into the spatial map convolution module.Secondly,the spatial attention mechanism is added to the spatial graph convolution module to better extract the relationship between the joints.Finally,in the time convolution module,depthwise convolution and pointwise convolution are used to reduce the amount of parameters.Compared with the baseline network SGN,in NTU-RGB+D120 dataset,the proposed network increases by 2.3% under cross-subject evaluation,increases by 1.9% under cross-setup evaluation,and the number of parameters reduces by 0.12×10⁶.The validity of the proposed network is verified.

Key words: Human skeleton, Action recognition, Lightweight, Attention mechanism, Graph convolution

CLC Number:

TP391

LI Hua, ZHAO Lingdi, CHEN Yujie, YANG Yang, DU Xinzhao. Lightweight Graph Convolution Action Recognition Algorithm Based on Multi-streamFusion[J].Computer Science, 2023, 50(11A): 220800147-6.

References

[1]DENG M L,GAO Z D,LI L,et al.Overview of Human Behavior Recognition Based on Deep Learning[J].Computer Engineering and Applications,2022,58(13):14-26.
[2]CAI Q,DENG Y B,LI H S,et al.Survey on Human Action Re-cognition Based on Deep Learning[J].Computer Science,2020,47(4):85-93.
[3]SU B Y,WU H,SHENG M,et al.Accurate Hierarchical Hu-man Actions Recognition From Kinect Skeleton Data[J].IEEE Access,2019,7.
[4]LI M H,XU H J,SHI L X,et al.Multi-person Activity Recognition Based on Bone Keypoints Detection[J].Computer Science,2021,48(4):138-143.
[5]JIANG Q Y,WU X J,XU T Y.M2FA:multi-dimensional feature fusion attention mechanism for skeleton-based action recognition[J].Journal of Image and Graphics,2022,27(8):2391-2403.
[6]LEE J,LEE M,LEE D,et al.Hierarchically Decomposed GraphConvolutional Networks for Skeleton-Based Action Recognition[J].arXiv:2208.10741,2022.
[7]DUAN H,ZHAO Y,XIONG Y,et al.Omni-sourced webly-supervised learning for video recognition[C]//European Confe-rence on Computer Vision.Cham:Springer,2020:670-688.
[8]ATEFE A,ALI N,EBRAHIMI M M.Sparse Deep LSTMs with Convolutional Attention for Human Action Recognition[J].SN Computer Science,2021,2(3).
[9]CHEN Y,ZHANG Z,YUAN C,et al.Channel-wise topology refinement graph convolution for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF International Confe-rence on Computer Vision.2021:13359-13368.
[10]LI C,ZHONG Q,XIE D,et al.Co-occurrence feature learning from skeleton data for action recognition and detection with hierarchical aggregation[J].arXiv:1804.06055,2018.
[11]DU Y,WANG W,WANG L.Hierarchical recurrent neural network for skeleton based action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:1110-1118.
[12]YAN S,XIONG Y,LIN D.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Thirty-se-cond AAAI Conference on Artificial Intelligence.2018.
[13]SHI L,ZHANG Y,CHENG J,et al.Two-stream adaptive graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:12026-12035.
[14]QIN Z Y,LIU Y,JI P,et al.Fusing Higher-Order Features in Graph Neural Networks for Skeleton-Based Action Recognition[J].arXiv:2015.01563,2022.
[15]CHENG K,ZHANG Y,HE X,et al.Skeleton-based action re-cognition with shift graph convolutional network[C]//Procee-dings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2020.
[16]LIU Z,ZHANG H,CHEN Z,et al.Disentangling and unifying graph convolutions for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:143-152.
[17]DUAN H,ZHAO Y,CHEN K,et al.Revisiting skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2022:2969-2978.
[18]ZHANG P,LAN C,ZENG W,et al.Semantics-guided neuralnetworks for efficient skeleton-based human action recognition[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2020.
[19]HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:Efficient convolutional neural networks for mobile vision applications[J].arXiv:1704.04861,2017.
[20]SANDLER M,HOWARD A,ZHU M,et al.Mobilenetv2:Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4510-4520.
[21]HOWARD A,SANDLER M,CHU G,et al.Searching for MobileNetV3[J].arXiv:1905.02244,2019.
[22]WANG Q,WU B,ZHU P,et al.ECA-Net:Efficient channel attention for deep convolutional neural networks[C]//Procee-dings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2020.
[23]SHAHROUDY A,LIU J,NG T T,et al.NTU RGB+D:A Large Scale Dataset for 3D Human Activity Analysis[J].arXiv:1604.02808,2016.
[24]LIU J,AMIR A,LISBOA P M,et al.NTU RGB+D 120:ALarge-Scale Benchmark for 3D Human Activity Understanding[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,42(10).
[25]CHEN Y S,YA J,WEI W,et al.Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack lear-ning network[J].Pattern Recognition,2020,107.
[26]SONG Y F,ZHANG Z,WANG L.Richly Activated Graph Convolutional Network for Action Recognition with Incomplete Skeletons[J].arXiv:1905.06774,2019.
[27]LI M S,CHEN S H,CHEN X,et al.Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition[J].arXiv:1904.12659,2019.
[28]SONG Y F,ZHANG Z,SHAN C,et al.Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition[J].IEEE Transactions on Circuits and Systems for Video Technology,2021,31(5).
[29]PENG W,SHI J,ZHAO G.Spatial temporal graph deconvolutional network for skeleton-based human action recognition[J].IEEE Signal Processing Letters,2021,28:244-248.

Related Articles 15

[1]	LIU Yubo, GUO Bin, MA Ke, QIU Chen, LIU Sicong. Design of Visual Context-driven Interactive Bot System [J]. Computer Science, 2023, 50(9): 260-268.
[2]	YI Liu, GENG Xinyu, BAI Jing. Hierarchical Multi-label Text Classification Algorithm Based on Parallel Convolutional Network Information Fusion [J]. Computer Science, 2023, 50(9): 278-286.
[3]	LUO Yuanyuan, YANG Chunming, LI Bo, ZHANG Hui, ZHAO Xujian. Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension [J]. Computer Science, 2023, 50(9): 287-294.
[4]	HENG Hongjun, MIAO Jing. Fusion of Semantic and Syntactic Graph Convolutional Networks for Joint Entity and Relation Extraction [J]. Computer Science, 2023, 50(9): 295-302.
[5]	LI Ke, YANG Ling, ZHAO Yanbo, CHEN Yonglong, LUO Shouxi. EGCN-CeDML:A Distributed Machine Learning Framework for Vehicle Driving Behavior Prediction [J]. Computer Science, 2023, 50(9): 318-330.
[6]	ZHONG Yue, GU Jieming, CAO Honglin. Survey of Lightweight Block Cipher [J]. Computer Science, 2023, 50(9): 3-15.
[7]	WANG Jiahao, ZHONG Xin, LI Wenxiong, ZHAO Dexin. Human Activity Recognition with Meta-learning and Attention [J]. Computer Science, 2023, 50(8): 193-201.
[8]	WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[9]	ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[10]	TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[11]	ZHANG Xiao, DONG Hongbin. Lightweight Multi-view Stereo Integrating Coarse Cost Volume and Bilateral Grid [J]. Computer Science, 2023, 50(8): 125-132.
[12]	YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[13]	ZHANG Shunyao, LI Huawang, ZHANG Yonghe, WANG Xinyu, DING Guopeng. Image Retrieval Based on Independent Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220300092-6.
[14]	LIU Haowei, YAO Jingchi, LIU Bo, BI Xiuli, XIAO Bin. Two-stage Method for Restoration of Heritage Images Based on Muti-scale Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220600129-8.
[15]	LI Fan, JIA Dongli, YAO Yumin, TU Jun. Graph Neural Network Few Shot Image Classification Network Based on Residual and Self-attention Mechanism [J]. Computer Science, 2023, 50(6A): 220500104-5.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Lightweight Graph Convolution Action Recognition Algorithm Based on Multi-streamFusion

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0