基于拓扑信息的多层图卷积动作识别方法

doi:10.11896/jsjkx.250600147

Abstract

Abstract: Human action recognition achieves the identification of human behaviors by analyzing spatiotemporal features in vi-deos.As one of the important research topics in the field of computer vision,its efficient and accurate recognition performance has demonstrated wide application value in various scenarios such as human-computer interaction and intelligent security.Graph Convolutional Networks(GCNs),owing to their significant advantages in modeling human skeletal topology,have become a mainstream method for action recognition tasks.However,existing approaches generally adopt a unified modeling of the entire skeleton structure,overlooking the hierarchical characteristics of the human body composed of multiple functional regions.This limitation restricts model performance in complex action recognition tasks.To address these,this paper proposes a Topology-informed Multi-layer Graph Convolutional Network(TMGCN).The model employs a multi-branch architecture to partition and model the human skeleton,effectively capturing spatial dependencies between skeletal nodes.Additionally,it introduces a Topology Perception Unit(TPU) to extract and integrate topological features during graph convolution,enhancing the model's representation capability for skeletal topology.Experimental results based on NTU-RGB+D dataset show that TM-GCN has achieved excellent performance in human skeletal action recognition tasks,and effectively improved the accuracy of action recognition.

Key words: Action recognition, Skeleton modality, Graph convolutional network, Topology-aware, Computer vision

CLC Number:

TP183

HUANG Haixin, HE Tianyu, HOU Guangshuai. Multi-layer Graph Convolutional Action Recognition Method Based on Topological Information[J].Computer Science, 2026, 53(6A): 250600147-5.

References

[1] PARK J Y,KIM J H.Online incremental classification reso-nance network and its application to human-robot interaction[J].IEEE Transactions on Neural Networks and Learning Systems,2019,31(5):1426-1436.
[2] CAO Z,SIMON T,WEI S E,et al.Realtime multi-person 2Dpose estimation using part affinityfields[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7291-7299.
[3] YAN S,XIONG Y,LIN D.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2018.
[4] SHI L,ZHANG Y,CHENG J,et al.Two-stream adaptive graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:12026-12035.
[5] TANG Y,TIAN Y,LU J,et al.Deep progressive reinforcement learning for skeleton-based action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5323-5332.
[6] LEE J,LEE M,LEE D,et al.Hierarchically decomposed graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:10444-10453.
[7] CHENG K,ZHANG Y,HE X,et al.Skeleton-based action recognition with shift graph convolutional network[C]//Procee-dings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:183-192.
[8] HEDEGAARD L,HEIDARI N,IOSIFIDIS A.Online skeleton-based action recognition with continual spatio-temporal graph convolutional networks[J].arXiv:2203.11009,2022.
[9] VELICˇKOVIĆ P,CUCURULL G,CASANOVA A,et al.Graph attention networks[J].arXiv:1710.10903,2017.
[10] YING C,CAI T,LUO S,et al.Do transformers really perform badly for graph representation?[J].Advances in Neural Information Processing Systems,2021,34:28877-28888.
[11] CHENG K,ZHANG Y,CAO C,et al.Decoupling GCN withdropgraph module for skeleton-based action recognition[C]//European Conference on Computer Vision.2020:536-553.
[12] ZHOU Y,YAN X,CHENG Z Q,et al.BlockGCN:Redefine Topology Awareness for Skeleton-Based Action Recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Seattle,WA,USA:IEEE,2024:2049-2058.
[13] TIAN Q,YU J J,ZHANG Z.Skeleton-Based Action Recogni-tion Combining Adaptive Local Graph Convolution and Multi-Scale Temporal Modeling[J].Computer Applications and Research,2025,42(7):2199-2205.
[14] CHEN H,SHEN Y,ZHANG Y,et al.Skeleton-Based Action Recognition through Dual-Granularity Feature Fusion with Self-Adapting Graph Convolution and Multi-Scale Temporal Convolution[J].Neurocomputing,2025,639:130261.
[15] YAN S J,XIONG Y J,LIN D H.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2018:7444-7452.
[16] LIN L,ZHANG J,LIU J.Actionlet-dependent contrastive learning skeleton-based action for unsupervised recognition[C]//Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Vancouver,BC,Canada,2023:2363-2372.
[17] HUA Y,WU W,ZHENG C,et al.Part aware contrastive learning for self-supervised action recognition[C]//Proceedings of the Thirty Second International Joint Conference on Artificial Intelligence.2023:855-863.
[18] ZHU Y S,HAN H,YU Z T,et al.Modeling the relative visual tempo for self-supervised skeleton-based action recognition[C]//2023 IEEE/CVF International Conference on Computer Vision(ICCV).2023:13867-13876.
[19] SHI L,ZHANG Y F,CHENG J,et al.Two stream adaptivegraph convolutional networks for skeleton-based action recognition[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach,2019:12018-12027.
[20] CHI H G,HA M H,CHI S G,et al.InfoGCN:Representation learning for human skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:20186-20196.
[21] LEE J,LEE M,LEE D,et al.Hierarchically decomposed graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:10444-10453.
[22] ZHOU H Y,LIU Q J,WANG Y H.Learning discriminative representations for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:10608-10617.

Related Articles 15

[1]	HUANG Haixin, HOU Guangshuai, HE Tianyu. SeguGAN:Research on Super-resolution Reconstruction of License Plate Images UtilizingGenerative Adversarial Networks [J]. Computer Science, 2026, 53(6A): 250600070-5.
[2]	CHEN Nuo, ZHAO Peng, HUAN Haisheng. Review of Small Object Detection Based on Deep Learning [J]. Computer Science, 2026, 53(6A): 250700022-9.
[3]	GAO Tai, REN Yanzhang, WANG Huiqing, LI Ying, WANG Bin. KGMamba:Gene Regulatory Network Prediction Model Based on Kolmogorov-Arnold Network Optimizing Graph Convolutional Network and Mamba [J]. Computer Science, 2026, 53(4): 101-111.
[4]	PENG Juhong, ZHANG Zhengyue, DING Zixu, FAN Xinyu, HU Changyu, ZHAO Mingjun. Multi-view Local Language Feature and Global Feature Fusion for Conversational Aspect-based Sentiment Quadruple Analysis [J]. Computer Science, 2026, 53(4): 384-392.
[5]	ZHAO Binbei, ZHU Li, ZHAO Hongli, LI Yutong. Computer Vision Applications in Rail Transit Systems [J]. Computer Science, 2026, 53(3): 214-224.
[6]	ZHAI Jie, LI Yanhao, CHEN Lexuan, GUO Weibin. Dynamic Recommendation of Personalized Hands-on Learning Materials Based on LightweightEducational LLMs [J]. Computer Science, 2026, 53(2): 48-56.
[7]	CHEN Haitao, LIANG Junwei, CHEN Chen, WANG Yufan, ZHOU Yu. Multimodal Physical Education Data Fusion via Graph Alignment for Action Recognition [J]. Computer Science, 2026, 53(2): 89-98.
[8]	CHANG Xuanwei, DUAN Liguo, CHEN Jiahao, CUI Juanjuan, LI Aiping. Method for Span-level Sentiment Triplet Extraction by Deeply Integrating Syntactic and Semantic Features [J]. Computer Science, 2026, 53(2): 322-330.
[9]	LIU Wei, XU Yong, FANG Juan, LI Cheng, ZHU Yujun, FANG Qun, HE Xin. Multimodal Air-writing Gesture Recognition Based on Radar-Vision Fusion [J]. Computer Science, 2025, 52(9): 259-268.
[10]	HU Hailong, XU Xiangwei, LI Yaqian. Drug Combination Recommendation Model Based on Dynamic Disease Modeling [J]. Computer Science, 2025, 52(9): 96-105.
[11]	WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[12]	LI Mengxi, GAO Xindan, LI Xue. Two-way Feature Augmentation Graph Convolution Networks Algorithm [J]. Computer Science, 2025, 52(7): 127-134.
[13]	SU Zhiyuan, ZHAO Lixu, HAO Zhiheng, BAI Rufeng. Suvery of Artificial Intelligence Ensuring eVTOL Flight Safety in the Context of Low-altitudeEconomy [J]. Computer Science, 2025, 52(6A): 250200050-13.
[14]	GAO Junyi, ZHANG Wei, LI Zelin. YOLO-BFEPS:Efficient Attention-enhanced Cross-scale YOLOv10 Fire Detection Model [J]. Computer Science, 2025, 52(6A): 240800134-9.
[15]	BIAN Hui, MENG Changqian, LI Zihan, CHEN Zihaoand XIE Xuelei. Continuous Sign Language Recognition Based on Graph Convolutional Network and CTC/Attention [J]. Computer Science, 2025, 52(6A): 240400098-9.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Multi-layer Graph Convolutional Action Recognition Method Based on Topological Information

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0