Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250600147-5.doi: 10.11896/jsjkx.250600147

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Multi-layer Graph Convolutional Action Recognition Method Based on Topological Information

HUANG Haixin, HE Tianyu, HOU Guangshuai   

  1. School of Automation and Electrical Engineering,Shenyang Ligong University,Shenyang 110159,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:HUANG Haixin,born in 1973,Ph.D,associate professor.Her main research interests include machine learning,artificial intelligence and intelligent grid.
  • Supported by:
    Key R&D Program of China(2022YFC3302500).

Abstract: Human action recognition achieves the identification of human behaviors by analyzing spatiotemporal features in vi-deos.As one of the important research topics in the field of computer vision,its efficient and accurate recognition performance has demonstrated wide application value in various scenarios such as human-computer interaction and intelligent security.Graph Convolutional Networks(GCNs),owing to their significant advantages in modeling human skeletal topology,have become a mainstream method for action recognition tasks.However,existing approaches generally adopt a unified modeling of the entire skeleton structure,overlooking the hierarchical characteristics of the human body composed of multiple functional regions.This limitation restricts model performance in complex action recognition tasks.To address these,this paper proposes a Topology-informed Multi-layer Graph Convolutional Network(TMGCN).The model employs a multi-branch architecture to partition and model the human skeleton,effectively capturing spatial dependencies between skeletal nodes.Additionally,it introduces a Topology Perception Unit(TPU) to extract and integrate topological features during graph convolution,enhancing the model's representation capability for skeletal topology.Experimental results based on NTU-RGB+D dataset show that TM-GCN has achieved excellent performance in human skeletal action recognition tasks,and effectively improved the accuracy of action recognition.

Key words: Action recognition, Skeleton modality, Graph convolutional network, Topology-aware, Computer vision

CLC Number: 

  • TP183
[1] PARK J Y,KIM J H.Online incremental classification reso-nance network and its application to human-robot interaction[J].IEEE Transactions on Neural Networks and Learning Systems,2019,31(5):1426-1436.
[2] CAO Z,SIMON T,WEI S E,et al.Realtime multi-person 2Dpose estimation using part affinityfields[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7291-7299.
[3] YAN S,XIONG Y,LIN D.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2018.
[4] SHI L,ZHANG Y,CHENG J,et al.Two-stream adaptive graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:12026-12035.
[5] TANG Y,TIAN Y,LU J,et al.Deep progressive reinforcement learning for skeleton-based action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5323-5332.
[6] LEE J,LEE M,LEE D,et al.Hierarchically decomposed graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:10444-10453.
[7] CHENG K,ZHANG Y,HE X,et al.Skeleton-based action recognition with shift graph convolutional network[C]//Procee-dings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:183-192.
[8] HEDEGAARD L,HEIDARI N,IOSIFIDIS A.Online skeleton-based action recognition with continual spatio-temporal graph convolutional networks[J].arXiv:2203.11009,2022.
[9] VELICˇKOVIĆ P,CUCURULL G,CASANOVA A,et al.Graph attention networks[J].arXiv:1710.10903,2017.
[10] YING C,CAI T,LUO S,et al.Do transformers really perform badly for graph representation?[J].Advances in Neural Information Processing Systems,2021,34:28877-28888.
[11] CHENG K,ZHANG Y,CAO C,et al.Decoupling GCN withdropgraph module for skeleton-based action recognition[C]//European Conference on Computer Vision.2020:536-553.
[12] ZHOU Y,YAN X,CHENG Z Q,et al.BlockGCN:Redefine Topology Awareness for Skeleton-Based Action Recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Seattle,WA,USA:IEEE,2024:2049-2058.
[13] TIAN Q,YU J J,ZHANG Z.Skeleton-Based Action Recogni-tion Combining Adaptive Local Graph Convolution and Multi-Scale Temporal Modeling[J].Computer Applications and Research,2025,42(7):2199-2205.
[14] CHEN H,SHEN Y,ZHANG Y,et al.Skeleton-Based Action Recognition through Dual-Granularity Feature Fusion with Self-Adapting Graph Convolution and Multi-Scale Temporal Convolution[J].Neurocomputing,2025,639:130261.
[15] YAN S J,XIONG Y J,LIN D H.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2018:7444-7452.
[16] LIN L,ZHANG J,LIU J.Actionlet-dependent contrastive learning skeleton-based action for unsupervised recognition[C]//Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Vancouver,BC,Canada,2023:2363-2372.
[17] HUA Y,WU W,ZHENG C,et al.Part aware contrastive learning for self-supervised action recognition[C]//Proceedings of the Thirty Second International Joint Conference on Artificial Intelligence.2023:855-863.
[18] ZHU Y S,HAN H,YU Z T,et al.Modeling the relative visual tempo for self-supervised skeleton-based action recognition[C]//2023 IEEE/CVF International Conference on Computer Vision(ICCV).2023:13867-13876.
[19] SHI L,ZHANG Y F,CHENG J,et al.Two stream adaptivegraph convolutional networks for skeleton-based action recognition[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach,2019:12018-12027.
[20] CHI H G,HA M H,CHI S G,et al.InfoGCN:Representation learning for human skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:20186-20196.
[21] LEE J,LEE M,LEE D,et al.Hierarchically decomposed graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:10444-10453.
[22] ZHOU H Y,LIU Q J,WANG Y H.Learning discriminative representations for skeleton-based action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:10608-10617.
[1] HUANG Haixin, HOU Guangshuai, HE Tianyu. SeguGAN:Research on Super-resolution Reconstruction of License Plate Images UtilizingGenerative Adversarial Networks [J]. Computer Science, 2026, 53(6A): 250600070-5.
[2] CHEN Nuo, ZHAO Peng, HUAN Haisheng. Review of Small Object Detection Based on Deep Learning [J]. Computer Science, 2026, 53(6A): 250700022-9.
[3] GAO Tai, REN Yanzhang, WANG Huiqing, LI Ying, WANG Bin. KGMamba:Gene Regulatory Network Prediction Model Based on Kolmogorov-Arnold Network Optimizing Graph Convolutional Network and Mamba [J]. Computer Science, 2026, 53(4): 101-111.
[4] PENG Juhong, ZHANG Zhengyue, DING Zixu, FAN Xinyu, HU Changyu, ZHAO Mingjun. Multi-view Local Language Feature and Global Feature Fusion for Conversational Aspect-based Sentiment Quadruple Analysis [J]. Computer Science, 2026, 53(4): 384-392.
[5] ZHAO Binbei, ZHU Li, ZHAO Hongli, LI Yutong. Computer Vision Applications in Rail Transit Systems [J]. Computer Science, 2026, 53(3): 214-224.
[6] ZHAI Jie, LI Yanhao, CHEN Lexuan, GUO Weibin. Dynamic Recommendation of Personalized Hands-on Learning Materials Based on LightweightEducational LLMs [J]. Computer Science, 2026, 53(2): 48-56.
[7] CHEN Haitao, LIANG Junwei, CHEN Chen, WANG Yufan, ZHOU Yu. Multimodal Physical Education Data Fusion via Graph Alignment for Action Recognition [J]. Computer Science, 2026, 53(2): 89-98.
[8] CHANG Xuanwei, DUAN Liguo, CHEN Jiahao, CUI Juanjuan, LI Aiping. Method for Span-level Sentiment Triplet Extraction by Deeply Integrating Syntactic and Semantic
Features
[J]. Computer Science, 2026, 53(2): 322-330.
[9] LIU Wei, XU Yong, FANG Juan, LI Cheng, ZHU Yujun, FANG Qun, HE Xin. Multimodal Air-writing Gesture Recognition Based on Radar-Vision Fusion [J]. Computer Science, 2025, 52(9): 259-268.
[10] HU Hailong, XU Xiangwei, LI Yaqian. Drug Combination Recommendation Model Based on Dynamic Disease Modeling [J]. Computer Science, 2025, 52(9): 96-105.
[11] WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[12] LI Mengxi, GAO Xindan, LI Xue. Two-way Feature Augmentation Graph Convolution Networks Algorithm [J]. Computer Science, 2025, 52(7): 127-134.
[13] SU Zhiyuan, ZHAO Lixu, HAO Zhiheng, BAI Rufeng. Suvery of Artificial Intelligence Ensuring eVTOL Flight Safety in the Context of Low-altitudeEconomy [J]. Computer Science, 2025, 52(6A): 250200050-13.
[14] GAO Junyi, ZHANG Wei, LI Zelin. YOLO-BFEPS:Efficient Attention-enhanced Cross-scale YOLOv10 Fire Detection Model [J]. Computer Science, 2025, 52(6A): 240800134-9.
[15] BIAN Hui, MENG Changqian, LI Zihan, CHEN Zihaoand XIE Xuelei. Continuous Sign Language Recognition Based on Graph Convolutional Network and CTC/Attention [J]. Computer Science, 2025, 52(6A): 240400098-9.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!