Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 240400064-6.doi: 10.11896/jsjkx.240400064

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Deep Gait Recognition Network Based on Relative Position Encoding Transformer

REN Yuheng1, ZHAO Yunfeng2, WU Chuang3   

  1. 1 Watrix.AI,Beijing 100083,China
    2 North Pipeline Co.,Ltd.,National Pipe Network Group,Beijing 100084,China
    3 The General Staff of Hunan Provinical Corps of the Chinese People's Armed Police Force,Changsha 410000,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:REN Yuheng,born in 1993,postgra-duate.His main research interests include computer vision and deep lear-ning.

Abstract: Gait recognition is a rapidly evolving long-range biometric identification technique that has wide applications and advantages in various scenarios,including long distances,non-intrusive setups,and cross-view angles.Traditional biometrics identification technique,such as fingerprint recognition and facial recognition,often require close proximity or specific conditions to be effective,while gait recognition technology breaks through these limitations,making it possible to identify individuals in a wider range of environments.Previous research predominantly employed lightweight neural networks for gait feature extraction and achieved significant progress on popular datasets like CASIA-B,which feature cross-view angles and varying attires.However,experimental results indicate a substantial decline in recognition accuracy when simply stacking neural network layers on the CASIA-B dataset.A deep gait recognition network has been proposed,incorporating the relative position encoding transformer module.This module aims to avoid the pitfall of “local feature association"”and enables continuous learning of temporal features within gait sequences.Compared to current mainstream approaches,the proposed method has garnered enhanced identification precision across indoor environments,as exemplified by the CASIA-B and OUMVLP datasets,alongside outdoor settings typified by the Gait3D dataset.Especially in the task of clothes changing,wherein our method surpasses benchmark approaches by of 1.9%,achieving a recognition rate of 85.5%.

Key words: Gait recognition mechanism, Self-attention mechanism, Relative position modeling, Pattern recognition, Deep network

CLC Number: 

  • TP391.41
[1]LARSEN P,SIMONSEN E,LYNNERUP N.Gait analysis inforensic medicine[J].Journal of Forensic Sciences,2008,53:1149-1153.
[2]BOUCHRIKA I,GOFFREDO M,CARTER J,et al.On using gait in forensic biometrics [J].Journal of Forensic Sciences,2011,56:882-889.
[3]CHAO H Q,HE Y W,ZHANG J P,et al.Gaitset:Regarding gait as a set for cross-view gait recognition [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:8126-8133.
[4]HERMANS A,BEYER L,LEIBE B.In defense of the tripletloss for person re-identification [J].arXiv:1703.07737,2017.
[5]FAN C,PENG Y J,CAO C S,et al.Gaitpart:Temporal part-based model for gait recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2020:14225-14233.
[6]LIN B B,ZHANG S L,YU X.Gait recognition via effective global-local feature representation and local temporal aggregation[C]//Proceedings of the IEEE International Conference on Computer Vision.2021:14648-14656.
[7]HUANG Z,XUE D X,SHEN X,et al.3D local convolutionalneural networks for gait recognition [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2021:14920-14929.
[8]YU S Q,TAN D L,TAN T N.A framework for evaluating the effect of view angle,clothing and carrying condition on gait re-cognition [C]//The International Conference on Pattern Recognition.2006:441-444.
[9]LOPER M,MAHMOOD N,ROMERO J,et al.SMPL:Askinned multi-person linear model [J].ACM Transactions on Graphics,2015:1-16.
[10]SHIRAGA K,MAKIHARA Y,MURAMATSU D,et al.Geinet:Viewinvariant gait recognition using a convolutional neural network [C]//International Conference on Biometrics.2016:1-8.
[11]WU Z F,HUANG Y Z,WANG L,et al.A comprehensive study on crossview gait based human identification with deep cnns [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,39:209-226.
[12]HOU S H,CAO C S,LIU X,et al.Gait lateral network:Lear-ning discriminative and compact representations for gait recognition [C]//Proceedings of the European Conference on Computer Vision.2020:382-398.
[13]WANG X L,GIRSHICK R,GUPTA A,et al.Non-local neural networks [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7794-7803.
[14]HU J,SHEN L,ALBANIE S,et al.Squeeze-and-excitation net-works [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[15]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need [C]//Proceedings of the 31st International Confe-rence on in Neural Information Processing Systems.2017:6000-6010.
[16]SU J L,LU Y,PAN S F,et al.Roformer:Enhanced transformer with rotary position embedding [J].arXiv:2104.09864,2021.
[17]TAKEMURA N,MAKIHARA Y,MURAMATSU D,et al.Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition[J].IPSJ transactions on Computer Vision and Applications,2018,10:1-14.
[18]ZHENG J K,LIU X C,LIU W,et al.Gait recognition in thewild with dense 3d representations and a benchmark [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2022:20228-20237.
[19]LIN B B,ZHANG S L,BAO F.Gait recognition with multiple-temporal-scale 3d convolutional neural net-work [C]//Procee-dings of the 28th ACM International Conference on Multimedia.2020:3054-3062.
[20]FAN C,LIANG J H,SHEN C F,et al.OpenGait:RevisitingGait Recognition Toward Better Practicality [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Re-cognition.2023:9707-9716.
[21]DOU H Z,ZHANG P Y,ZHAO Y H,et al.Gaitmpl:Gait recognition with memory-augmented progressive learning[J].IEEE Transactions on Image Processing,2022,33:1464-1475.
[22]CHAI T R,LI A N,ZHANG S X,et al.Lagrange motion analysis and view embeddings for improved gait recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2022:20249-20258.
[1] LI Jiaying, LIANG Yudong, LI Shaoji, ZHANG Kunpeng, ZHANG Chao. Study on Algorithm of Depth Image Super-resolution Guided by High-frequency Information ofColor Images [J]. Computer Science, 2024, 51(7): 197-205.
[2] QUE Yue, GAN Menghan, LIU Zhiwei. Object Detection with Receptive Field Expansion and Multi-branch Aggregation [J]. Computer Science, 2024, 51(6A): 230600151-6.
[3] LIU Xiaohu, CHEN Defu, LI Jun, ZHOU Xuwen, HU Shan, ZHOU Hao. Speaker Verification Network Based on Multi-scale Convolutional Encoder [J]. Computer Science, 2024, 51(6A): 230700083-6.
[4] ZHANG Lanxin, XIANG Ling, LI Xianze, CHEN Jinpeng. Intelligent Fault Diagnosis Method for Rolling Bearing Based on SAMNV3 [J]. Computer Science, 2024, 51(6A): 230700167-6.
[5] BAI Jie, TIAN Ruili, REN Yifu, YUAN Jianxia. Low-rank HOG Voice Detection Method for Short-wave Communication [J]. Computer Science, 2024, 51(6A): 230600115-5.
[6] GAO Jianlei, LUO Minxia. Similarity Measure Between Picture Fuzzy Sets and Its Application in Pattern Recognition [J]. Computer Science, 2024, 51(6A): 230500153-5.
[7] ZHANG Feng, HUANG Shixin, HUA Qiang, DONG Chunru. Novel Image Classification Model Based on Depth-wise Convolution Neural Network andVisual Transformer [J]. Computer Science, 2024, 51(2): 196-204.
[8] XU Junwen, CHEN Zonglei, LI Tianrui, LI Chongshou. Time Series Prediction of Hybrid Neural Networks Based on Seasonal Decomposition [J]. Computer Science, 2024, 51(11A): 231200008-7.
[9] ZHOU Xueyang, FU Qiming, CHEN Jianping, LU You, WANG Yunzhe. Chemical-induced Disease Relation Extraction:Graph Reasoning Method Based on Evidence Focusing [J]. Computer Science, 2024, 51(10): 351-361.
[10] HUANG Chunli, LIU Guimei, JIANG Wenjun, LI Kenli, ZHANG Ji, TAK-SHING Peter Yum. Learning Pattern Recognition and Performance Prediction Method Based on Learners'Behavior Evolution [J]. Computer Science, 2024, 51(10): 67-78.
[11] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[12] YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[13] LI Fan, JIA Dongli, YAO Yumin, TU Jun. Graph Neural Network Few Shot Image Classification Network Based on Residual and Self-attention Mechanism [J]. Computer Science, 2023, 50(6A): 220500104-5.
[14] DOU Zhi, HU Chenguang, LIANG Jingyi, ZHENG Liming, LIU Guoqi. Lightweight Target Detection Algorithm Based on Improved Yolov4-tiny [J]. Computer Science, 2023, 50(6A): 220700006-7.
[15] YANG Bin, LIANG Jing, ZHOU Jiawei, ZHAO Mengci. Study on Interpretable Click-Through Rate Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(5): 12-20.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!