Computer Science ›› 2023, Vol. 50 ›› Issue (11A): 221000048-7.doi: 10.11896/jsjkx.221000048

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

2D Human Pose Estimation Based on Adaptive Estimation

ZHENG Quanshi1, JIN Cheng1,2   

  1. 1 School of Computer Science,Fudan University,Shanghai 200438,China
    2 Peng Cheng Laboratory,Shenzhen,Guangdong 518066,China
  • Published:2023-11-09
  • About author:ZHENG Quanshi,born in 1994,postgraduate,is a member of China Computer Federation.His main research interests include human pose estimation and action recognition.
    JIN Cheng,born in 1978,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include computer vision and multimedia information retrieval.
  • Supported by:
    Shanghai Municipal Science and Technology Commission(22dz1204900).

Abstract: The regression-based 2D human pose estimation methods directly predict the coordinates of human keypoints.The transformer can effectively establish the relationship between human body parts,and its application significantly improves the accuracy of the regression-based methods.However,related methods have the following two problems:1)In the cross-attention module,for different images,the fixed query can not properly focus on different keypoint regions,which leads to distraction.2)They directly learn the labeled keypoint coordinates and overfit annotations.In this paper,a pose estimation model based on adaptive prediction is proposed to solve these two problems.For the first problem,the model adaptively predicts the region of attention of the query and directs the attention to that region.For the second problem,the model adaptively predicts the probability distribution of keypoint appearing in every position,and alleviates the model's overfitting to annotations by means of soft prediction.Experiments on the MS-COCO dataset show that the model improves the accuracy of the baseline method by 2.8% and improves the highest accuracy of related methods by 0.2%.

Key words: 2D human pose estimation, Regression-based, Adaptive, Region of attention, Probability distribution

CLC Number: 

  • TP391
[1]ANDRILUKA M,PISHCHULIN L,GEHLER P,et al.2d human pose estimation:New benchmark and state of the art analysis[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:3686-3693.
[2]TOSHEV A,SZEGEDY C.Deeppose:Human pose estimationvia deep neural networks[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2014:1653-1660.
[3]PISHCHULIN L,ANDRILUKA M,GEHLER P,et al.Poselet conditioned pictorial structures[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2013:588-595.
[4]WANG Y,MORI G.Multiple tree models for occlusion and spatial constraints in human pose estimation[C]//Proceedings of the European Conference on Computer Vision.2008:710-724.
[5]SUN M,KOHLI P,SHOTTON J.Conditional regression forests for human pose estimation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2012:3394-3401.
[6]SUN K,XIAO B,LIU D,et al.Deep high-resolution representa-tion learning for human pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern recognition.2019:5693-5703.
[7]HUANG J,ZHU Z,GUO F,et al.The devil is in the details:Delving into unbiased data processing for human pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:5700-5709.
[8]VASWANI A,SHAZEER N,PARMARN,et al.Attention is all you need[J].arXiv:1706.03762,2017.
[9]LI K,WANG S,ZHANG X,et al.Pose recognition with cascade transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:1944-1953.
[10]TIAN Z,CHEN H,SHEN C.Directpose:Direct end-to-endmulti-person pose estimation[J].arXiv:1911.07451,2019.
[11]SUN X,XIAO B,WEI F,et al.Integral human pose regression[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:529-545.
[12]LI J,BIAN S,ZENG A,et al.Human pose regression with residual log-likelihood estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:11025-11034.
[13]MENG D,CHEN X,FAN Z,et al.Conditional detr for fasttraining convergence[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:3651-3660.
[14]CARION N,MASSA F,SYNNAEVE G,et al.End-to-end object detection with transformers[C]//European Conference on Computer Vision.Cham:Springer,2020:213-229.
[15]LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft coco:Common objects in context[C]//European Conference on Computer Vision.Cham:Springer,2014:740-755.
[16]LOSHCHILOV I,HUTTER F.Decoupled weight decay regularization[J].arXiv:1711.05101,2017.
[17]GLOROT X,BENGIO Y.Understanding the difficulty of trai-ning deep feedforward neural networks[C]//Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics.JMLR Workshop and Conference Proceedings,2010:249-256.
[18]XIAO B,WU H,WEI Y.Simple baselines for human pose estimation and tracking[C]//Proceedings of the European Confe-rence on Computer Vision(ECCV).2018:466-481.
[19]ZHOU X,WANG D,KRÄHENBÜHL P.Objects as points[J].arXiv:1904.07850,2019.
[20]WEI F,SUN X,LI H,et al.Point-set anchors for object detection,instance segmentation and pose estimation[C]//European Conference on Computer Vision.Cham:Springer,2020:527-544.
[21]NIE X,FENG J,ZHANG J,et al.Single-stage multi-person posemachines[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6951-6960.
[1] LIU Xuanyu, ZHANG Shuai, HUO Shumin, SHANG Ke. Microservice Moving Target Defense Strategy Based on Adaptive Genetic Algorithm [J]. Computer Science, 2023, 50(9): 82-89.
[2] ZHOU Bo, JIANG Peifeng, DUAN Chang, LUO Yuetong. Study on Single Background Object Detection Oriented Improved-RetinaNet Model and Its Application [J]. Computer Science, 2023, 50(7): 137-142.
[3] DAI Xuesong, LI Xiaohong, ZHANG Jingjing, QI Meibin, LIU Yimin. Unsupervised Domain Adaptive Pedestrian Re-identification Based on Counterfactual AttentionLearning [J]. Computer Science, 2023, 50(7): 160-166.
[4] HENG Hongjun, ZHOU Wenhua. Anomaly Detection Method Based on Context Information Fusion and Noise Adaptation [J]. Computer Science, 2023, 50(7): 237-245.
[5] ZHANG Lianfu, TAN Zuowen. Robust Federated Learning Algorithm Based on Adaptive Weighting [J]. Computer Science, 2023, 50(6A): 230200188-9.
[6] LYU Wenrui, PU Yuanyuan, ZHAO Zhengpeng, XU Dan, QIAN Wenhua. Font Transfer Based on Glyph Perception and Attentive Normalization [J]. Computer Science, 2023, 50(6A): 220100205-6.
[7] LI Yuge, WANG Tianjing, SHEN Hang, LUO Xiaokang, BAI Guangwei. Anti-interference Multiuser Detection Algorithm Based on Variable Step Size Adaptive Matching Pursuit in Grant-free NOMA System [J]. Computer Science, 2023, 50(5): 322-328.
[8] LIU Yifan, OU Bo, XIONG Jianqin. Adaptive Image Adversarial Reprogramming Based on Noise Invisibility Factors [J]. Computer Science, 2023, 50(4): 110-116.
[9] SHANG Yuye, YUAN Jiabin. Task Offloading Method Based on Cloud-Edge-End Cooperation in Deep Space Environment [J]. Computer Science, 2023, 50(2): 80-88.
[10] XUE Jianbin, WANG Jiahao. Design of Adaptive Hybrid Precoder in mmWave MU-MIMO Systems [J]. Computer Science, 2023, 50(11A): 221200047-5.
[11] WANG Ling, HUANG Guan, WANG Peng, BAI Yane, QIU Tianheng. Study on Scale Adaptive Target Detection Algorithm Based on Improved D2Det [J]. Computer Science, 2023, 50(11A): 221100247-9.
[12] YANG Xiuzhang, WU Shuai, LI Na, YANG Wenwen, LIAO Wenjing, ZHOU Jisong. YOLOv3 Vehicle Recognition Algorithm for Adaptive Dehazing in Complex Environments [J]. Computer Science, 2023, 50(11A): 220700147-8.
[13] XU Jie, ZHOU Xinzhi. Multi-elite Interactive Learning Based Particle Swarm Optimization Algorithm with Adaptive Bound-handling Technique [J]. Computer Science, 2023, 50(11): 210-219.
[14] WEI Hongxu, LONG Sheng, TAO Wei, TAO Qing. Adaptive Heavy-Ball Momentum Method Based on AdaGrad+ and Its Optimal Individual Convergence [J]. Computer Science, 2023, 50(11): 220-226.
[15] LI Haochen, CAO Fuyuan, QIAO Shichang. Unbiased Scene Graph Generation Based on Adaptive Regularization Algorithm [J]. Computer Science, 2023, 50(10): 104-111.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!