Computer Science ›› 2020, Vol. 47 ›› Issue (11): 128-136.doi: 10.11896/jsjkx.200700061

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Research Advance on 2D Human Pose Estimation

FENG Xiao-yue, SONG Jie   

  1. Software College,Northeastern University,Shenyang 110819,China
  • Received:2020-07-01 Revised:2020-08-28 Online:2020-11-15 Published:2020-11-05
  • About author:FENG Xiao-yue,born in 2000,undergraduate.Her main research interests include big data management and machine learning.
    SONG Jie,born in 1980,Ph.D,professor.His main research interests include big data management,green computing and machine learning.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61672143).

Abstract: Human pose estimation has always been a research hotspot in the field of computer vision.With the continuous improvement of the performance and accuracy of human pose estimation methods,it can be widely used in human-computer interaction,intelligent surveillance and human activity analysis,etc.In this paper,the methods,models and applications of two-dimensional human pose estimation are reviewed and analyzed,and the future research direction is prospected.The introduction of the method is divided into single person and multi-person pose estimation.In terms of the model,it mainly introduces the models based on ResNet,Hourglass and HRNet.In terms of the application,it mainly introduces the application in the field of human-computer interaction and intelligent surveillance.The research prospect is mainly aimed at the expansion of application scenarios.This paper summarizes the research results in recent years and sorts out the possible research directions.

Key words: Hourglass, HRNet, Human pose estimation, Key-point detection, Neural network, ResNet

CLC Number: 

  • TP311
[1] HEN C H,RAMANAN D.3D Human Pose Estimation=2DPose Estimation+Matching[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7035-7043.
[2] LI X H,LIU J F.A Review of the Research on Two-dimensional Human Posture Estimation[J].Modern Computer,2019(22):33-37.
[3] FISCHLER M A,ELSCHLAGER R A.The Representation and Matching of Pictorial Structures[J].IEEE Transactions on Computers,1973,22(1):67-92.
[4] ANDRILUKA M,ROTH S,SCHIELE B,et al.Pictorial structures revisited:People detection and articulated pose estimation[C]//2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2009:1014-1021.
[5] DALAL N,TRIGGS B.Histograms of oriented gradients forhuman detection[C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2005:886-893.
[6] LOWE D G.Distinctive Image Features from Scale-InvariantKeypoints[J].International Journal of Computer Vision,2004,60(2):91-110.
[7] NAGELI T,OBERHOLZER S,PLUSS S,et al.Flycon:real-time environment-independent multi-view human pose estimation with aerial vehicles[C]//International Conference on Computer Graphics and Interactive Techniques.2019.
[8] ACHILLES F,ICHIM A E,COSKUN H,et al.Patient MoCap:Human Pose Estimation Under Blanket Occlusion for Hospital Monitoring Applications[C]//Medical Image Computing and Computer Assisted Intervention.2016:491-499.
[9] WANG J,QIU K,PENG H,et al.AI Coach:Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance[C]//ACM Multimedia.2019:2228-2230.
[10] YANG Y,RAMANAN D.Articulated pose estimation withflexible mixtures-of-parts[C]//The 24th IEEE Conference on Computer Vision and Pattern Recognition.2011:1385-1392.
[11] HE K,ZHANG X,REN S,et al.Delving Deep into Rectifiers:Surpassing Human-Level Performance on ImageNet Classification[C]//International Conference on Computer Vision.2015:1026-1034.
[12] GLOROT X,BENGIO Y.Understanding the difficulty of training deep feedforward neural networks[C]//International Conference on Artificial Intelligence and Statistics.2010:249-256.
[13] SAPP B,TASKAR B.MODEC:Multimodal DecomposableModels for Human Pose Estimation[C]//2013 IEEE Conference on Computer Vision and Pattern Recognition.2013:3674-3681.
[14] GKIOXARI G,ARBELAEZ P,BOURDEV L,et al.Articulated Pose Estimation Using Discriminative Armlet Classifiers[C]//2013 IEEE Conference on Computer Vision and Pattern Recognition.2013:3342-3349.
[15] SAPP B,JORDAN C,TASKAR B,et al.Adaptive pose priors for pictorial structures[C]//Computer Vision and Pattern Recognition.2010:422-429.
[16] Dantone M,Gall J,Leistner C,et al.Human Pose Estimation Using Body Parts Dependent Joint Regressors[C]//Computer Vision and Pattern Recognition.2013:3041-3048.
[17] PISHCHULIN L,ANDRILUKA M,GEHLER P V,et al.Poselet Conditioned Pictorial Structures[C]//2013 IEEE Conference on Computer Vision and Pattern Recognition.2013:588-595.
[18] JOHNSON S,EVERINGHAM M.Learning effective humanpose estimation from inaccurate annotation[C]//The 24th IEEE Conference on Computer Vision and Pattern Recognition.2011:1465-1472.
[19] TOSHEV A,SZEGEDY C.DeepPose:Human Pose Estimation via Deep Neural Networks[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition.2014:1653-1660.
[20] TOMPSON J,JAIN A,LECUN Y,et al.Joint Training of aConvolutional Network and a Graphical Model for Human Pose Estimation[C]//Neural Information Processing Systems.2014:1799-1807.
[21] WEI S,RAMAKRISHNA V,KANADE T,et al.Convolutional Pose Machines[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.2016:4724-4732.
[22] LONG J,SHELHAMER E,DARRELL T,et al.Fully convolutional networks for semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440.
[23] NEWELL A,YANG K,DENG J,et al.Stacked hourglass networks for human pose estimation[C]//European Conference on Computer Vision.2016:483-499.
[24] LONG J,SHELHAMER E,DARRELL T,et al.Fully convolutional networks for semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440.
[25] ZEILER M D,TAYLOR G W,FERGUS R,et al.Adaptive deconvolutional networks for mid and high level feature learning[C]//International Conference on Computer Vision.2011:2018-2025.
[26] PAPANDREOU G,ZHU T,KANAZAWA N,et al.Towards Accurate Multi-person Pose Estimation in the Wild[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.2017:3711-3719.
[27] HE K,GKIOXARI G,DOLLAR P,et al.Mask R-CNN[C]//International Conference on Computer Vision.2017:2980-2988.
[28] FANG H,XIE S,TAI Y,et al.RMPE:Regional Multi-personPose Estimation[C]//International Conference on Computer Vision.2017:2353-2362.
[29] XIAO B,WU H,WEI Y,et al.Simple Baselines for Human Pose Estimation and Tracking[C]//European Conference on Computer Vision.2018:472-487.
[30] CAO Z,SIMON T,WEI S,et al.Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.2017:1302-1310.
[31] PISHCHULIN L,INSAFUTDINOV E,TANG S,et al.DeepCut:Joint Subset Partition and Labeling for Multi Person Pose Estimation[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.2016:4929-4937.
[32] NEWELL A,HUANG Z,DENG J,et al.Associative Embedding:End-to-End Learning for Joint Detection and Grouping[C]//Neural Information Processing Systems.2017:2277-2287.
[33] INSAFUTDINOV E,PISHCHULIN L,ANDRES B,et al.DeeperCut:A Deeper,Stronger,and Faster Multi-Person Pose Estimation Model[C]//European Conference on Computer Vision.2016:34-50.
[34] HE K,ZHANG X,REN S,et al.Deep Residual Learning for Ima-ge Recognition[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[35] NING G,HE Z.Dual Path Networks for Multi-Person Human Pose Estimation[J].arXiv:1710.10192.
[36] CAO Z,SIMON T,WEI S,et al.Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.2017:1302-1310.
[37] CHEN Y,LI J,XIAO H,et al.Dual Path Networks[J].arXiv:1707.01629.
[38] HUANG G,LIU Z,DER MAATEN L V,et al.Densely Connected Convolutional Networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.2017:2261-2269.
[39] XIE S,GIRSHICK R,DOLLAR P,et al.Aggregated ResidualTransformations for Deep Neural Networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.2017:5987-5995.
[40] CHEN Y,WANG Z,PENG Y,et al.Cascaded Pyramid Network for Multi-person Pose Estimation[C]//2018 IEEE Conference on Computer Vision and Pattern Recognition.2018:7103-7112.
[41] MARTIEZ G A,VILLAMIZAR M,CANÉVET O,et al.Real-time Convolutional Networks for Depth-based Human Pose Estimation[C]//Intelligent Robots and Systems.2018:41-47.
[42] LUO D,DU S,IKENAGA T,et al.End-to-End Feature Pyramid Network for Real-Time Multi-Person Pose Estimation[C]//International Conference on Machine Vision.2019:1-4.
[43] ZHANG Z,TANG J,WU G,et al.Simple and Lightweight Human Pose Estimation[J].arXiv:1911.10346.
[44] YANG W,LI S,OUYANG W,et al.Learning Feature Pyramids for Human Pose Estimation[C]//International Conference on Computer Vision.2017:1290-1299.
[45] NIE X,FENG J,XING J,et al.Generative Partition Networks for Multi-Person Pose Estimation[J].arXiv:1705.07422.
[46] NIE X,FENG J,XING J,et al.Pose Partition Networks forMulti-person Pose Estimation[C]//European Conference on Computer Vision.2018:705-720.
[47] ZHAO Y,LUO Z,QUAN C,et al.Lite Hourglass Network for Multi-person Pose Estimation[C]//MultiMedia Modeling - 26th International Conference.2020:226-238.
[48] LUO Y,XU Z,LIU P,et al.Multi-Person Pose Estimation via Multi-Layer Fractal Network and Joints Kinship Pattern[J].IEEE Transactions on Image Processing,2019,28(1):142-155.
[49] LUO Y,XU Z,LIU P,et al.Combining fractal hourglass network and skeleton joints pairwise affinity for multi-person pose estimation[J].Multimedia Tools and Applications,2019,78(6):7341-7363.
[50] ZHAO Y,LUO Z W,QUAN C Q,et al.Cluster-wise Learning Network for Multi-person Pose Estimation[J].Pattern Recognition,2020,98(2):107074.
[51] SUN K,XIAO B,LIU D,et al.Deep High-Resolution Representation Learning for Human Pose Estimation[C]//IEEE Conference on Computer Vision and Pattern Recognition.2019:5693-5703.
[52] CHENG B,XIAO B,WANG J,et al.Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation[J].arXiv:1908.10357.
[53] ZHANG K,HE P,YAO P,et al.DNANet:De-Normalized Attention Based Multi-Resolution Network for Human Pose Estimation[J].arXiv:1909.05090.
[54] WANG X,CAO Z,WANG R,et al.Improving Human Pose Estimation With Self-Attention Generative Adversarial Networks[J].IEEE Access,2019:119668-119680.
[55] RADWAN I,MOUSTAFA N,KEATING B,et al.Hierarchical Adversarial Network for Human Pose Estimation[J].IEEE Access,2019:103619-103628.
[56] TANG B,FAN Q R,SUN K X,et al.Application of Human Pose Recognition Algorithm in Visual Human-computerInteraction[J].Computer Measurement and Control,2019,27(7):242-247.
[57] TANG X Y,SONG A G.Human Pose Estimation and Its Application in Rehabilitation Training Situational Interaction[J].Journal of Instrumentation,2018,39(11):195-203.
[58] ZENG L Z.Intelligent Campus Management System[J].Communication World,2018(8):309-310.
[59] SONG X Y.Research and Implementation of Abnormal Behavior Identification Technology in Prison Intelligent Monitoring System[D].Nanjing:Nanjing University of Posts and Telecommunications,2013.
[60] LI W.Design of Intelligent Video Surveillance System for Elderly Apartments[D].Huaqiao:Huaqiao University,2017.
[61] ZHOU P X.Design and Implementation of Queue Scoring System Based on Human Pose Estimation[D].University of Electronic Science and Technology,2019.
[62] XIA P.Pedestrian Pose Estimation for Active Safety of Intelligent Vehicle[D].University of Electronic Science and Technology,2019.
[63] LIU H J,ZHOU D M.User Preference Analysis System Based on Human Pose Estimation[J].Tianjin science and technology,2019,46(4):53-56.
[1] ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[2] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[3] NING Han-yang, MA Miao, YANG Bo, LIU Shi-chang. Research Progress and Analysis on Intelligent Cryptology [J]. Computer Science, 2022, 49(9): 288-296.
[4] WANG Run-an, ZOU Zhao-nian. Query Performance Prediction Based on Physical Operation-level Models [J]. Computer Science, 2022, 49(8): 49-55.
[5] CHEN Yong-quan, JIANG Ying. Analysis Method of APP User Behavior Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(8): 78-85.
[6] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[7] YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[8] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[9] PENG Shuang, WU Jiang-jiang, CHEN Hao, DU Chun, LI Jun. Satellite Onboard Observation Task Planning Based on Attention Neural Network [J]. Computer Science, 2022, 49(7): 242-247.
[10] ZHAO Dong-mei, WU Ya-xing, ZHANG Hong-bin. Network Security Situation Prediction Based on IPSO-BiLSTM [J]. Computer Science, 2022, 49(7): 357-362.
[11] QI Xiu-xiu, WANG Jia-hao, LI Wen-xiong, ZHOU Fan. Fusion Algorithm for Matrix Completion Prediction Based on Probabilistic Meta-learning [J]. Computer Science, 2022, 49(7): 18-24.
[12] YANG Bing-xin, GUO Yan-rong, HAO Shi-jie, Hong Ri-chang. Application of Graph Neural Network Based on Data Augmentation and Model Ensemble in Depression Recognition [J]. Computer Science, 2022, 49(7): 57-63.
[13] ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.
[14] DAI Zhao-xia, LI Jin-xin, ZHANG Xiang-dong, XU Xu, MEI Lin, ZHANG Liang. Super-resolution Reconstruction of MRI Based on DNGAN [J]. Computer Science, 2022, 49(7): 113-119.
[15] LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!