计算机科学 ›› 2022, Vol. 49 ›› Issue (4): 174-187.doi: 10.11896/jsjkx.210700084

• 计算机图形学&多媒体 • 上一篇    下一篇

基于单目RGB图像的三维手势跟踪算法综述

张继凯1,2, 李琦1,2, 王月明1,2, 吕晓琪2,3   

  1. 1 内蒙古科技大学信息工程学院 内蒙古 包头 014010;
    2 内蒙古自治区模式识别与智能图像处理重点实验室 内蒙古 包头 014010;
    3 内蒙古工业大学信息工程学院 呼和浩特 010051
  • 收稿日期:2021-07-08 修回日期:2021-10-20 发布日期:2022-04-01
  • 通讯作者: 李琦(17854119664@163.com)
  • 作者简介:(jkzhang0314@imust.edu.cn)
  • 基金资助:
    国家自然科学基金(61771266); 内蒙古自治区自然科学基金(2019BS06005); 内蒙古自治区高等学校科学研究项目(NJZY20095); 内蒙古自治区科技计划项目(2019GG138)

Survey of 3D Gesture Tracking Algorithms Based on Monocular RGB Images

ZHANG Ji-kai1,2, LI Qi1,2, WANG Yue-ming1,2, LYU Xiao-qi2,3   

  1. 1 School of Information Engineering, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia 014010, China;
    2 Inner Mongolia Key Laboratory of Pattern Recognition and Intelligent Image Processing, Baotou, Inner Mongolia 014010, China;
    3 School of Information Engineering, Inner Mongolia University of Technology, Hohhot 010051, China
  • Received:2021-07-08 Revised:2021-10-20 Published:2022-04-01
  • About author:ZHANG Ji-kai,born in 1988,Ph.D,lecturer.His main research interests include augmented reality and image re-cognition.LI Qi,born in 1996,postgraduate.Her main research interests include gesture estimation and augmented reality.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China(61771266),Natural Science Foundation of Inner Mongolia Autonomous Region(2019BS06005),Scientific Research Projects of Colleges and Universities in Inner Mongolia Autonomous Region(NJZY20095) and Inner Mongolia Autonomous Region Science and Technology Project(2019GG138).

摘要: 鉴于人机交互(Human-Computer Interaction,HCI)系统和虚拟现实(Virtual Reality,VR)系统的应用需要,三维手势跟踪领域的理论和方法研究已成为国内外广泛关注的热点课题之一。近年来,基于计算机视觉的三维手势跟踪算法迅速发展,其中成本较低且较为普遍的单目RGB相机最具潜力,它是三维手势跟踪应用得以照进现实的重要工具和途径,成为了研究者们的研究重点。为了解在此基础上的手势跟踪算法的发展现状,辅助该领域的研究者们进行更深入的探索,文中首先在与传统方法的对比中引出了基于单目RGB图像的三维手势跟踪算法,并将其分为判别法、生成法和混合法3类,概括了其相应的优缺点;其次讨论了RGB图像特点对三维手势跟踪的影响,并归纳了缓解图像深度模糊性的方法;然后根据分类着重分析了以RGB为输入数据的代表性算法,通过可视化的性能评估指标比较了相关算法的具体优势和不足;最后总结了当前三维手势跟踪算法面临的难题并对未来的发展进行了展望。

关键词: RGB图像, 计算机视觉, 人机交互, 三维手势跟踪, 虚拟现实

Abstract: In view of the needs of applications such as human-computer interaction(HCI) systems and virtual reality(VR) systems, the study on theories and methods of 3D gesture tracking has become one of the hot issues with widespread concern at home and abroad.In recent years, the 3D gesture tracking algorithms based on computer vision develop rapidly.Among them, the more economical and ubiquitous monocular RGB camera has the most potential.It is an important tool and way for 3D gesture tracking applications to take into reality, which has been focused by researchers.In order to comprehend the development status of gesture tracking algorithms, and assist researchers in this field to conduct more deep-going explorations, firstly, in comparison with the traditional methods, this paper introduces the 3D gesture tracking algorithms based on monocular RGB image, and divides it into three categories:discriminative methods, generative methods and hybrid methods, and summarizes the corresponding advantages and disadvantages.Secondly, the influence of RGB image characteristics on 3D gesture tracking is discussed, and the methods to alleviate the depth ambiguity of the image are generalized.Thirdly, according to the classification, the representative algorithms with RGB as input data are emphatically analyzed, and the specific superiority and weaknesses of related algorithms are compared through visualized performance evaluation index.Finally, the problems faced with the current 3D gesture tracking algorithms are summarized and the future development is prospected.

Key words: 3D gesture tracking, Computer vision, Human-Computer interaction, RGB image, Virtual reality

中图分类号: 

  • TP391.41
[1] ABDUL A,VERMEULEN J,WANG D,et al.Trends and tra-jectories for explainable,accountable and intelligible systems:an hci research agenda[C]//Proceedings of the 2018 CHI Confe-rence on Human Factors in Computing Systems.2018:1-18.
[2] ANTOSHCHUK S,KOVALENKO M,SIECK J.Gesture re-cognition-based human-computer interaction interface for multimedia applications[M]//Digitisation of Culture:Namibian and International Perspectives.Singapore:Springer,2018:269-286.
[3] CHIU P,TAKASHIMA K,FUJITA K,et al.Pursuit sensing:Extending hand tracking space in mobile vr applications[C]//Proceedings of Symposium on Spatial User Interaction.2019:1-5.
[4] WITHANA A,KALUARACHCHI T,SINGHABAHU C,et al.WaveSense:Low power voxel-tracking technique for resource limited devices[C]//Proceedings of the Augmented Humans International Conference.2020:1-7.
[5] QIAN J,MA J,LI X,et al.Portal-ble:Intuitive free-hand manipulation in unbounded smartphone-based augmented reality[C]//Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology.2019:133-145.
[6] CHO W,PARK G,WOO W.Bare-hand depth inpainting for3d tracking of hand interacting with object[C]//Proceedings of 2020 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).IEEE,2020:251-259.
[7] PONRAJ G,REN H.Sensor fusion of leap motion controller and flex sensors using kalman filter for human finger tracking[J].IEEE Sensors Journal,2018,18(5):2042-2049.
[8] JIANG L,XIA H,GUO C.A model-based system for real-time articulated hand tracking using a simple data glove and a depth camera[J].Sensors,2019,19(21):4680-4788.
[9] PARK G,ARGYROS A,LEE J,et al.3d hand tracking in the presence of excessive motion blur[J].IEEE transactions on visualization and computer graphics,2020,26(5):1891-1901.
[10] DEVINE S,NICHOLSON C,RAFFERTY K,et al.Improvingthe ergonomics of hand tracking inputs to VR HMD’s[C]//Proceedings of International Conference in Central Europe on Computer Graphics.2017:167-174.
[11] OBERWEGER M,LEPETIT V.Deepprior++:Improving fast and accurate 3d hand pose estimation[C]//Proceedings of the IEEE International Conference on Computer Vision Workshops.2017:585-594.
[12] MOON G,CHANG J Y,LEE K M.V2v-posenet:Voxel-to-voxel prediction network for accurate 3d hand and human pose estimation from a single depth map[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).2018:5079-5088.
[13] XIONG F,ZHANG B,XIAO Y,et al.A2j:Anchor-to-joint regression network for 3d articulated pose estimation from a single depth image[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).2019:793-802.
[14] HUANG W,REN P,WANG J,et al.AWR:Adaptive weighting regression for 3D hand pose estimation[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:11061-11068.
[15] QIAN C,SUN X,WEI Y,et al.Realtime and robust hand tra-cking from depth[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).2014:1106-1113.
[16] YUAN S,YE Q,STENGER B,et al.Bighand2.2m benchmark:Hand pose dataset and state of the art analysis[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR).2017:4866-4874.
[17] SIMON T,JOO H,MATTHEWS I,et al.Hand keypoint detection in single images using multiview bootstrapping[C]//Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR).2017:1145-1153.
[18] CHEN L,LIN S Y,XIE Y,et al.Mvhm:A large-scale multi-view hand mesh benchmark for accurate 3d hand pose estimation[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).2021:836-845.
[19] GOMEZ-DONOSO F,ORTS-ESCOLANO S,CAZORLA M.Large-scale multiview 3d hand pose dataset[J].Image and Vision Computing,2019,81(1):25-33.
[20] EROL A,BEBIS G,NICOLESCU M,et al.Vision-based handpose estimation:A review[J].Computer Vision and Image Understanding,2007,108(1/2):52-73.
[21] CHEN L,LIN S Y,XIE Y,et al.TAGAN:Tonality-alignment generative adversarial networks for realistic hand pose synthesis[C]//Proceedings of the 30th British Machine Vision Confe-rence (BMVC).2019:116-128.
[22] CHEN L,LIN S Y,XIE Y,et al.DGGAN:Depth-image guided generative adversarial networks for disentangling rgb and depth images in 3d hand pose estimation[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).2020:411-419.
[23] DIBRA E,MELCHIOR S,BALKIS A,et al.Monocular rgbhand pose inference from unsupervised refinable nets[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).2018:1075-1085.
[24] IQBAL U,MOLCHANOV P,GALL T B J,et al.Hand pose estimation via latent 2.5 d heatmap regression[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:118-134.
[25] GE L,REN Z,LI Y,et al.3d hand shape and pose estimation from a single rgb image[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:10833-10842.
[26] XIANG D,JOO H,SHEIKH Y.Monocular total capture:posing face,body,and hands in the wild[J].arXiv:1812.01598,2018.
[27] YUAN S,STENGER B,KIM T K.RGB-based 3d hand pose estimation via privileged learning with depth images[J].arXiv:1811.07376,2018.
[28] CAI Y,GE L,CAI J,et al.Weakly-supervised 3d hand pose estimation from monocular rgb images[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:666-682.
[29] WANG K W,CHEN X L.PMD-Net:Privileged modality distillation network for 3dhand pose estimation from a single rgbimage[C]//Proceedings of the 31th British Machine Vision Conference (BMVC).2020:413-425.
[30] MOON G,SHIRATORI T,LEE K M.DeepHandMesh:Aweakly-supervised deep encoder-decoder framework for high-fidelity hand mesh modeling[C]//Proceedings of European Conference on Computer Vision (ECCV).Cham:Springer,2020:440-455.
[31] ZHANG X,LI Q,MO H,et al.End-to-end hand mesh recovery from a monocular rgb image[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).2019:2354-2364.
[32] BAEK S,KIM K I,KIM T K.Weakly-supervised domain adaptation via gan and mesh model for estimating 3d hand poses interacting objects[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition (CVPR).2020:6121-6131.
[33] BAEK S,KIM K I,KIM T K.Pushing the envelope for rgb-based dense 3d hand pose estimation via neural rendering[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:1067-1076.
[34] ZHAO L,PENG X,CHEN Y,et al.Knowledge as priors:Cross-modal knowledge generalization for datasets without superior knowledge[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2020:6528-6537.
[35] ZHANG Y,CHEN L,LIU Y,et al.Adaptive wasserstein hourglass for weakly supervised hand pose estimation from monocular rgb[J].arXiv:1909.05666,2019.
[36] ZHANG Y,CHEN L,LIU Y,et al.Explicit knowledge distil-lation for 3D hand pose estimation from monocular rgb[C]//Proceedings of the 31th British Machine Vision Conference (BMVC).2020:242-253.
[37] YANG L,YAO A.Disentangling latent hands for image synthesis and pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:9877-9886.
[38] THEODORIDIS T,CHATZIS T,SOLACHIDIS V,et al.Cross-modal variational alignment of latent spaces[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).2020:960-961.
[39] SPURR A,SONG J,PARK S,et al.Cross-modal deep variatio-nal hand pose estimation[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition (CVPR).2018:89-98.
[40] LIN F,WILHELM C,MARTINEZ T.Two-hand global 3d pose estimation using monocular rgb[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).2021:2373-2381.
[41] DOOSTI B,NAHA S,MIRBAGHERI M,et al.Hope-net:Agraph-based model for hand-object pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2020:6608-6617.
[42] SPURR A,IQBAL U,MOLCHANOV P,et al.Weakly supervised 3d hand pose estimation via biomechanical constraints[J].arXiv:2003.09282,2020.
[43] YANG J,CHANG H J,LEE S,et al.SeqHAND:RGB-se-quence-based 3d hand pose and shape estimation[C]//Procee-dings of European Conference on Computer Vision (ECCV).Cham:Springer,2020:122-139.
[44] FENG Z Q,YANG B.Theory and method of 3dnatural gesture tracking[M].Peking:Tsinghua University Press,2013.
[45] AHMAD A,MIGNIOT C,DIPANDA A.Tracking hands in interaction with objects:A review[C]//Proceedings of 2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).IEEE,2017:360-369.
[46] YANG L,LI S,LEE D,et al.Aligning latent spaces for 3d hand pose estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).2019:2335-2343.
[47] YANG L,LI J,XU W,et al.BiHand:Recovering hand meshwith multi-stage bisected hourglass networks[J].arXiv:2008.05079,2020.
[48] MOON G,YU S I,WEN H,et al.InterHand2.6M:A dataset and baseline for 3d interacting hand pose estimation from a single rgb image[J].arXiv:2008.09309,2020.
[49] ZHANG F,BAZAREVSKY V,VAKUNOV A,et al.MediaPipe hands:On-device real-time hand tracking[J].arXiv:2006.10214,2020.
[50] LIU W,DAI S M,YANG W J,et al.Color image 3d gesture estimation based on cascade convolution neural network[J].Journal of Chinese Computer Systems,2020,41(3):558-563.
[51] ZHOU Q,GAN Y,HE W M,et al.Research on hand pose estimation using LHPN algorithm[J].Computer Engineering & Software,2020,41(7):66-71.
[52] XIONG J,PENG J,YANG W J,et al.Multi-scale high-resolution preserving and perspective-invariant hand pose estimation[J/OL].Computer Engineering and Applications.http://kns.cnki.net/kcms/dtail/11.2127.TP.20200821.1530.002.html.
[53] ZIMMERMANN C,BROX T.Learning to estimate 3d handpose from single rgb images[C]//Proceedings of the IEEE International Conference on Computer Vision (ICCV).2017:4903-4911.
[54] LI M,GAO Y,SANG N.Exploiting learnable joint groups for hand pose estimation[J].arXiv:2012.09496,2020.
[55] DOOSTI B.Hand pose estimation:A survey[J].arXiv:1903.01013,2019.
[56] LI D,GUO Y,CHEN C,et al.Collaborative differential evolution filtering for tracking hand-object interactions[J].IEEE Access,2020,8:148289-148300.
[57] LI D,CHEN C.Tracking a hand in interaction with an objectbased on single depth images[J].Multimedia Tools and Applications,2019,78(6):6745-6762.
[58] CHO W,PARK G,WOO W.Tracking an object-grabbing handusing occluded depth reconstruction[C]//Proceedings of 2018 IEEE International Symposium on Mixed and Augmented Rea-lity Adjunct (ISMAR-Adjunct).IEEE,2018:232-235.
[59] SANCHEZ-RIERA J,SRINIVASAN K,HUA K L,et al.Robust rgb-d hand tracking using deep learning priors[J].IEEE Transactions on Circuits and Systems for Video Technology,2017,28(9):2289-2301.
[60] MUELLER F,BERNARD F,SOTNYCHENKO O,et al.GANerated hands for real-time 3d hand tracking from monocular rgb[J].arXiv:1712.01057,2017.
[61] THOMPSON P,GALATA A.Hand tracking from monocular rgb with dense semantic labels[C]//Proceedings of 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG).IEEE Computer Society,2020:709-716.
[62] PANTELERIS P,OIKONOMIDIS I,ARGYROS A.Using asingle rgb frame for real time 3d hand pose estimation in the wild[J].arXiv:1712.03866,2017.
[63] LYU J,XU W,YANG L,et al.HandTailor:Towards high-precision monocular 3d hand recovery[J].arXiv:2102.09244,2021.
[64] ROMERO J,TZIONAS D,BLACK M J.Embodied hands:Modeling and capturing hands and bodies together[J].ACM Transactions on Graphics (ToG),2017,36(6):1-17.
[65] HE Y,HU W,YANG S,et al.3D hand pose estimation in the wild via graph refinement under adversarial learning[J].arXiv:1912.01875,2019.
[66] BOUKHAYMA A,BEM R,TORR P H S.3d hand shape and pose from images in the wild[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:10843-10852.
[67] ZHOU Y,HABERMANN M,XU W,et al.Monocular real-time hand shape and motion capture using multi-modal data[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2020:5346-5355.
[68] ZHANG J,JIAO J,CHEN M,et al.3d hand pose tracking and estimation using stereo matching[J].arXiv:1610.07214,2016.
[69] MUELLER F,MEHTA D,SOTNYCHENKO O,et al.Real-timehand tracking under occlusion from an egocentric rgb-d sensor[C]//Proceedings of the IEEE Internatio-nal Conference on Computer Vision (ICCV).2017:1154-1163.
[70] SRIDHAR S,MUELLER F,ZOLLHÖFER M,et al.Real-time joint tracking of a hand manipulating an object from rgb-d input[C]//Proceedings of European Conference on Computer Vision (ECCV).Cham:Springer,2016:294-310.
[71] ZIMMERMANN C,CEYLAN D,YANG J,et al.Freihand:A dataset for markerless capture of hand pose and shape from single rgb images[C]//Proceedings of the IEEE/CVF Internatio-nal Conference on Computer Vision (ICCV).2019:813-822.
[72] LIU Y,LI J,ZHANG J L,et al.Research Progress of Two-Dimensional Human Pose Estimation Based on Deep Learning[J].Computer Engineering,2021,47(3):1-16.
[73] WU H B,XU R T,WANG A L,et al.Overview of 3D Human Lumen Reconstruction Technology Based on Computer Vision[J].Computer Engineering,2021,47(10):1-15.
[74] WU Z,HOANG D,LIN S Y,et al.Mm-hand:3d-aware multi-modal guided hand generative network for 3d hand pose synthesis[J].arXiv:2010.01158,2020.
[75] CORONA E,PUMAROLA A,ALENYA G,et al.Ganhand:Predicting human grasp affordances in multi-object scenes[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2020:5031-5041.
[76] YUAN S,GARCIA-HERNANDO G,STENGER B,et al.Depth-based 3d hand pose estimation:From current achievements to future goals[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).2018:2636-2645.
[77] SRIDHAR S,OULASVIRTA A,THEOBALT C.Interactive markerless articulated hand motion tracking using rgb and depth data[C]//Proceedings of the IEEE International Confe-rence on Computer Vision (ICCV).2013:2456-2463.
[78] ROGEZ G,KHADEMI M,SUPAN?I? III J S,et al.3d hand pose detection in egocentric rgb-d images[C]//Proceedings of European Conference on Computer Vision (ECCV).Cham:Springer,2014:356-371.
[79] GOMEZ-DONOSO F,ORTS-ESCOLANO S,CAZORLA M.Large-scale multiview 3d hand pose dataset[J].arXiv:1707.03742,2017.
[80] WANG Y,PENG C,LIU Y.Mask-pose cascaded cnn for 2d hand pose estimation from single color image[J].IEEE Transactions on Circuits and Systems for Video Technology,2018,29(11):3258-3268.
[81] BRAHMBHATT S,TANG C,TWIGG C D,et al.ContactPose:A dataset of grasps with object contact and hand pose[J].ar-Xiv:2007.09545,2020.
[82] GU J,WANG Z,OUYANG W,et al.3d hand pose estimationwith disentangled cross-modal latent space[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).2020:391-400.
[83] WANG J,MUELLER F,BERNARD F,et al.RGB2Hands:Real-time tracking of 3d hand interactions from monocular rgb video[J].ACM Transactions on Graphics (TOG),2020,39(6):1-16.
[84] RAD M,OBERWEGER M,LEPETIT V.Domain transfer for 3d pose estimation from color images without manual annotations[C]//Proceedings of Asian Conference on Computer Vision (ACCV).Cham:Springer,2018:69-84.
[85] AN S,ZHANG X,WEI D,et al.FastHand:Fast hand pose estimation from a monocular camera[J].arXiv:2102.07067,2021.
[86] KULON D,GULER R A,KOKKINOS I,et al.Weakly-supervised mesh-convolutional hand reconstruction in the wild[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2020:4990-5000.
[87] TEKIN B,BOGO F,POLLEFEYS M.H+ o:Unified egocentric recognition of 3d hand-object poses and interactions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:4511-4520.
[88] RONG Y,SHIRATORI T,JOO H.FrankMocap:Fast monocular 3d hand and body motion capture by regression and integration[J].arXiv:2008.08324,2020.
[89] PANTELERIS P,ARGYROS A.Back to rgb:3d tracking ofhands and hand-object interactions based on short-baseline ste-reo[C]//Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW).2017:575-584.
[90] HASSON Y,VAROL G,TZIONAS D,et al.Learning joint reconstruction of hands and manipulated objects[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).2019:11807-11816.
[1] 曲倩文, 车啸平, 曲晨鑫, 李瑾如.
基于信息感知的虚拟现实用户临场感研究
Study on Information Perception Based User Presence in Virtual Reality
计算机科学, 2022, 49(9): 146-154. https://doi.org/10.11896/jsjkx.220500200
[2] 刘月红, 牛少华, 神显豪.
基于卷积神经网络的虚拟现实视频帧内预测编码
Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network
计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179
[3] 李荪, 曹峰.
智能语音技术端到端框架模型分析和趋势研究
Analysis and Trend Research of End-to-End Framework Model of Intelligent Speech Technology
计算机科学, 2022, 49(6A): 331-336. https://doi.org/10.11896/jsjkx.210500180
[4] 谈馨悦, 何小海, 王正勇, 罗晓东, 卿粼波.
基于Transformer交叉注意力的文本生成图像技术
Text-to-Image Generation Technology Based on Transformer Cross Attention
计算机科学, 2022, 49(2): 107-115. https://doi.org/10.11896/jsjkx.210600085
[5] 干创, 吴桂兴, 詹庆原, 王鹏焜, 彭志磊.
基于骨架模态的多级门控图卷积动作识别网络
Multi-scale Gated Graph Convolutional Network for Skeleton-based Action Recognition
计算机科学, 2022, 49(1): 181-186. https://doi.org/10.11896/jsjkx.201100164
[6] 朱晨爽, 程时伟.
虚拟现实环境下基于眼动跟踪的导航需求预测与辅助
Prediction and Assistance of Navigation Demand Based on Eye Tracking in Virtual Reality Environment
计算机科学, 2021, 48(8): 315-321. https://doi.org/10.11896/jsjkx.200500031
[7] 刘亮, 蒲浩洋.
基于LSTM的多维度特征手势实时识别
Real-time LSTM-based Multi-dimensional Features Gesture Recognition
计算机科学, 2021, 48(8): 328-333. https://doi.org/10.11896/jsjkx.210300079
[8] 冯芙蓉, 张兆功.
目标轮廓检测技术新进展
Recent Advances for Object Contour Detection Technology
计算机科学, 2021, 48(6A): 1-9. https://doi.org/10.11896/jsjkx.201000044
[9] 张开华, 樊佳庆, 刘青山.
视觉目标跟踪十年研究进展
Advances on Visual Object Tracking in Past Decade
计算机科学, 2021, 48(3): 40-49. https://doi.org/10.11896/jsjkx.201100186
[10] 马思琪, 车啸平, 于淇, 岳晨峰.
基于事件的虚拟现实用户体验评估方法研究
Event-based User Experience Evaluation Method for Virtual Reality Applications
计算机科学, 2021, 48(2): 167-174. https://doi.org/10.11896/jsjkx.200100065
[11] 李亚泽, 刘宏哲.
基于相邻特征融合的目标检测
Object Detection Based on Neighbour Feature Fusion
计算机科学, 2021, 48(12): 264-268. https://doi.org/10.11896/jsjkx.201200196
[12] 程铭, 马佩, 何儒汉.
基于流形结构神经网络的服装图像集分类方法
Clothing Image Sets Classification Based on Manifold Structure Neural Network
计算机科学, 2021, 48(11A): 391-395. https://doi.org/10.11896/jsjkx.201200127
[13] 陈浩楠, 雷印杰, 王浩.
基于行列解耦采样的轻量车道线检测模型
Lightweight Lane Detection Model Based on Row-column Decoupled Sampling
计算机科学, 2021, 48(11A): 416-419. https://doi.org/10.11896/jsjkx.201100206
[14] 谢海平, 李高源, 杨海涛, 赵洪利.
超分辨率重构遥感图像分类研究
Classification Research of Remote Sensing Image Based on Super Resolution Reconstruction
计算机科学, 2021, 48(11A): 424-428. https://doi.org/10.11896/jsjkx.210300132
[15] 张宇翔, 任爽.
定位技术在虚拟现实中的应用综述
Overview of Application of Positioning Technology in Virtual Reality
计算机科学, 2021, 48(1): 308-318. https://doi.org/10.11896/jsjkx.200800010
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!