Computer Science, 2024, Vol. 51, Issue (4): 291-298. doi: 10.11896/jsjkx.230300158
WANG Jiahao (1), YAN Hang (1), HU Xin (1), ZHAO Dexin (2)
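Abstract: With the spread of wearable devices such as smart watches and wristbands, applying them to human activity recognition and decoding human activities from their data is important for applications such as health monitoring, daily behavior analysis, and smart homes. However, traditional action recognition algorithms suffer from difficult feature extraction and low recognition accuracy, and they all rest on the closed-set assumption that all training and test data come from the same label space. Most real-world scenarios are open-set: samples with unknown labels may be fed to the model at test time, which causes misclassification. To address human action recognition, this paper proposes the Multi-channel Adaptive Convolutional Network (MCACN). Because feature extraction in a conventional CNN is confined to a small range, the adaptive convolution module uses convolution kernels of different sizes to extract features over different time spans and automatically computes the weights used to sum them. In addition, the multi-channel structure of MCACN processes the data from each sensor separately, capturing the feature details that distinguish similar actions. Finally, a label-based multivariate variational autoencoder is designed, and the model MCACN-VAE is proposed for open-set recognition. This model identifies unknown classes by computing the reconstruction error and focuses on actions of known classes, which improves its robustness. Experimental results show that in closed-set experiments MCACN recognizes actions effectively, with recognition accuracy above 91% for each of 7 daily actions and an overall accuracy of 95%. In open-set experiments, under different degrees of openness, MCACN-VAE achieves an overall recognition accuracy above 89% for known classes and keeps the recognition accuracy for unknown action segments above 75%, showing that the proposed model can effectively reject unknown classes while recognizing known ones.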
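The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the softmax over per-branch scores is an assumed way of "automatically computing weights", the reconstruction is passed in rather than produced by a trained variational autoencoder, and the names `adaptive_conv`, `classify_open_set`, `branch_scores`, and `threshold` are hypothetical.

```python
import math


def conv1d_same(signal, kernel):
    """1-D cross-correlation with zero padding; output length equals input length."""
    k = len(kernel)
    pad_left = (k - 1) // 2
    padded = [0.0] * pad_left + list(signal) + [0.0] * (k - 1 - pad_left)
    return [
        sum(padded[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal))
    ]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_conv(signal, kernels, branch_scores):
    """Weighted sum of convolution branches with different kernel sizes.

    Assumption: branch weights are a softmax over learnable scores.
    """
    weights = softmax(branch_scores)
    branches = [conv1d_same(signal, kernel) for kernel in kernels]
    return [
        sum(w * branch[i] for w, branch in zip(weights, branches))
        for i in range(len(signal))
    ]


def classify_open_set(sample, reconstruction, predicted_label, threshold):
    """Reject a sample as unknown when its mean squared reconstruction
    error exceeds a threshold (hypothetical parameter); otherwise keep
    the classifier's predicted label."""
    error = sum((a - b) ** 2 for a, b in zip(sample, reconstruction)) / len(sample)
    return "unknown" if error > threshold else predicted_label


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0]
    kernels = [[1.0], [1 / 3, 1 / 3, 1 / 3]]  # short and longer time spans
    print(adaptive_conv(signal, kernels, [0.0, 0.0]))
    print(classify_open_set([1.0, 2.0], [3.0, 4.0], "walk", 0.1))
```

In a trained network the kernels and branch scores would be learned, and the threshold would typically be chosen from reconstruction errors observed on known-class validation data.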