计算机科学 ›› 2020, Vol. 47 ›› Issue (6): 164-169.doi: 10.11896/jsjkx.190500013

• 计算机图形学&多媒体 • 上一篇    下一篇

融合视点机制与姿态估计的行人再识别方法

裴嘉震, 徐曾春, 胡平   

  1. 南京工业大学计算机科学与技术学院 南京211816
  • 收稿日期:2019-05-05 出版日期:2020-06-15 发布日期:2020-06-10
  • 通讯作者: 徐曾春(xzc@njtech.edu.cn)
  • 作者简介:pjz34567@163.com
  • 基金资助:
    国家自然科学基金(61672279);国家重点研发计划(2017YFC0805605);江苏省重点研发计划(BE2017617)

Person Re -identification Fusing Viewpoint Mechanism and Pose Estimation

PEI Jia-zhen, XU Zeng-chun, HU Ping   

  1. College of Computer Science and Technology,Nanjing Tech University,Nanjing 211816,China
  • Received:2019-05-05 Online:2020-06-15 Published:2020-06-10
  • About author:PEI Jia-zhen,born in 1995,master.His main research interests include person re-identification,computer vision and deep learning.
    XU Zeng-chun,born in 1973,engineer.Her main research interests include computer science and technology,wireless sensor networks and intelligence computation.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61672279),National Key R&D Program of China (2017YFC0805605) and Key R&D Program of Jiangsu Province (BE2017617)

摘要: 行人再识别是视频监控中一项极具挑战性的任务。图像中的遮挡、光照、姿态、视角等因素,会对行人再识别的准确率造成极大影响。为了提高行人再识别的准确率,提出一种融合视点机制与姿态估计的行人再识别方法。首先,采用姿态估计算法Openpose定位行人关节点;然后,对行人图像进行视图判别以获得视点信息,并根据视点信息与行人关节点位置进行局部区域推荐,生成行人局部图像;接着,将全局图像与局部图像同时输入CNN提取特征;最后,采用特征融合网络将全局与局部的特征融合,以获取更具鲁棒性的特征表示。实验结果表明:提出的方法具有更高的行人再识别准确率,其在CHUK03数据集上的rank1达到了71.3%,在Market1501和DukeMTMC-reID数据集上的mAP分别达到了63.2%与60.5%。因此,所提方法能够很好地应对行人姿态变化和视角变化等问题。

关键词: 深度学习, 特征融合, 相机视点, 行人再识别, 姿态估计

Abstract: Person re-identification is a very challenging task in video surveillance.Person have significant changes in appearance due to occlusion and differences in illumination,posture and perspective,which will ultimately have a great impact on the accuracy of person re-identification.To overcome these difficulties,this paper proposes a method for person re-identification based on viewpoint mechanism and pose estimation.First,the pose estimation algorithm Openpose is used to locate the joint points of person.Then,view discrimination is performed on the image to obtain viewpoint information.Local regions based on viewpoint information and joint point locations is proposed to generate a partial image.Next,the global image and the partial image are input into the CNN simultaneously to extract features.Finally,in order to obtain a more robust feature representation,the feature fusion network is used to fuse the global and local features.Experimental results show that the proposed method has higher person re-identification accuracy.On CHUK03 dataset,the value of rank1 reaches 71.3%,and on Market1501 dataset and DukeMTMC-reID dataset,the mAP reaches 63.2% and 60.5%,respectively.Therefore,the proposed methokd can well cope with person attitude changes,pose changes and other issues.

Key words: Camera viewpoint, Deep learning, Feature fusion, Person re-identification, Pose estimation

中图分类号: 

  • TP391
[1]ZHENG L,SHEN L,TIAN L,et al.Scalable Person Re-identification:A Benchmark[C]//2015 IEEE International Conference on Computer Vision (ICCV).IEEE Computer Society,2015:1116-1124.
[2]LI W,ZHAO R,XIAO T,et al.DeepReID:Deep Filter Pairing Neural Network for Person Re-identification[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2014:152-159.
[3]RISTANI E,SOLERA F,ZOU R,et al.Performance measures and a data set for multi-target,multi-camera Tracking[C]//European Conference on Computer Vision workshop on Benchmarking Multi-Target Tracking.2016:17-35.
[4]LIAO S,HU Y,ZHU N X,et al.Person re-identification by Local Maximal Occurrence representation and metric learning[C]//2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2015:2197-2206.
[5]JOSE C,FLEURET F.Scalable metric learning via weighted approximate rank component analysis[C]//European Conference on Computer Vision.Cham:Springer,2016:875-890.
[6]LIAO S,LI S Z.Efficient PSD Constrained Asymmetric Metric Learning for Person Re-identification[C]//2015 IEEE International Conference on Computer Vision (ICCV).IEEE Computer Society,2015:3685-3693.
[7]YI D,LEI Z,LIAO S,et al.Deep Metric Learning for Person Re-identification[C]//2014 22nd International Conference on Pattern Recognition (ICPR).IEEE Computer Society,2014:34-39.
[8]VARIOR R R,SHUAI B,LU J W,et al.A siamese long short-term memory architecture for human re-identification[C]//European Conference on Computer Vision.Cham:Springer,2016:135-153.
[9]VARIOR R R,HALOI M,WANG G.Gated siamese convolutional neural network architecture for human re-identification[C]//European Conference on Computer Vision.Cham:Springer,2016:791-808.
[10]DING S,LIN L,WANG G,et al.Deep feature learning with relative distance comparison for person re-identification[J].Pattern Recognition,2015,48(10):2993-3003.
[11]WANG F,ZUO W,LIN L,et al.Joint learning of single-image and cross-image representations for person re-identification[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2016:1288-1296.
[12]ZHENG L,HUANG Y,LU H,et al.Pose Invariant Embedding for Deep Person Re-identification[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2017:1456-1465.
[13]ZHAO H,TIAN M,SUN S,et al.Spindle Net:Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2017:1077-1085.
[14]WANG H,LIU Z F,FANG B F,et al.Human pose estimation based on constellation tree structure appearance model[J].Computer Science,2014,41(3):76-79.
[15]LI D,ZHANG Z,CHEN X,et al.A Richly Annotated Dataset for Pedestrian Attribute Recognition[C]//2016 IEEEConfe-rence on Computer Vision and Pattern Recognition (CVPR).IEEE Computer Society,2016.
[16]CAO Z,SIMON T,WEI S E,et al.Realtime multi-person 2D pose estimation using part affinity fields[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE,2017:1302-1310.
[17]ZHANG L,XIANG T,GONG S.Learning a Discriminative Null Space for Person Re-identification[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2016:1239-1248.
[18]MATSUKAWA T,OKABE T,SUZUKI E,et al.Hierarchical Gaussian Descriptor for Person Re-identification[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2016:1363-1372.
[19]SCHUMANN A,STIEFELHAGEN R.Person Re-identification by Deep Learning Attribute-Complementary Information[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).IEEE Computer Society,2017:1435-1443.
[20]ZHENG Z,ZHENG L,YANG Y.Unlabeled samples generate d by GAN improve the person re-identification baseline in vitro[C]//IEEE International Conference on Computer Vision.IEEE Computer Society,2017:3774-3782.
[21]SUN Y,ZHENG L,DENG W,et al.SVDNet for Pedestrian Retrieval[C]//IEEE International Conference on Computer Vision.Piscataway:IEEE,2017:3820-3828.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[3] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 陈坤峰, 潘志松, 王家宝, 施蕾, 张锦.
基于双目叠加仿生的微换衣行人再识别
Moderate Clothes-Changing Person Re-identification Based on Bionics of Binocular Summation
计算机科学, 2022, 49(8): 165-171. https://doi.org/10.11896/jsjkx.210600140
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[10] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[11] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[12] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[13] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[14] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[15] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!