计算机科学 ›› 2021, Vol. 48 ›› Issue (9): 146-152.doi: 10.11896/jsjkx.200800200

• 计算机图形学&多媒体 • 上一篇    下一篇

一种基于改进三元组损失和特征融合的行人重识别方法

张新峰, 宋博   

  1. 北京工业大学信息学部 北京100124
  • 收稿日期:2020-08-29 修回日期:2020-10-18 出版日期:2021-09-15 发布日期:2021-09-10
  • 通讯作者: 宋博(bosong0812@foxmail.com)
  • 作者简介:zxf@bjut.edu.cn

A Person Re-identification Method Based on Improved Triple Loss and Feature Fusion

ZHANG Xin-feng, SONG Bo   

  1. Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China
  • Received:2020-08-29 Revised:2020-10-18 Online:2021-09-15 Published:2021-09-10
  • About author:ZHANG Xin-feng,born in 1974,Ph.D,associate professor,is a member of China Computer Federation.His main research interests include image proces-sing and machine learning.
    SONG Bo,born in 1993,postgraduate.His main research interests include person re-identification and data mining.

摘要: 行人重识别旨在跨摄像头条件下,从目标数据库中检索出特定的行人目标,其在视频监控领域有重要的应用价值。目前其研究难点为样本图像类内差异大、类间差异小,因此如何设计并训练深度神经网络对行人图片提取一个判别力更强的特征成为了其关键。针对以往研究只单独进行全局特征或局部特征学习的不足,提出了一种联合全局特征和局部特征学习的网络结构,该结构能够同时提取全局特征和具有较强区分力的局部细节特征;针对每部分局部特征对行人特征描述的重要性不同,文中提出了一种局部特征的融合方式,该方法能够自适应地生成各个局部特征的权重,最后将融合后的局部特征和全局特征结合使行人特征得到更全面的表征;另外,针对以往的基于难样本挖掘的三元组损失具有优化目标模糊的特点,提出了一种改进的基于难样本挖掘的三元组损失函数。文中分别在行人重识别主流数据集Market-1501和DukeMTMC-reID上验证了所提方法的有效性,其mAP值分别达到了82.16%和74.02%,Rank-1值分别达到了92.75%和86.8%。

关键词: 检索, 三元组损失, 深度学习, 特征融合, 行人重识别

Abstract: Person re-identification aims to retrieve specific pedestrian targets from the target database under the condition of cross camera.It has important application value in the field of video surveillance.At present,the difficulty of the research is that the sample images have large intra class differences and small inter class differences.Therefore,how to design and train the deep neural network to extract a more discriminative feature from pedestrian images is the key.In this paper,we propose a network structure combining global features and local features learning,which can extract global features and local features simultaneously.In view of the different importance of each part of the local features to the description of pedestrian features,this paper proposes a fusion method of local features,which can adaptively generate the weight of each local feature.Finally,the local features and glo-bal features are combined to make the pedestrian features get more comprehensive representation.In addition,in view of the fuzzy optimization objective of the previous triple loss based on hard sample mining,this paper proposes an improved triple loss function based on hard sample mining.The effectiveness of the proposed method is verified on the mainstream person re-identification data sets Market-1501 and DukeMTMC-reID,respectively,and the mAP values are 82.16% and 74.02%,and the Rank-1 values are 92.75% and 86.8%,respectively.

Key words: Deep learning, Feature fusion, Person re-identification, Retrieval, Triplet loss

中图分类号: 

  • TP391.4
[1]LIU X,SONG M,TAO D,et al.Semi-supervised coupled dictionary learning for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:3550-3557.
[2]LIAO S,HU Y,ZHU X,et al.Person re-identification by local maximal occurrence representation and metric learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:2197-2206.
[3]ZHANG L,XIANG T,GONG S.Learning a discriminative null space for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:1239-1248.
[4]ZHAO R,OUYANG W,WANG X.Learning mid-level filters for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:144-151.
[5]KOESTINGER M,HIRZER M,WOHLHART P,et al.Large scale metric learning from equivalence constraints[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2012:2288-2295.
[6]LUO H,JIANG W,FAN X,et al.A survey on deep learning based person re-identiflcation[J].Acta Automatica Sinica,2019,45(11):2032-2049.
[7]ZHENG L,YANG Y,HAUPTMANN A G.Person re-identification:Past,present and future[J].arXiv:1610.02984,2016.
[8]HERMANS A,BEYER L,LEIBE B.In defense of the tripletloss for person re-identification[J].arXiv:1703.07737.
[9]YI D,LEI Z,LIAO S,et al.Deep metric learning for personreidentification[C]//2014 22nd International Conference on Pattern Recognition.IEEE,2014:34-39.
[10]LI W,ZHAO R,XIAO T,et al.Deepreid:Deep filter pairingneural network for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:152-159.
[11]ZHENG L,ZHANG H,SUN S,et al.Person re-identification in the wild[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1367-1376.
[12]ZHENG L,HUANG Y,LU H,et al.Pose Invariant Embedding for Deep Person Re-identification[J].arXiv:1701.07732,2017.
[13]SU C,LI J,ZHANG S,et al.Pose-driven deep convolutionalmodel for person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:3960-3969.
[14]LIN Y,ZHENG L,ZHENG Z,et al.Improving Person Re-identification by Attribute and Identity Learning[J].arXiv:1703.07220,2017.
[15]SUN Y,ZHENG L,YANG Y,et al.Beyond part models:Person retrieval with refined part pooling (and a strong convolutional baseline)[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:480-496.
[16]WANG G,YUAN Y,CHEN X,et al.Learning discriminative features with multiple granularities for person re-identification[C]//Proceedings of the 26th ACM International Conference on Multi-media.2018:274-282.
[17]SCHROFF F,KALENICHENKO D,PHILBIN J.Facenet:Aunified embedding for face recognition and clustering[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:815-823.
[18]ZHENG Z,ZHENG L,YANG Y.A discriminatively learned cnn embedding for person reidentification[J].ACM Transactions on Multimedia Computing,Communications,and Applications (TOMM),2017,14(1):1-20.
[19]YE M,SHEN J,LIN G,et al.Deep Learning for Person Rei-dentification:A Survey and Outlook[J].arXiv:2001.04193,2020.
[20]LUO H,GU Y,LIAO X,et al.Bag of tricks and a strong baseline for deep person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.2019:1487-1495.
[21]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[22]ZHENG Z,ZHENG L,YANG Y.Pedestrian alignment network for large-scale person re-identification[J].IEEE Transactions on Circuits and Systems for Video Technology,2018,29(10):3037-3045.
[23]ZHENG L,SHEN L,TIAN L,et al.Scalable person re-identification:A benchmark[C]//Proceedings of the IEEE Internatio-nal Conference on Computer Vision.2015:1116-1124.
[24]RISTANI E,SOLERA F,ZOU R,et al.Performance measures and a data set for multi-target,multi-camera tracking[C]//European Conference on Computer Vision.Cham:Springer,2016:17-35.
[25]ZHAO H,TIAN M,SUN S,et al.Spindle net:Person re-identification with human body region guided feature decomposition and fusion[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1077-1085.
[26]SHAO X,SHUAI H,LIU Q.Person re-identification method combining attribute features[J/OL].Acta Automatica Sinica.https://doi.org/10.16383/j.aas.c190763.
[1] 聂秀山, 潘嘉男, 谭智方, 刘新放, 郭杰, 尹义龙.
基于自然语言的视频片段定位综述
Overview of Natural Language Video Localization
计算机科学, 2022, 49(9): 111-122. https://doi.org/10.11896/jsjkx.220500130
[2] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[3] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[4] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[5] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[6] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[7] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[8] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[9] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[10] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[11] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[12] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[13] 孟月波, 穆思蓉, 刘光辉, 徐胜军, 韩九强.
基于向量注意力机制GoogLeNet-GMP的行人重识别方法
Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism
计算机科学, 2022, 49(7): 142-147. https://doi.org/10.11896/jsjkx.210600198
[14] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[15] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!