Computer Science ›› 2021, Vol. 48 ›› Issue (9): 146-152.doi: 10.11896/jsjkx.200800200

• Computer Graphics & Multimedia • Previous Articles     Next Articles

A Person Re-identification Method Based on Improved Triple Loss and Feature Fusion

ZHANG Xin-feng, SONG Bo   

  1. Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China
  • Received:2020-08-29 Revised:2020-10-18 Online:2021-09-15 Published:2021-09-10
  • About author:ZHANG Xin-feng,born in 1974,Ph.D,associate professor,is a member of China Computer Federation.His main research interests include image proces-sing and machine learning.
    SONG Bo,born in 1993,postgraduate.His main research interests include person re-identification and data mining.

Abstract: Person re-identification aims to retrieve specific pedestrian targets from the target database under the condition of cross camera.It has important application value in the field of video surveillance.At present,the difficulty of the research is that the sample images have large intra class differences and small inter class differences.Therefore,how to design and train the deep neural network to extract a more discriminative feature from pedestrian images is the key.In this paper,we propose a network structure combining global features and local features learning,which can extract global features and local features simultaneously.In view of the different importance of each part of the local features to the description of pedestrian features,this paper proposes a fusion method of local features,which can adaptively generate the weight of each local feature.Finally,the local features and glo-bal features are combined to make the pedestrian features get more comprehensive representation.In addition,in view of the fuzzy optimization objective of the previous triple loss based on hard sample mining,this paper proposes an improved triple loss function based on hard sample mining.The effectiveness of the proposed method is verified on the mainstream person re-identification data sets Market-1501 and DukeMTMC-reID,respectively,and the mAP values are 82.16% and 74.02%,and the Rank-1 values are 92.75% and 86.8%,respectively.

Key words: Deep learning, Feature fusion, Person re-identification, Retrieval, Triplet loss

CLC Number: 

  • TP391.4
[1]LIU X,SONG M,TAO D,et al.Semi-supervised coupled dictionary learning for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:3550-3557.
[2]LIAO S,HU Y,ZHU X,et al.Person re-identification by local maximal occurrence representation and metric learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:2197-2206.
[3]ZHANG L,XIANG T,GONG S.Learning a discriminative null space for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:1239-1248.
[4]ZHAO R,OUYANG W,WANG X.Learning mid-level filters for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:144-151.
[5]KOESTINGER M,HIRZER M,WOHLHART P,et al.Large scale metric learning from equivalence constraints[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2012:2288-2295.
[6]LUO H,JIANG W,FAN X,et al.A survey on deep learning based person re-identiflcation[J].Acta Automatica Sinica,2019,45(11):2032-2049.
[7]ZHENG L,YANG Y,HAUPTMANN A G.Person re-identification:Past,present and future[J].arXiv:1610.02984,2016.
[8]HERMANS A,BEYER L,LEIBE B.In defense of the tripletloss for person re-identification[J].arXiv:1703.07737.
[9]YI D,LEI Z,LIAO S,et al.Deep metric learning for personreidentification[C]//2014 22nd International Conference on Pattern Recognition.IEEE,2014:34-39.
[10]LI W,ZHAO R,XIAO T,et al.Deepreid:Deep filter pairingneural network for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:152-159.
[11]ZHENG L,ZHANG H,SUN S,et al.Person re-identification in the wild[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1367-1376.
[12]ZHENG L,HUANG Y,LU H,et al.Pose Invariant Embedding for Deep Person Re-identification[J].arXiv:1701.07732,2017.
[13]SU C,LI J,ZHANG S,et al.Pose-driven deep convolutionalmodel for person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:3960-3969.
[14]LIN Y,ZHENG L,ZHENG Z,et al.Improving Person Re-identification by Attribute and Identity Learning[J].arXiv:1703.07220,2017.
[15]SUN Y,ZHENG L,YANG Y,et al.Beyond part models:Person retrieval with refined part pooling (and a strong convolutional baseline)[C]//Proceedings of the European Conference on Computer Vision (ECCV).2018:480-496.
[16]WANG G,YUAN Y,CHEN X,et al.Learning discriminative features with multiple granularities for person re-identification[C]//Proceedings of the 26th ACM International Conference on Multi-media.2018:274-282.
[17]SCHROFF F,KALENICHENKO D,PHILBIN J.Facenet:Aunified embedding for face recognition and clustering[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:815-823.
[18]ZHENG Z,ZHENG L,YANG Y.A discriminatively learned cnn embedding for person reidentification[J].ACM Transactions on Multimedia Computing,Communications,and Applications (TOMM),2017,14(1):1-20.
[19]YE M,SHEN J,LIN G,et al.Deep Learning for Person Rei-dentification:A Survey and Outlook[J].arXiv:2001.04193,2020.
[20]LUO H,GU Y,LIAO X,et al.Bag of tricks and a strong baseline for deep person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.2019:1487-1495.
[21]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[22]ZHENG Z,ZHENG L,YANG Y.Pedestrian alignment network for large-scale person re-identification[J].IEEE Transactions on Circuits and Systems for Video Technology,2018,29(10):3037-3045.
[23]ZHENG L,SHEN L,TIAN L,et al.Scalable person re-identification:A benchmark[C]//Proceedings of the IEEE Internatio-nal Conference on Computer Vision.2015:1116-1124.
[24]RISTANI E,SOLERA F,ZOU R,et al.Performance measures and a data set for multi-target,multi-camera tracking[C]//European Conference on Computer Vision.Cham:Springer,2016:17-35.
[25]ZHAO H,TIAN M,SUN S,et al.Spindle net:Person re-identification with human body region guided feature decomposition and fusion[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1077-1085.
[26]SHAO X,SHUAI H,LIU Q.Person re-identification method combining attribute features[J/OL].Acta Automatica Sinica.https://doi.org/10.16383/j.aas.c190763.
[1] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[2] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[3] NIE Xiu-shan, PAN Jia-nan, TAN Zhi-fang, LIU Xin-fang, GUO Jie, YIN Yi-long. Overview of Natural Language Video Localization [J]. Computer Science, 2022, 49(9): 111-122.
[4] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[5] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[6] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[7] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[8] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[9] CHEN Kun-feng, PAN Zhi-song, WANG Jia-bao, SHI Lei, ZHANG Jin. Moderate Clothes-Changing Person Re-identification Based on Bionics of Binocular Summation [J]. Computer Science, 2022, 49(8): 165-171.
[10] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[11] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[12] ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.
[13] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[14] MENG Yue-bo, MU Si-rong, LIU Guang-hui, XU Sheng-jun, HAN Jiu-qiang. Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism [J]. Computer Science, 2022, 49(7): 142-147.
[15] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!