Computer Science ›› 2024, Vol. 51 ›› Issue (11): 166-173.doi: 10.11896/jsjkx.230900078

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Unsupervised Target Drift Correction and Tracking Based on Hidden Space Matching

FAN Xiaopeng, PENG Li, YANG Jielong   

  1. School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214026,China
  • Received:2023-09-14 Revised:2024-01-28 Online:2024-11-15 Published:2024-11-06
  • About author:FAN Xiaopeng,born in 1998,postgra-duate.His main research interests include object tracking and pose estimation.
    PENG Li,born in 1967,Ph.D,professor.His main research interests include computer vision and pattern recognition.
  • Supported by:
    Young Scientists Fund of the National Natural Science Foundation of China(62106082).

Abstract: Object tracking is a basic research issue in the field of computer vision.With the development oftracking technology,existing trackers mainly have two challenges,namely relying on a large amount of data annotation information and tracking drift,which seriously limits the improvement of tracker performance.In order to overcome the above challenges,unsupervised target tracking and hidden space matching methods are proposed.Firstly,image pairs are generated in the foreground via a correctable optical flow method.Secondly,the generated image pairs are utilized to train the siamese tracker from scratch.Finally,the hidden space matching method is used to solve the problem of losing track when the target deforms greatly,is occluded,goes out of the field of view and drifting.Experimental results show that the algorithm UHOT significantly improves on multiple datasets and demonstrates strong robustness in difficult scenarios.Compared with the latest unsupervised algorithm SiamDF,UHOT gaines 8% gain on the VOT dataset,comparable to state-of-the-art supervised siamese trackers.

Key words: Unsupervised, Sliding window, Hidden space, Template matching, Object tracking

CLC Number: 

  • TP391.4
[1] WANG N,SONG Y B,MA C,et al.Unsupervised deep tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:1308-1317.
[2] CHONG H S,MA Y J,CHEN J C,et al.S2siamfc:Self-supervised fully convolutional siamese network for visual tracking[C]//Proceedings of the 28th ACM International Conference on Multimedia.2020:1948-1957.
[3] LIU L,ZHANG J N,HE R F,et al.Learning by analogy:Reliable supervision from transformations for unsupervised optical flow estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:6489-6498.
[4] LUCA B,JACK V,JOAO F,et al.Fully-convolutional siamese networks for object tracking[C]//Computer Vision-ECCV 2016 Workshops:Amsterdam,The Netherlands,October 8-10 and 15-16,2016,Proceedings,Part II 14.Springer,2016:850-865.
[5] CAI H Y,LAN L,ZHANG J,et al.Siamdf:Tracking training data-free siamese tracker[M].Neural Networks,2023.
[6] LI H J,PENG L.High-speed tracking algorithm based on negative sample mining and feature fusion[J].Control and Decision Making,2023,38(9):2554-2562.
[7] SUN K W,WANG Z H,LIU H,et al.Maximum Overlap Single Target Tracking Algorithm Based on Attention Mechanism.[J].Computer Science,2023,50(S1):397-401.
[8] ZENG Z H,LUO H L.Cross-dataset Learning CombiningMulti-object Tracking and Human Pose Estimation[J].Compu-ter Science,2023,50(S1):512-518.
[9] ZHENG J L,MA C,PENG H W,et al.Learning to track objects from unlabeled videos[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:13546-13555.
[10] WU Q Q,WAN J,ANTONI B C.Progressive unsupervisedlearning for visual object tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:2993-3002.
[11] ZHANG J,LIU Y,LIU H,et al.Learning local-global multiple correlation filters for robust visual tracking with Kalman filter redetection[J].Sensors,2021,21(4):1129.
[12] LE N,RATHOUR V S,YAMAZAKI K,et al.Deep reinforcement learning in computer vision:a comprehensive survey[J].Artificial Intelligence Review,2022,55(4):2733-2819.
[13] WANG Q,LI Z,LUCA B,et al.Fast online object tracking and segmentation:A unifying approach[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:1328-1338.
[14] CHENG Y M,LI L L,XU Y Y,et al.Segment and track anything[J].arXiv:2305.06558,2023.
[15] LI B,YAN J J,WU W,et al.High performance visual tracking with siamese region proposal network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8971-8980.
[16] TARG S S,DIOGO A,KEVIN L.Resnet in resnet:Generalizing residual architectures[J].arXiv:1603.08029,2016.
[17] YU J H,JIANG Y N,WANG Z Y,et al.Unitbox:An advanced object detection network[C]//Proceedings of the 24th ACM International Conference on Multimedia.2016:516-520.
[18] HO Y S,SAMUEL W.The real-worldweight cross-entropy loss function:Modeling the costs of mislabeling[J].IEEE Access,2019,8:4806-4813.
[19] JADERBERG M,SIMON K,ZISSERMAN A,et al.Spatialtransformer networks.[J].arXiv:1506.02025,2015.
[20] JOOST V A,ANITHA K,MARC A R,et al.Transformation-based models of video sequences[J].arXiv:1701.08435,2017.
[21] TOMAS J,ANKUSH G,HAKAN B,et al.Unsupervised lear-ning of object landmarks through conditional image generation[C]//Advances in Neural Information Processing Systems 31.2018.
[22] OLAF R,PHILIPP F,THOMAS B.U-net:Convolutional net-works for biomedical image segmentation[C]//Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015:18th Interna=tional Conference,Munich,Germany,October 5-9,2015,Proceedings,Part III 18.Springer,2015:234-241.
[23] ALIAK S,STEPHANE L,SERGEY T,et al.First order motion model for image animation[C]//Advances in Neural Information Processing Systems 32.2019.
[24] MATEJ K,ALES L,JIRI M,et al.The sixth visual object tra-cking vot2018 challenge results[C]//Proceedings of theEuro-pean Conference on Computer Vision(ECCV).2018.
[25] WU Y,LI J W,YANG M H.Online object tracking:A bench-mark[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2013:2411-2418.
[26] ZHANG Z P,PENG H W.Deeper and wider siamese networks for real-time visual tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:4591-4600.
[27] LI Y M,FU C H,DING F Q,et al.Autotrack:Towards high-performance visual tracking for uav with automatic spatio-temporal regularization[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition(CVPR).2020.
[28] MARTIN D,GOUTAM B,FAHAD S K,et al.Atom:Accuratetracking by overlap maximization[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:4660-4669.
[1] TIAN Qing, LU Zhanghu, YANG Hong. Unsupervised Domain Adaptation Based on Entropy Filtering and Class Centroid Optimization [J]. Computer Science, 2024, 51(7): 345-353.
[2] LI Dongyang, NIE Rencan, PAN Linna, LI He. UMGN:An Infrared and Visible Image Fusion Network Based on Unsupervised Significance MaskGuidance [J]. Computer Science, 2024, 51(6A): 230600170-5.
[3] SUN Ziwen, YUAN Guanglin, LI Congli, QIN Xiaoyan, ZHU Hong. Object Tracking of Structured SVM Based on DIoU Loss and Smoothness Constraints [J]. Computer Science, 2024, 51(6A): 230700113-8.
[4] ZHENG Shenhai, GAO Xi, LIU Pengwei, LI Weisheng. Occluded Video Instance Segmentation Method Based on Feature Fusion of Tracking and Detection in Time Sequence [J]. Computer Science, 2024, 51(6A): 230600186-6.
[5] LOU Ren, HE Renqiang, ZHAO Sanyuan, HAO Xin, ZHOU Yueqi, WANG Xinyuan, LI Fangfang. Single Stage Unsupervised Visible-infrared Person Re-identification [J]. Computer Science, 2024, 51(6A): 230600138-7.
[6] HE Yifan, HE Yulin, CUI Laizhong, HUANG Zhexue. Subspace-based I-nice Clustering Algorithm [J]. Computer Science, 2024, 51(6): 153-160.
[7] CAI Jiacheng, DONG Fangmin, SUN Shuifa, TANG Yongheng. Unsupervised Learning of Monocular Depth Estimation:A Survey [J]. Computer Science, 2024, 51(2): 117-134.
[8] KONG Senlin, ZHANG Hui, HUANG Zhennan, LIU Youwu, TAO Yan. Asymmetric Teacher-Student Network Model for Industrial Image Anomaly Detection [J]. Computer Science, 2024, 51(11A): 240200069-7.
[9] ZHANG Xinyi, TAN Guang. Real-time Accurate Object Tracking for Resource-constrained Edge Devices [J]. Computer Science, 2024, 51(11A): 231200167-9.
[10] ZHUO Mingsong, MO Lingfei. Spiking Neural Network Classification Model Based on Multi-subnetwork Pre-training [J]. Computer Science, 2024, 51(11A): 240300191-6.
[11] XUE Ruxiang, WEI Junjie, ZHOU Huawei, YANG Hai, WANG Zhe. Infrared Dim and Small Target Detection Based on Cross-domain Migration of Visible Light andInfrared [J]. Computer Science, 2024, 51(10): 287-294.
[12] GU Shiwei, LIU Jing, LI Bingchun, XIONG Deyi. Survey of Unsupervised Sentence Alignment [J]. Computer Science, 2024, 51(1): 60-67.
[13] JING Yeyiran, YU Zeng, SHI Yunxiao, LI Tianrui. Review of Unsupervised Domain Adaptive Person Re-identification Based on Pseudo-labels [J]. Computer Science, 2024, 51(1): 72-83.
[14] XU Jie, WANG Lisong. Contrastive Clustering with Consistent Structural Relations [J]. Computer Science, 2023, 50(9): 123-129.
[15] WANG Luo, LI Biao, FU Ruigang. Infrared Ground Multi-object Tracking Method Based on Improved ByteTrack Algorithm [J]. Computer Science, 2023, 50(9): 176-183.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!