计算机科学 ›› 2026, Vol. 53 ›› Issue (3): 231-239.doi: 10.11896/jsjkx.241100094

• 计算机图形学 & 多媒体 • 上一篇    下一篇

关联策略多特征增强的多目标跟踪

陈云芳, 方倩, 吕尊威, 张伟   

  1. 南京邮电大学计算机学院 南京 210023
  • 收稿日期:2024-11-15 修回日期:2025-02-20 发布日期:2026-03-12
  • 通讯作者: 张伟(zhangw@njupt.edu.cn)
  • 作者简介:(chenyf@njupt.edu.cn)
  • 基金资助:
    国家自然科学基金(62072252)

Multi-feature Enhanced Association Strategy for Multi-object Tracking

CHEN Yunfang, FANG Qian, LYU Zunwei, ZHANG Wei   

  1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
  • Received:2024-11-15 Revised:2025-02-20 Online:2026-03-12
  • About author:CHEN Yunfang,born in 1976,Ph.D,postgraduate supervisor.His main research interests include artificial intelligence algorithms,domain-specific functional analysis and application development using intelligent systems.
    ZHANG Wei,born in 1973,Ph.D,Ph.D supervisor.His main research interests include smart sensing and cognition in UAV systems,privacy protection and artificial intelligence security.
  • Supported by:
    National Natural Science Foundation of China (62072252).

摘要: 在复杂场景下,多目标跟踪面临密集的目标遮挡、目标非线性运动、关联匹配算法欠佳导致身份匹配错误以及频繁的身份切换等问题。对此,以ByteTrack为基线算法,充分利用现有的判别性特征,从运动模型、弱特征数据关联、匹配算法3个方面对其关联策略进行改进,提出了一种关联策略多特征增强的多目标跟踪算法。首先,针对常规卡尔曼滤波难以对非线性运动的目标位置进行预测的问题,利用预测相似度以及检测置信度动态调整卡尔曼滤波的噪声协方差,提升运动模型对位置预测的准确性。其次,整合二次关联算法,在低置信度检测框和第一次关联后未匹配的轨迹之间,执行弱特征数据关联,减少其与轨迹之间的匹配错误。最后,针对低置信度检测目标,利用相对深度对检测目标以及轨迹进行分解,并采用级联匹配算法进行关联,有效减少IoU匹配碰撞,提高了算法在密集遮挡场景下的跟踪表现。在MOT17与MOT20测试集上,所提算法的HOTA分别为64.5%与63.2%,与基线算法相比,所有评估指标均取得显著提升。

关键词: 多目标跟踪, 联合检测跟踪, 数据关联

Abstract: In complex scenarios,multi-target tracking faces problems such as dense target occlusion,nonlinear target motion,poor association matching algorithms leading to identity matching errors,and frequent identity switching.This paper takes ByteTrack as the baseline algorithm and improves its association strategy from three aspects:motion model,weak feature data association,and matching algorithm by fully utilizing existing discriminative features.A multi-objective tracking algorithm with multi discri-minative feature enhanced association strategy is proposed.Firstly,in response to the problem that conventional Kalman filtering is difficult to predict the target position of nonlinear motion,the noise covariance of Kalman filtering is dynamically adjusted using prediction similarity and detection confidence to optimize the motion model and improve the accuracy of target position prediction.Secondly,by integrating a secondary association algorithm,weak feature data association is performed between low-confidence detections and tracks that remain unmatched after the initial association,reducing mismatches between them.Finally,for low confidence detection targets,relative depth is used to decompose the detection targets and trajectories,and a cascaded matching algorithm is used for association,effectively reducing IoU matching collisions and improving the tracking performance of the algorithm in dense occlusion scenes.On the MOT17 and MOT20 test sets,HOTA is 64.5% and 63.2%,respectively,and all evaluation metrics show significant improvement compared to the baseline algorithm.

Key words: Multi-object tracking, Joint detection and tracking, Data association

中图分类号: 

  • TP391
[1]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[2]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[J].arXiv:1804.02767,2018.
[3]BOCHINSKI E,SENST T,SIKORA T.Extending IOU based multi-object tracking by visual information[C]//2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance(AVSS).IEEE,2018:1-6.
[4]BEWLEY A,GE Z,OTT L,et al.Simple online and realtime tracking[C]//2016 IEEE International Conference on Image Processing(ICIP).IEEE,2016:3464-3468.
[5]KALMAN R E.A New Approach to Linear Filtering and Prediction Problems[J].Journal of Fluids Engineering,1959,82D:35-45.
[6]BERGMANN P,MEINHARDT T,LEAL-TAIXE L.Trackingwithout bells and whistles[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:941-951.
[7]DU Y,ZHAO Z,SONG Y,et al.Strongsort:Make deepsortgreat again[J].IEEE Transactions on Multimedia,2023,25:8725-8737.
[8]XIAO T,LI S,WANG B,et al.Joint detection and identification feature learning for person search[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:3415-3424.
[9]LIANG C,ZHANG Z,ZHOU X,et al.Rethinking the competition between detection and ReID in multiobject tracking[J].IEEE Transactions on Image Processing,2022,31:3182-3196.
[10]ZHANG Y,SUN P,JIANG Y,et al.Bytetrack:Multi-objecttracking by associating every detection box[C]//European Conference on Computer Vision.Springer,2022:1-21.
[11]GE Z,LIU S,WANG F,et al.YOLOX:Exceeding YOLO Series in 2021[J].arXiv:2107.08430,2021.
[12]WANG Z,ZHENG L,LIU Y,et al.Towards real-time multi-object tracking[C]//European Conference on Computer Vision.Springer,2020:107-122.
[13]AHARON N,ORFAIG R,BOBROVSKY B Z.BoT-SORT:Robust Associations Multi-Pedestrian Tracking[J].arXiv:2206.14651,2022.
[14]CAO J,PANG J,WENG X,et al.Observation-centric sort:Rethinking sort for robust multi-object tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:9686-9696.
[15]WOJKE N,BEWLEY A,PAULUS D.Simple online and real-time tracking with a deep association metric[C]//2017 IEEE International Conference on Image Processing(ICIP).IEEE,2017:3645-3649.
[16]MAGGIOLINO G,AHMAD A,CAO J,et al.Deep oc-sort:Multi-pedestrian tracking by adaptive re-identification[J].ar-Xiv:2302.11813,2023.
[17]ZHANG Y,WANG C,WANG X,et al.Fairmot:On the fairness of detection and re-identification in multiple object tracking[J].International Journal of Computer Vision,2021,129:3069-3087.
[18]MILAN A,LEAL-TAIXÉ L,REID I,et al.MOT16:A benchmark for multi-object tracking[J].arXiv:1603.00831,2016.
[19]DENDORFER P,REZATOFIGHI H,MILAN A,et al.Mot20:A benchmark for multi object tracking in crowded scenes[J].arXiv:2003.09003,2020.
[20]BERNARDIN K,STIEFELHAGEN R.Evaluating multiple object tracking performance:the clear mot metrics[J].EURASIP Journal on Image and Video Processing,2008,2008:1-10.
[21]RISTANI E,SOLERA F,ZOU R,et al.Performance measures and a data set for multi-target,multi-camera tracking[C]//European Conference on Computer Vision.Springer,2016:17-35.
[22]LUITEN J,OSEP A,DENDORFER,et al.Hota:A higher order metric for evaluating multi-object tracking[J].International Journal of Computer Vision,2021,129:548-578.
[23]ZHOU X,KOLTUN V,KRÄHENBÜHL P.Tracking objectsas points[C]//European Conference on Computer Vision.Springer,2020:474-490.
[24]XU Y,BAN Y,DELORME G,et al.TransCenter:Transformers with dense representations for multiple-object tracking[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,45(6):7820-7835.
[25]WANG S W,GAO C Q,HUANG X,et al.UAV multiple object tracking based on feature separation[J].Journal of Chongqing University of Posts and Telecommunication(Natural Science Edition),2024,36(5):896-906.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!