Computer Science ›› 2023, Vol. 50 ›› Issue (6A): 220400023-5.doi: 10.11896/jsjkx.220400023

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Maximum Overlap Single Target Tracking Algorithm Based on Attention Mechanism

SUN Kaiwei, WANG Zhihao, LIU Hu, RAN Xue   

  1. Key Laboratory of Data Engineering and Visual Computing,Chongqing University of Posts and Telecommunications,Chongqing 400065,China
  • Online:2023-06-10 Published:2023-06-12
  • About author:SUN Kaiwei,born in 1987,Ph.D,asso-ciate professor.His main research intere-sts include machine learning,data mi-ning and big data analysis.WANG Zhihao,born in 1997,master’s degree.His main research interests include computer vision and single target tracking.
  • Supported by:
    Natural Science Foundation of Chongqing(cstc2019jcyj-msxmX0021);Science and Technology Research Program of Chongqing Municipal Education Commission(KJCXZD2020027) and National Natural Science Foundation of China(61806033).

Abstract: With the development of artificial intelligence,deep learning has attracted extensive attention in the research of computer vision.In the field of single target tracking,the single target tracking algorithm based on deep learning has been studied.The algorithm complexity of deep learning algorithm is relatively high.The complete segmentation of target classification and target state estimation is conducive to the in-depth discussion of each task.However,the current single target tracking algorithm can not deal with the complex tracking environment well.When the model encounters the complex tracking environment,it often tracks a certain area of the background or tracks the surrounding similar targets.In order to solve the above problems.In this paper,a method is proposed:different attention mechanisms are added to the task of target classification and target state estimation respectively,so that the model can better deal with background confusion and occlusion of similar targets.In order to verify the effectiveness of the above methods,this paper has done a lot of comparative experiments on multiple datasets,and compared with the previous single target tracking algorithm based on deep learning.The proposed algorithm improves 3.1% in the EAO index and 2.3% in the Robustness index.It shows the effectiveness and progressiveness of this method.

Key words: Computer vision, Single target tracking, Attention mechanism, Weight distribution, anomaly detection

CLC Number: 

  • TP391
[1]ZHANG W C,SUN C M.Corner Detection Using Multi-directional Structure Tensor with Multiple Scales[J].International Journal of Computer Vision,2020,128(2):438-459.
[2]ZHAO F,WANG J Q,WU Y,et al.Adversarial Deep Tracking[J].IEEE Transactions on Circuits & Systems for Video Technology,2018,29(7):1998-2011.
[3]YU L Y,FAN C X,MING Y.Improved Target Tracking Algorithm based on Kernelized Correlation Filter[J].Journal of Computer Applications,2015,35(12):3550-3554.
[4]WANG N,SONG Y B.Unsupervised Deep Tracking[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach:IEEE,2020:1308-1317.
[5]WU Y,LIM J.Object Tracking Benchmark[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1834-1848.
[6]WU Y,LIM J.A Numerical Algorithm for the Coupled PDEs Control Problem[J].Computation Aleconomics,2019,53(2):697-707.
[7]DANELLJAN M,BHAT G,KHAN F S,et al.ATOM:Accurate tracking by overlap maximization[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach:IEEE,2019:4655-4664.
[8]KUMAR D,MONDAL S,KARUPPUSWAMI S,et al.Har-monic RFID Communication Using Conventional UHF System[J].IEEE Journal of Radio Frequency Identification,2019,3(4):227-235.
[9]ZHANG X M,ZHANG X H,DU X D,et al.Learning multi-domain convolutional network for rgb-t visual tracking[C]//2018 11th International Congress on Image and Signal Processing,BioMedical Engineering and Informatics(CISP-BMEI).Beijing:IEEE,2018:1-6.
[10]JUNG I,SON J,BAEK M,et al.Real-time mdnet[C]//Procee-dings of the European Conference on Computer Vision(ECCV).Cham:Springer,2018:83-98.
[11]FERRAZ P A P,DE OLIVEIRA B A G,FERREIRA F M F,et al.Three-stage RGBD architecture for vehicle and pedestrian detection using convolutional neural networks and stereo vision[J].IET Intelligent Transport Systems,2020,14(10):1319-1327.
[1] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[2] SONG Xinyang, YAN Zhiyuan, SUN Muyi, DAI Linlin, LI Qi, SUN Zhenan. Review of Talking Face Generation [J]. Computer Science, 2023, 50(8): 68-78.
[3] WANG Xu, WU Yanxia, ZHANG Xue, HONG Ruize, LI Guangsheng. Survey of Rotating Object Detection Research in Computer Vision [J]. Computer Science, 2023, 50(8): 79-92.
[4] ZHOU Ziyi, XIONG Hailing. Image Captioning Optimization Strategy Based on Deep Learning [J]. Computer Science, 2023, 50(8): 99-110.
[5] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[6] WANG Jiahao, ZHONG Xin, LI Wenxiong, ZHAO Dexin. Human Activity Recognition with Meta-learning and Attention [J]. Computer Science, 2023, 50(8): 193-201.
[7] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[8] LI Hui, LI Wengen, GUAN Jihong. Dually Encoded Semi-supervised Anomaly Detection [J]. Computer Science, 2023, 50(7): 53-59.
[9] YAN Mingqiang, YU Pengfei, LI Haiyan, LI Hongsong. Arbitrary Image Style Transfer with Consistent Semantic Style [J]. Computer Science, 2023, 50(7): 129-136.
[10] HENG Hongjun, ZHOU Wenhua. Anomaly Detection Method Based on Context Information Fusion and Noise Adaptation [J]. Computer Science, 2023, 50(7): 237-245.
[11] GAO Xiang, TANG Jiqiang, ZHU Junwu, LIANG Mingxuan, LI Yang. Study on Named Entity Recognition Method Based on Knowledge Graph Enhancement [J]. Computer Science, 2023, 50(6A): 220700153-6.
[12] ZHANG Tao, CHENG Yifei, SUN Xinxu. Graph Attention Networks Based on Causal Inference [J]. Computer Science, 2023, 50(6A): 220600230-9.
[13] CUI Lin, CUI Chenlu, LIU Zhengwei, XUE Kai. Speech Emotion Recognition Based on Improved MFCC and Parallel Hybrid Model [J]. Computer Science, 2023, 50(6A): 220800211-7.
[14] DUAN Jianyong, YANG Xiao, WANG Hao, HE Li, LI Xin. Document-level Relation Extraction of Graph Attention Convolutional Network Based onInter-sentence Information [J]. Computer Science, 2023, 50(6A): 220800189-6.
[15] QIN Jing, WANG Weibin, ZOU Qijie, WANG Zumin, JI Changqing. Review of 3D Target Detection Methods Based on LiDAR Point Clouds [J]. Computer Science, 2023, 50(6A): 220400214-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!