计算机科学 ›› 2019, Vol. 46 ›› Issue (2): 279-285.doi: 10.11896/j.issn.1002-137X.2019.02.043

• 图形图像与模式识别 • 上一篇    下一篇

基于深度学习的交互似然目标跟踪算法

张明月, 王静   

  1. 南京工业大学计算机科学与技术学院 南京211816
  • 收稿日期:2017-12-22 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 王 静(1982-),女,博士,副研究员,主要研究领域为计算机应用、人工智能与无线传感网络,E-mail:cec_job@126.com。
  • 作者简介:张明月(1993-),女,硕士生,主要研究领域为机器学习与深度学习

Interactive Likelihood Target Tracking Algorithm Based on Deep Learning

ZHANG Ming-yue, WANG Jing   

  1. School of Computer Science and Technology,Nanjing Tech University,Nanjing 211816,China
  • Received:2017-12-22 Online:2019-02-25 Published:2019-02-25

摘要: 针对传统的视频跟踪算法对视频跟踪的精度不足以及主成分分析(PCA)的非线性拟合能力较弱的问题,将卷积神经网络与交互似然(IL)算法相结合,在深度学习的基础上对粒子滤波算法进行了优化改进。将核主成分分析(KPCA)网络应用于视频跟踪来获取目标的深层次特征表达,并采用一种新的交互似然图像跟踪器,非迭代地计算,对不同区域进行跟踪取样来减少数据之间的关联需求。在图像集上将所提算法与多种改进算法进行评估对比,结果表明所提算法具有非常好的鲁棒性及精确性。

关键词: 核主成分分析, 交互似然, 卷积神经网络, 目标跟踪, 深度学习

Abstract: The traditional video target tracking methods usually prossess low accuracy.This paper proposed an improved scheme based on convolution neural network and the interactive likelihood algorithm,and optimized the particle filter algorithm on the basis of deep learning.To address the issue of deficient nonlinear fitting ability of the principal component analysis (PCA),a kernel principal component analysis (KPCA) tracking algorithm was provided to obtain the deeper characteristic expression of the target.Then,a novel interactive likelihood (ILH) method was performed for image-based trackers,which can non-iteratively compute the sampling of areas belonging to different targets and thus reducing the requirement for data associations.The performance of the presented algorithm was evaluated in comparison with several related algorithms on image datasets.The experimental results demonstrate the great robustness and accuracy of the proposed algorithm.

Key words: Convolutional neural network(CNN), Deep learning, Interactive likelihood(IL), Kernel principal component analysis(KPCA), Target tracking

中图分类号: 

  • TP181
[1]邓力,俞栋.深度学习:方法及应用[M].北京:机械工业出版社,2016.
[2]NAIYAN W,SIYI L,ABHINAY G,et al.Transferring Rich Feature Hierarchies for Robust Visual Tracking[J].arXiv preprint arXiv,2015:1501-04587.
[3]HOSEINNEZHAD R,VO B N,VO B T,et al.Visual tracking of numerous targets via multi-Bernoulli filtering of image data[J].Pattern Recognition,2012,45(10):3625-3635.
[4]LI H Y,BI D Y,ZHA Y F,et al.An Easily Initialized Visual Tracking Algorithm Based on Sim ilar Structure for Convolutional Neural Network[J].Journal of Electronics and Information Technology,2016,38(1):1-7.(in Chinese)
李寰宇,毕笃彦,查宇飞,等.一种易于初始化的类卷积神经网络视觉跟踪算法[J].电子与信息学报,2016,38(1):1-7.
[5]HU W P,HU H F,GU J Q,et al.Kernel Principal Component Analysis Network Method for Face Recognition[J].Acta Scientiarum Naturalium Universitatis Sunyatseni,2016,55(5):48-51,56.(in Chinese)
胡伟鹏,胡海峰,顾建权,等.核主成分分析网络的人脸识别方法[J].中山大学学报(自然科学版),2016,55(5):48-51,56.
[6]WANG L,W O,WANG X,et al.Visual Tracking with Fully Convolutional Networks[C]∥2015 IEEE International Confe-rence on Computer Vision(ICCV).IEEE Computer Society.2015:3119-3127.
[7]LI H Y,BI D Y,YANG Y,et al.Research on Visual Tracking Algorithm Based on Deep Feature Expression and Learning[J].Journal of Electronics and Information Technology,2015,37(9):2033-2039.(in Chinese)
李寰宇,毕笃彦,杨源,等.基于深度特征表达与学习的视觉跟踪算法研究[J].电子与信息学报,2015,37(9):2033-2039.
[8]FAN J,XU W,WU Y.Human Tracking Using Convolutional Neural Networks[J].IEEE Transactions on Neural Networks,2010,21(10):1610-1623.
[9]MA C,XU Y,NI B,et al.When Correlation Filters Meet Convolutional Neural Networks for Visual Tracking[J].IEEE Signal Processing Letters,2016,23(10):1454-1458.
[10]PANG S C,DU A A,YU Z Z.Robust multi-object tracking using deep learning framework[J].Journal of Optical Technology,2015,82(8):516-528.
[11]WEI C,CHEN J,SONG Z.Nonlinear process monitoring using improved kernel principal component analysis[C]∥ Control and Decision Conference.IEEE,2016:5832-5843.
[12]MARTNEZ S V,KNEBEL J F,THIRAN J P.Multi-object tracking using the particle filter algorithm on the top-view plan[C]∥Signal Processing Conference.European:IEEE,2015:285-288.
[13]XIAO J,STOLKIN R,OUSSALAH M.Continuously Adaptive Data Fusion and Model Relearning for Particle Filter Tracking with Multiple Features[J].IEEE Sensors Journal,2016,16(8):2639-2649.
[14]HINTON G E.Learning multiple layers of representation[J].Trends in Cognitive Sciences,2007,11(10):428.
[15]FAN F,GAO G,LI J.Visual object tracking based on perceptual hash algorithm[C]∥ International Computer Conference on Wavelet Active Media Technology and Information Processing.IEEE,2016:233-236.
[16]HOAK A,MEDEIROS H,POVINELLI R J.Image-BasedMulti- Target Tracking through Multi-Bernoulli Filtering with Interactive Likelihoods[J].Sensors,2017,17(3):501-522.
[17]标准的OTB-2015图像集[OL].http://cvlab.hanyang.ac.kr/tracker_b-enchmark/datasets.html.
[1] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[2] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[3] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[4] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[5] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[6] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[7] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[8] 沈祥培, 丁彦蕊.
多检测器融合的深度相关滤波视频多目标跟踪算法
Multi-detector Fusion-based Depth Correlation Filtering Video Multi-target Tracking Algorithm
计算机科学, 2022, 49(8): 184-190. https://doi.org/10.11896/jsjkx.210600004
[9] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[10] 李其烨, 邢红杰.
基于最大相关熵的KPCA异常检测方法
KPCA Based Novelty Detection Method Using Maximum Correntropy Criterion
计算机科学, 2022, 49(8): 267-272. https://doi.org/10.11896/jsjkx.210700175
[11] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[12] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[13] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[14] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[15] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!