论文:2019,Vol:37,Issue(6):1310-1319
引用本文:
张博言, 钟勇, 李振东. 基于动态模式和卷积特征的单目标跟踪算法[J]. 西北工业大学学报
ZHANG Boyan, ZHONG Yong, LI Zhendong. A Visual Object Tracking Algorithm Based on Dynamics Pattern and Convolutional Feature[J]. Northwestern polytechnical university

基于动态模式和卷积特征的单目标跟踪算法
张博言1,2, 钟勇1,2, 李振东1,2
1. 中国科学院 成都计算机应用研究所, 四川 成都 610041;
2. 中国科学院大学, 北京 100049
摘要:
基于深度特征的目标跟踪网络凭借其对目标视觉特征强大的表征能力获得了令人印象深刻的表现。然而,在一些复杂的跟踪场景中常常涉及目标物体快速运动、光线变化、旋转等,仅仅依赖深度视觉特征难以准确地表征目标物体。针对以上问题,提出了一种基于融合特征的视频单目标跟踪网络。该网络结合了2种深度学习模型:卷积神经网络(convolutional neural network,CNN)和长短期记忆网络(long short-term memory,LSTM)。首先,运用长短期记忆网络提取目标基于时间序列的动态特征,产生当前时刻的目标状态,由此获得准确的预处理目标框;然后基于产生的预处理目标框,使用卷积神经网络提取目标的深度卷积特征,确定目标位置;在跟踪过程中,通过采集成功跟踪时目标样本,对网络参数进行短期和长期更新,以增强网络的适应性。对比实验结果表明,所提出的方法在目标运动过程中被部分遮挡、运动模糊、快速运动情况下具有优异的跟踪表现和鲁棒性。
关键词:    目标跟踪    卷积神经网络    长短期记忆网络   
A Visual Object Tracking Algorithm Based on Dynamics Pattern and Convolutional Feature
ZHANG Boyan1,2, ZHONG Yong1,2, LI Zhendong1,2
1. Chengdu Institute of Computer Applications, Chinese Academy of Sciences, Chengdu 610041, China;
2. University of Chinese Academy of Sciences, Beijing 100049, China
Abstract:
Deep visual feature-based method has demonstrated impressive performance in visual tracking attributing to its powerful capability of visual feature representation. However, in some complex environments such as dramatic change of appearance, illumination variation and rotation, the extracted deep visual feature is insufficient for accurately characterizing the target. To solve this problem, we present an integrated tracking framework which combines a Long Short-Term Memory (LSTM) network and a Convolutional Neural Network (CNN). Firstly, the LSTM extracted dynamics feature of target on time sequence, resulting the state of target at present time step. With that state, the accurate preprocessed bounding box was obtained. Then, deep convolutional feature of the target was extracted using a CNN, based on the processed bounding box. Finally, the position of the target was determined based on the score of the feature. During tracking stage, in order to improve the adaptation of the network, the parameters of the network were updated using samples of the target captured while successful tracking. The experiment shows that the proposed method achieves outstanding tracking performance and robustness in cases of partial occlusion, out-of-view, motion blur and fast motion.
Key words:    visual object tracking    convolutional neural network    long short-term memory network   
收稿日期: 2018-12-24     修回日期:
DOI: 10.1051/jnwpu/20193761310
基金项目: 四川省科技厅科技成果转化项目(2014CC0043)资助
通讯作者:     Email:
作者简介: 张博言(1991-),中国科学院成都计算机应用研究所博士研究生,主要从事计算机视觉、目标跟踪研究。
相关功能
PDF(3349KB) Free
打印本文
把本文推荐给朋友
作者相关文章
张博言  在本刊中的所有文章
钟勇  在本刊中的所有文章
李振东  在本刊中的所有文章

参考文献:
[1] LAURENSE V A, GOH J Y, GERDES J C. Path-Tracking for Autonomous Vehicles at the Limit of Friction[C]//Proceedings of the American Control Conference(ACC), 2017:5586-5591
[2] SIVANANTHAM S, PAUL N N, IYER R S. Object Tracking Algorithm Implementation for Security Applications[J]. Far East Journal of Electronics and Communications, 2016, 16(1):1-13
[3] ONATE J M B, CHIPANTASI D J M, ERAZO N R V. Tracking Objects Using Artificial Neural Networks and Wireless Connection for Robotics[J]. Journal of Telecommunication, Electronic and Computer Engineering, 2017, 9(1/2/3):161-164
[4] PEREZ P, HUE C, VERMAAK J, et al. Color-Based Probabilistic Tracking[C]//Proceedings of European Conference on Computer Vision, 2002:661-675
[5] WANG Z, YANG X, XU Y, et al. CamShift Guided Particle Filter for Visual Tracking[J]. Pattern Recognition Letters, 2009, 30(4):407-413
[6] 李冠彬, 吴贺丰. 基于颜色纹理直方图的带权分块均值漂移目标跟踪算法[J]. 计算机辅助设计与图形学学报, 2011, 23(12):2059-2066 LI Guanbin, WU Hefeng. Weighted Fragments-Based Meanshift Tracking Using Color-Texture Histogram[J]. Journal of Computer-Aided Design and Computer Graphics, 2011, 23(12):2059-2066(in Chinese)
[7] HARE S, SAFFARI A, TORR P H S. Struck:Structured Output Tracking with Kernels[C]//Proceedings of the IEEE International Conference on Computer Vision, 2011:263-270
[8] HENRIQUES, JOÃO F, CASEIRO R, MARTINS P, et al. High-Speed Tracking with Kernelized Correlation Filters[J]. IEEE Trans on Pattern Analysis & Machine Intelligence, 2014, 37(3):583-596
[9] BABENKO B, YANG M H, BELONGIES S. Robust Object Tracking with Online Multiple Instance Learning[J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2011, 33(8):1619-1632
[10] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet Classification with Deep Convolutional Neural Networks[C]//Proceedings of International Conference on Neural Information Processing Systems, 2012:1097-1105
[11] DANELLJAN M, GUSTAV HÄGER, KHAN F S, et al. Convolutional Features for Correlation Filter Based Visual Tracking[C]//Proceedings of IEEE International Conference on Computer Vision Workshop, 2016:621-629
[12] NAM H, HAN B. Learning Multi-Domain Convolutional Neural Networks for Visual Tracking[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognitionm, 2016:4293-4302
[13] SUTSKEVER I, VINYALS O, LE Q V. Sequence to Sequence Learning with Neural Networks[C]//Proceedings of Advances in Neural Information Processing Systems, 2014:3104-3112
[14] GRAVES A, MOHAMED A, HINTON G. Speech Recognition with Deep Recurrent Neural Networks[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, 2013:6645-6649
[15] CUI Z, XIAO S, FENG J, et al. Recurrently Target-Attending Tracking[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2016:1449-1458
[16] GREFF K, SRIVASTAVA R K, KOUTNÍK J, et al. LSTM:A Search Space Odyssey[J]. IEEE Trans on Neural Networks and Learning Systems, 2017, 28(10):2222-2232
[17] RUSSAKOVSKY O, DENG J, SU H, et al. Image Net Large Scale Visual Recognition Challenge[J]. International Journal of Computer Vision, 2014, 115(3):211-252
[18] VEDALDI A, LENC K. Matconvnet:Convolutional Neural Networks for Matlab[C]//Proceedings of the 23rd ACM International Conference on Multimedia, 2015:689-692
[19] MVLLER M, BIBI A, GIANCOLA S, et al. Tracking Net:A Large-Scale Dataset and Benchmark for Object Tracking in the Wild[C]//Proceedings of European Conference on Computer Vision, 2018:310-327
[20] WU Y, LIM J, YANG M H. Object Tracking Benchmark[J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2015, 37(9):1834-1848
[21] DANELLJAN M, HAGER G, SHAHBAZ KHAN F, et al. Learning Spatially Regularized Correlation Filters for Visual Tracking[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015:4310-4318
[22] VALMADRE J, BERTINETTO L, HENRIQUES J, et al. End-to-End Representation Learning for Correlation Filter Based Tracking[C]//Proceedings of Computer Vision and Pattern Recognition, 2017:5000-5008