

Deep Visual Tracking using Single Domain Neural Network with Reptile Meta-Learning

Advisor: 蔡奇謚




The goal of visual tracking is to locate a specific object, in the form of a bounding box, throughout a video or image sequence. Although visual tracking has been one of the main topics in computer vision for decades, it remains very challenging. Tracking requires algorithms to recognize and locate objects at the instance level, and this requirement poses unique challenges, especially for deep-learning-based trackers that require online learning during tracking. Deep learning models can provide strong and robust feature representations, but they overfit easily when given only a very small training set, which degrades overall tracking performance. To deal with this issue, the proposed algorithm adopts a first-order meta-learning technique (Reptile) so that, during initialization, the visual tracker requires only a few training examples and a few optimization steps to perform well. Experimental results show that it achieves a mean success rate of up to 66.4% on the OTB2015 dataset.
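
As an illustration of the first-order meta-learning idea described above, the following is a minimal sketch of the generic Reptile update on a toy problem. It is not the tracker proposed in this thesis: the scalar model y = theta * x, the two toy tasks, and all names and hyperparameters (`inner_sgd`, `reptile`, `meta_lr`, `inner_lr`, and so on) are assumptions chosen only for illustration. Each task is adapted with a few gradient steps, and the shared initialization is then moved a small step toward the adapted weights.

```python
def inner_sgd(theta, xs, ys, lr=0.05, steps=5):
    """Take a few gradient steps on mean squared error for the model y = theta * x."""
    phi = theta
    for _ in range(steps):
        # Gradient of mean((phi * x - y)^2) with respect to phi.
        grad = sum(2.0 * (phi * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        phi -= lr * grad
    return phi


def reptile(tasks, theta=0.0, meta_lr=0.1, meta_iters=200, inner_lr=0.05, inner_steps=5):
    """Learn an initialization that adapts to every task in a few steps."""
    for _ in range(meta_iters):
        for xs, ys in tasks:
            phi = inner_sgd(theta, xs, ys, inner_lr, inner_steps)
            # Reptile meta-update: move the initialization toward the adapted
            # weights; no second-order gradients are needed.
            theta += meta_lr * (phi - theta)
    return theta


# Two hypothetical toy tasks with different true slopes (1.0 and 3.0).
xs = [-1.0, -0.5, 0.5, 1.0]
tasks = [(xs, [w * x for x in xs]) for w in (1.0, 3.0)]
theta = reptile(tasks)
```

In this toy setting the learned initialization settles between the two task optima, so the same small number of inner steps ends closer to either task's solution than when starting from zero. This is the property the abstract relies on when the tracker is initialized from only a few examples.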


