透過您的圖書館登入
IP:3.16.69.143
  • 學位論文

用於影像辨識之擴增學習系統演算法與架構設計

Algorithm and Architecture Design of Incremental Learning System for Visual Recognition

指導教授 : 陳良基

摘要


最近幾年機器學習在電腦視覺領域扮演越來越重要的角色,許多電腦視覺的研究與應用都離不開機器學習的技術。除了過去我們所熟知的視覺應用,在未來 也將會因應智慧生活的需求,結合機器學習提供更多的應用,例如家護監控、智慧助理和機器人等,將徹底地改變了每個人的生活。 但是,機器學習要完全能應用到我們的生活中仍有一大段路要走。機器學習演算法仍有許多限制。首先,目前的大多數機器學習算法都是在處理靜態的問題 在一般的機器學習方法中,我們提供完整的訓練資料與資料類別,訓練出模型,但在生活中,所接觸到的影像與資料是動態的而且包含了各種變化,大幅增加了辨識的困難度。另一問題是,識別系統無法處理的沒學習過的類別。一般的機器學習演算法學習新的知識是昂貴的,需要去重新訓練整個模型,因此,我們需要找出一種方法。使機器能夠逐步並有效地學習。 在此論文中,我們首先介紹一些增量學習演算法與應用,以及增量學演算法所面臨的困難與挑戰,而我們的增量式學習系統基於支持向量機器SVM演算法,擴增新的類別於原本的模型上,另外改善原本的演算法讓系統能夠有效率地學習動態的影像資料,篩選出具有代表性的資料保留,大幅減少記憶體使用並讓準確度保持在可接受誤差範圍,最後因應未來即時互動的需求,我們提出可能的硬體架構來加速學習系統。 總結而言,我們的系統應用在圖像以及影像的辨識上,我們提出基於SVM的增量學習系統,最主要的貢獻在於可以逐步擴增學習新的類別,並改進了記憶體使用量與線上運算的時間。

並列摘要


Machine learning has received much attention in the computer vision community in the past few years and is involved various applications. Many future application such as home-care surveillance, intelligent agent and robotics become more and more popular in recent year. However, there are still lots of limitations to apply the machine learning techniques into real-world learning scenario. Most of the current visual learning algorithm are dealing with static recognition problem, assuming that the numbers of categories and the training data are fixed. Another problem is that the recognition system can not handle the unseen category. To learn the new knowledge, it is costly to retrain the whole system each time when a new category is presented. Therefore, we need to figure out a way to make the robotic system learn incrementally and efficiently. In this thesis, a novel incremental learning algorithm are presented. Our incremental learning system is based on SVM learning model and learns new classes in online scenario. We propose a novel incremental strategy to extend our model, and we learn with Learning Vectors, which is proposed to select the representative samples for incremental learning and can largely reduce the data storage. In addition, we also adopt online training techniques in our learning algorithm to learn the streaming data efficiently. In the end, we present the hardware architecture design for our learning system. With the acceleration on training process, the system can deal with new knowledge instantly and it is suitable for many real-world visual learning applications such as human action recognition and multiple object tracking. To sum up, we propose a SVM-based incremental learning system which can learn incrementally and largely reduce the memory with acceptable decease in accuracy comparing with retraining the whole system.

參考文獻


[3] B.-F. Zhang, J.-S. Su, and X. Xu, A class-incremental learning method for multi-class support vector machines in text classification," in Ma-chine Learning and Cybernetics, 2006 International Conference on, pp. 2581-2585, Aug 2006.
[4] O. Oreifej and Z. Liu, Hon4d: Histogram of oriented 4d normals for activity recognition from depth sequences," in Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pp. 716-723, June 2013.
[6] T. K. Ho, Random decision forests," in Document Analysis and Recog-nition, 1995., Proceedings of the Third International Conference on, vol. 1, pp. 278-282 vol.1, Aug 1995.
[7] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, ImageNet Large Scale Visual Recognition Challenge," In-ternational Journal of Computer Vision (IJCV), pp. 1-42, April 2015.
[9] A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classifica-tion with deep convolutional neural networks," in Advances in Neural Information Processing Systems 25 (F. Pereira, C. Burges, L. Bottou,

延伸閱讀