透過您的圖書館登入
IP:3.133.131.168
  • 學位論文

基於深度卷積神經網路之手勢辨識技術研究

Hand Gesture Recognition based on Deep Convolutional Neural Network

指導教授 : 王聖智

摘要


在這篇論文中,我們提出了一個使用任意單一攝影機在不同角度下仍能遠距離辨識多重手勢的技術。此技術不需固定攝影機角度,不需要由特定使用者操作,且能分辨多種手勢。此技術在影片中自動找出使用者的手的位置,並判斷使用者想傳達的訊息,希望能進行遠距離的操作以及在任何背景之下皆能達到手勢辨識的效果。在此設定議題下,為了能在複雜背景與不同視角拍攝的情況下有效的找到手部出現的區域以及辨識使用者所傳達的訊息,我們不採用易被複雜背景所誤導膚色資訊且不使用事先設定之特徵萃取技術,而是利用卷積神經網路有效且準確的學習不同手勢所擁有的特徵,並結合不同形狀與大小的特徵,以找到能分離不同手勢最佳的特徵空間,再利用深度神經網路找出特徵之間的關係以及不同手勢與各特徵的連接,藉此達到手部偵測與多重手勢辨識的成果。此外,我們也利用影片中已經得到的手部位置與移動資訊加上手勢辨識結果,推測之後較有可能出現手部的區域以及最佳的手勢辨識結果。

關鍵字

手勢辨識

並列摘要


In this thesis, we propose an algorithm which recognize hand gestures with a single camera under different view-points within a range remotely. The algorithm can recognize multi-gestures without fixing view-point of the camera or a particular user controlling. In order to find the hand position and recognize the gestures from the video automatically and efficiency in the clutter background under different view-points within a range, we don’t take the skin color as the information which is easily influence by the clutter background, nor do we use the specific feature extraction processing. Instead, we use the convolutional neural network to learn the features in the hand gesture image and combine the different kernel sizes to get the best feature space for separating different gestures. Then we use the deep neural network to find the relationship between the hand features and the gesture classes. Under this setting, we are able to locate the hand and recognize multi hand gestures. Furthermore, with the help of the temporal information for the hand position and motion getting from the video, we are able to infer the most possible area where the hand would appear and the best recognition result.

並列關鍵字

Hand gesture recognition

參考文獻


[1] R. Palm. “Prediction as a candidate for learning deep hierarchical models of data. “Master’s thesis,Technical University of Denmark, DTU Informatics, 2012.
[2] G. E. Hinton and R. R. Salakhutdinov, "Reducing the Dimensionality of Data with Neural Networks," Science, vol. 313, pp. 504-507, 2006.
[4] Rautaray, Siddharth S., and Anupam Agrawal. "Vision based hand gesture recognition for human computer interaction: a survey," Artificial Intelligence Review, 1-54, 2012.
[5] Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets," Neural computation, 18, 1527-1554, 2006.
[6] G. E. Hinton, “A practical guide to training restricted Boltzmann machines,” Technical Report UTML TR 2010-003, Dept. of Computer Science, University of Toronto, 2010..

延伸閱讀