透過您的圖書館登入
IP:18.224.59.231
  • 學位論文

利用深度攝影機擷取手指特徵之即時動態手勢辨識

Real-time Dynamic Hand Gesture Recognition Based on Finger Features Using a Depth Sensing Camera

指導教授 : 顏嗣鈞

摘要


在人機互動的領域中,人們一直想要找個方法來取代傳統的鍵盤和滑鼠,所以在這情況下就衍生了利用手勢來進行對機器的操作。而運用手勢辨識的概念不但經常在科技電影可以看到,也在具有多點觸控的智慧型手機和觸控板上成為流行,但是觸控式螢幕尺寸的大小限制將會影響到手勢辨識的準確性以及多元性,因此本論文目的為利用三維空間資訊為主來達到即時的手勢辨識,且在無多點觸控能力之螢幕的情況下,依舊能夠辨識出使用者所作的手勢。 本系統使用Kinect感應器得到完整的三維資訊,並運用深度直方圖機制,無論在任何環境下都可以偵測出使用者的手,在使用K-means分群法下,即使手有重疊的情況也可以正確地區分數量。為了發展更多元的手勢,我們利用多指的合併和分開來發展更多元的手勢,但因為每個人的習慣和手指的粗細不盡相同,因此我們利用了機器學習和支持向量機依照不同的特徵值來判斷手指正確的數量,最後再利用有限狀態機來判斷動態的手勢。

並列摘要


In recent years, people have tried to find more efficient ways to replace the old-fashioned keyboards and mice in communication between humans and computers. Among several attempts in this direction, gestures have received considerable attention as they already serve as a natural form of human interaction. The use of gestures in human-computer interaction, once only appeared in science fiction movies, has gradually become reality thanks to the advance of technologies such as multi-touch screens. The size of a touch screen, however, restricts the development of gesture recognition to a certain extent. The objective of this thesis is to develop a real-time system capable of recognizing hand gestures with a touch-less interface by taking advantage of 3D sensing capabilities of depth information. The proposed system acquires accurate 3D data from Kinect, and use depth histograms in order to perform hand localization from any arbitrary background. The K-means clustering algorithm is used to determine the number of hands found in the image, even when occlusion occurs due to hand overlapping. In order to accommodate a diversity of gestures, we take advantage of different combinations and separations of fingertips. To cope with a variety of user habits and thickness of fingers, we use machine learning and SVM to determine the accurate amounts of fingers based on different features. Finally, a finite-state machine is used to determine the dynamic gestures of hand movements.

參考文獻


[2] P. Mistry and P. Maes, “SixthSense – a wearable gestural interface,” Proceedings of SIGGRAPH Asia 2009, 2009.
[4] W. Westerman, J. G. Elias, and A. Hedge, “Multi-touch: a new Tactile 2-D Gesture Interface for Human-Computer Interaction, “ Proceedings of the Human Factors and Ergonomics Society 45th Annual Meeting, vol. 1, pp. 632-636, 2001.
[5] S. Mitra and T. Acharya, “Gesture Recognition: a survey,” Proceedings of IEEE Transactions on Systems, Man, and Cybernetics – Part C, vol. 37, no. 3, pp. 311-324, 2007.
[6] T. Starner and A. Pentland, “Real Time American Sign Language Recognition from Viedo using Hidden Markov Models,” MIT Media Lab, Cambridge, Massachusetts, Tech. Rep. 375, 1995.
[8] L. Gallo, A. P. Placitelli, M. Ciampi, “Controller-Free Exploration of Medical Image Data: Experiencing the Kinect,” Proceedings of 24th International Symposium on Computer-based Medical systems, pp. 1-6, 2011.

延伸閱讀