從單張深度影像估測三維手部骨架模型

在本篇論文中，我們提出了一個手部骨架估測系統，能從單一張深度影像估測出影像中手部關節點的位置。手部骨架關節估測能廣泛運用於人機互動與手勢辨識等領域。最早有許多研究致力於人體的姿勢估測與識別，並發展出相應的感應器與套件應用於人機與體感方面；而相較於此，手勢由於單一的膚色造成特徵點不明顯，所占區域太小容易受限於環境與解析度，遮蔽與變化自由度高等困難，使得達成較為不易。目前也有基於模型或學習的方法來克服此困難。本方法結合了影像上的學習方法與主動形狀模型的技術，用於手部骨架關節估測，並實際以支持向量機測試估測結果，確實達到手勢辨識的應用。提出的系統主要可分為兩部分：第一部分必須先針對輸入影像估測手部形狀類別，用以選擇使用相應的模型；第二部分則利用基於骨架的主動形狀模型，從初始位置開始遞迴地優化骨架模型。為此，在訓練過程中，首先會對資料庫進行隨機森林的訓練，得到能由輸入像素與影像組合的資料點預測屬於該群的機率值的多棵樹。同時也在各自群中的資料做關節位置的主成分分析與建立關節點表現模型。因此，對任一單張包含手部區域影像可由隨機森林決定出適合的主成分模型，並偵測三維骨架模型。在實驗結果中，我們透過客觀的數據來展示提出的系統能夠有效地偵測關節點位置，並可運用於手勢辨識。

關鍵字

手部骨架估測

並列摘要

In this thesis, we propose a novel 3D hand skeleton estimation system that can estimate the positions of hand joints from a single depth image. The hand skeleton estimation can be widely used in the fields of Human Computer Interface (HCI) and gesture recognition. Numerous researches on depth sensors have been endeavored with applications in these domains. However, the monotonous skin color, self-occlusions, view variations and high degree of freedom are the difficulties for 3D hand skeleton model estimation and gesture recognition from color or RGBD images. Currently, the model-based and discriminative methods have been proposed for solving these problems. In this work, we combine the vision-based learning and Active Shape Model approaches for 3D hand skeleton estimation from a single depth image. The proposed approach is decomposed into two principal steps: the first part is to select the corresponding ASM model from the depth image, and the second part uses the skeleton-based ASM to iteratively refine the joint positions. In the training phase, we first build a random forest from a dataset of annotated hand depth images, which are clustered via K-means algorithm. With the random forest, we can compute the probability of each cluster for the input data point. Meanwhile, the PCA skeleton models and joints profile models are constructed for each cluster. Thus, for an input hand depth image, the system first determines the associated ASM model from random forest and then estimates the 3D hand skeleton model with a modified ASM fitting process. Our experiments demonstrate the effective 3D hand skeleton estimation by using the proposed algorithm for quantitative evaluations.

並列關鍵字

hand skeleton estimation

參考文獻

[1] R.-Y. Wang, Y. Robert, and J. Popović. Real-time hand-tracking with a color glove. ACM Transactions on Graphics (TOG), 28(3):63, 2009.

[2] M. de La Gorce, Martin, D. J. Fleet, and N. Paragios. Model-based 3d hand pose estimation from monocular video. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 33(9):1793-1805, 2011.

[3] I. Oikonomidis, M. Lourakis, and A. A. Argyros. Evolutionary Quasi-Random Search for Hand Articulations Tracking. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.

[4] C. Keskin, F. Kıraç, Y. E. Kara and L. Akarun. Hand pose estimation and hand shape classification using multi-layered randomized decision forests. European Conference on Computer Vision (ECCV), pages 852-863, 2012.

[5] D. Tang, H.-J. Chang, A. Tejani and T.-K. Kim. Latent regression forest: Structured estimation of 3d articulated hand posture. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.

國際替代計量

從單張深度影像估測三維手部骨架模型

主題瀏覽