透過您的圖書館登入
IP:13.58.201.235
  • 學位論文

以深度影像及三維樣板比對為基礎之即時頭部運動軌跡追蹤

Depth-Based Real Time Head Motion Tracking Using 3D Template Matching

指導教授 : 歐陽明

摘要


在本篇論文中,我們對於以深度影像為基礎達成即時頭部軌跡追蹤之問題提出了兩種不同的方法。在方法一中,我們在深度圖上對使用者的正臉取樣產生點雲(Point Cloud),並使用這些點雲以最小平方法計算出最佳近似的平面,再將臉部輪廓以最小平方法計算出最佳近似的橢圓,以此平面之法向量及橢圓之傾斜角計算出頭部之旋轉角度。在方法二中,我們使用梯度坡降法迭代地將特定距離函數進行最佳化,此方法可算出更精準的旋轉角度及移動距離。我們所提出的這兩個方法,在單一CPU運算下皆可達到30fps的計算速度。在此系統所使用之攝影器材─Microsoft Kinect以及Asus Xtion Pro,乃較為普及、平價、易取得且易使用的深度攝影機,其優點亦伴隨著較高的拍攝雜訊。為了使系統得以不被環境光線改變所影響,我們只利用其拍攝出之深度影像為輸入來源做計算。在本篇論文中,我們證明,三維空間中,六個自由度的即時頭部軌跡追蹤,是可以經由分析帶有雜訊之深度影像達成的。

並列摘要


In this thesis, we propose a system to estimate head poses only using depth information in real-time. Two methods are developed. First, assuming that a head can be approximated by a bounding box, we find the best fitted plane for the frontal face by the least square error method. Thus, the normal vector of this plane represents the head orientation. Second, an optimization method based on 3D model fitting is developed. We iteratively minimize the distance between source and target point clouds of a user's head. This method is more robust and the results are more precise. Both of the proposed methods give fully real-time responses (30fps) without needing the GPU speedup. We adopt a commodity depth sensor named Microsoft Kinect as well as Asus Xtion, and use the depth image as the only input so that our system will not be affected by illumination variations. The simplicity of this acquisition device comes at the cost of frequent noises in the acquired data. We demonstrate that 6 degree of freedom real-time head motion tracking in 3D space can be achieved with noisy depth data.

參考文獻


[1] V.N. Balasubramanian, Jieping Ye, and S. Panchanathan. Biased manifold embedding: A framework for person-independent head pose estimation. In Computer Vision and Pattern Recognition, 2007. CVPR ’07. IEEE Conference on, pages 1 –7, june 2007.
[3] L. Chen, L. Zhang, Y. Hu, M. Li, and H. Zhang. Head pose estimation using fisher manifold learning. In Analysis and Modeling of Faces and Gestures, 2003. AMFG 2003. IEEE International Workshop on, pages 203 – 207, oct. 2003.
[5] G. Fanelli, T. Weise, J. Gall, and L. Van Gool. Real time head pose estimation from consumer depth cameras. In 33rd Annual Symposium of the German Association for Pattern Recognition (DAGM’11), September 2011.
[6] Sotiris Malassiotis and Michael G. Strintzis. Robust real-time 3d head pose estimation from range data. Pattern Recogn., 38:1153–1165, August 2005.
[7] Y. Matsumoto and A. Zelinsky. An algorithm for real-time stereo vision implementation of head pose and gaze direction measurement. In Automatic Face and Gesture Recognition, 2000. Proceedings. Fourth IEEE International Conference on, pages 499 –504, 2000.

延伸閱讀