Gaze Estimation under Different Head Poses

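This thesis presents a gaze estimation system that estimates where a person is currently looking from eye images, across different users and different head poses. This research supports the development of human-computer interaction modes that differ from touch or motion-sensing control. To make the system usable by different subjects, we train on the UT dataset, which covers many head poses and many subjects, so that the system learns a wide range of head movements. In addition, we build a 3D face model to estimate the head pose and obtain 3D rotation information, so that the entire appearance-based, learning-based gaze estimation pipeline needs only a single camera, which broadens its applicability and generality. Because gaze estimation is a regression problem, we adopt the deep learning architectures that have become popular in recent years. However, most gaze estimation algorithms determine where a person is looking from the pupil position under a fixed head pose. Such work does not suit an everyday setting such as watching television, where objects move or the viewer sits at different positions, and the line of sight shifts as the head turns. To handle the changes in eye shape caused by head movement, we therefore train a separate deep network for each local range of head poses to estimate the gaze position. Our experiments show that this approach effectively addresses gaze estimation under different head poses and improves both training time and performance.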

Keywords

Gaze estimation; deep learning; human-computer interaction; computer vision

Parallel Abstract

In this thesis, we propose a new gaze estimation algorithm that estimates where a user is looking from eye images. The algorithm trains multiple convolutional neural networks (CNNs) as regression networks that estimate gaze angles from eye images. Because it explicitly uses head pose information in the estimation framework, the algorithm provides accurate gaze estimates for users with different head poses. To make the system person-independent, we train the deep CNN regression networks on the UT Multiview dataset, which contains a large number of subjects with large head pose variations. In addition, we estimate the head pose from the 2D face image and a generic 3D face model, which allows the proposed algorithm to be widely used for appearance-based gaze estimation in practice. Our experimental results show that the proposed system improves the accuracy of appearance-based gaze estimation under head pose variations compared with previous methods.
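The idea of using a separate regression network for each local range of head poses can be illustrated with a minimal sketch. This is not the thesis implementation: the pose-region centers, the `PoseRoutedGazeEstimator` class, the 36x60 eye-image size, and the constant-output stand-ins for trained CNNs are all hypothetical, chosen only to show how a head pose might select which regressor handles an eye image.

```python
import numpy as np


def nearest_pose_region(head_pose, centers):
    """Return the index of the pose-region center closest to head_pose.

    head_pose and each center are (yaw, pitch) pairs in degrees (an assumption).
    """
    dists = np.linalg.norm(centers - head_pose, axis=1)
    return int(np.argmin(dists))


class PoseRoutedGazeEstimator:
    """Dispatch each eye image to the regressor trained for its head-pose region."""

    def __init__(self, centers, regressors):
        self.centers = np.asarray(centers, dtype=float)
        self.regressors = list(regressors)
        if len(self.centers) != len(self.regressors):
            raise ValueError("need exactly one regressor per pose region")

    def predict(self, eye_image, head_pose):
        k = nearest_pose_region(np.asarray(head_pose, dtype=float), self.centers)
        return k, self.regressors[k](eye_image)


def make_dummy_regressor(gaze_angles):
    """Hypothetical stand-in for a trained CNN: always returns fixed gaze angles."""
    def regress(eye_image):
        return np.array(gaze_angles, dtype=float)
    return regress


# Three illustrative pose regions: head turned left, frontal, head turned right.
centers = [(-30.0, 0.0), (0.0, 0.0), (30.0, 0.0)]
regressors = [
    make_dummy_regressor((-10.0, 0.0)),
    make_dummy_regressor((0.0, 0.0)),
    make_dummy_regressor((10.0, 0.0)),
]
estimator = PoseRoutedGazeEstimator(centers, regressors)

eye = np.zeros((36, 60))  # placeholder eye image; the size is arbitrary
region, gaze = estimator.predict(eye, head_pose=(25.0, 5.0))
print(region, gaze)
```

In a real system each stand-in would be replaced by a network trained only on samples whose head pose falls in that region, and the head pose itself would come from the 2D face image and the generic 3D face model.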

Parallel Keywords

Gaze Estimation; Deep Learning; Human-Computer Interaction; Computer Vision
