人類的眼睛是一種強力的非語言交流工具:眼神不僅能夠表達興趣、展現專注、透露意圖,更能在許多面對面互動中扮演重要的角色。此外,人們的視線在社交活動中會無意識地恪守某種不成文的規則。然而當我們談到機器人時,鮮少有人機互動應用專門處理人類視線,且既使有,他們也只處理特定層面與場合。本論文致力於創造一套具有自動地感應、理解和對人類眼神做出反應功能的全面性智慧型系統,以利提升人機互動的流暢度並使機器人更加人性化。使用於該實驗室開發輪型機器人身上的系統,利用經卷積神經網路處理過的二維圖像來偵測並追蹤視線,然後使用嶄新的變形漸進式隱馬爾可夫模型來評估與該機器人交流的對象的意圖。最後,機器人按照所獲得的人類意圖來做出反應。該系統準確率高達80%以上,已經被證明可以增加人類接近時的交流成立成功率且降低會話中錯誤的發生,也被證明在提升總體人機互動之使用者體驗水平上是有效的提升。
Human eyes represent a strong non-verbal communication tool: eye gaze not only gives cues about interest, attention and intention of people, but also manages several kinds of social face-to-face interaction. Moreover, people unconsciously but rigorously follow specific unwritten rules when directing their gaze during social interactions. When it comes to robots, however, only few applications take into account human gaze in Human-Robot Interaction (HRI), focusing on some specific aspects or scenarios. This thesis aims to create a comprehensive intelligent system to automatically sense, understand and react to human eye gaze, in order to both improve HRI smoothness and make robots behave more human-likely. The online system, mounted on a mobile robot, detects and tracks the gaze of humans from 2D images based on a Convolutional Neural Network (CNN), it then uses a novel incremental variant of Hidden Markov Model (iCHMM) to estimate the intention of the person with whom the interaction is taking place. Finally, with this information, the robot acts accordingly to its own intention. The system has been proved to have an overall accuracy greater than 80% in correctly estimate the intention of people. The robot both increased the success rate of interaction establishment during human approaching and decreased turn taking mistakes in conversations. It was also proved to be effective in raising the overall quality of user experience during HRI.