  • Thesis

Learning Interrogation Strategies Based on Fear Emotion in Virtual Drama Dialogue

Advisor: 蘇豐文

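Abstract


Dialogue systems have been under development for many years, and as they have become widespread, dialogue strategies have drawn increasing attention. Choosing an appropriate dialogue strategy, however, is not easy. In this thesis, we propose using reinforcement learning to generate strategies for interrogation dialogue. Our first contribution is a reinforcement learning framework for generating interrogation dialogue strategies. Our second contribution is taking the agents' social context and emotional states into account when selecting a dialogue strategy. For simulation and experiments, we build the background knowledge of the world from a passage of a novel. In particular, we simulate the suspect's emotional changes with an emotion generation function, and the superintendent produces different dialogue strategies in response to those changes. The results show that the superintendent not only detects efficiently the moments at which the suspect lies, but also obtains more correct information as a result.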

Parallel Abstract


Dialogue systems have been under development for many years. As they become ubiquitous, the dialogue strategies of virtual agents are receiving increasing attention. However, selecting a proper dialogue strategy in a specific social context is not a trivial task, because the world is complex. In this thesis, we propose using reinforcement learning to learn strategies for “interrogation dialogue” in virtual drama. Our first contribution is a new reinforcement learning framework that can learn dialogue strategies from interrogation dialogue. Our second contribution is bringing the social context and emotional states of the agents into the dialogue strategies. To demonstrate and simulate the approach, we build the background knowledge of the world from a scenario in a detective novel. In particular, we model the suspect's emotional variations with a generation function of human emotion based on the psychological literature, so that the superintendent can learn dialogue strategies based on the suspect's emotional context. The results show that the learned dialogue policy is very sensitive in detecting when the suspect is lying, and that the superintendent obtains more correct answers.
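
To make the idea of learning a dialogue strategy from a suspect's emotional state concrete, the following is a minimal sketch only. It uses tabular Q-learning, which is one common reinforcement learning algorithm; the abstract does not say which algorithm, states, actions, or rewards the thesis uses. The action names, the four fear levels, the `step` suspect model, and all parameter values are hypothetical stand-ins chosen for illustration.

```python
import random

# Hypothetical names throughout: the abstract does not list the thesis's
# actual states, actions, or rewards.
ACTIONS = ["ask_gently", "press_hard"]
N_FEAR = 4  # fear discretised into levels 0 (calm) .. 3 (very afraid)


def step(fear, action):
    """Toy suspect model standing in for the emotion generation function."""
    if action == "press_hard":
        # Pressure raises fear but yields no information by itself.
        return min(fear + 1, N_FEAR - 1), 0.0
    # Assumption: a gentle question earns a truthful answer only at peak fear.
    reward = 1.0 if fear == N_FEAR - 1 else 0.0
    return max(fear - 1, 0), reward


def learn_policy(episodes=2000, steps=20, alpha=0.1, gamma=0.9,
                 epsilon=0.3, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_FEAR)]
    for _ in range(episodes):
        fear = rng.randrange(N_FEAR)  # random starting fear level
        for _ in range(steps):
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))  # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[fear][i])
            next_fear, reward = step(fear, ACTIONS[a])
            # Standard Q-learning update toward the bootstrapped target.
            target = reward + gamma * max(q[next_fear])
            q[fear][a] += alpha * (target - q[fear][a])
            fear = next_fear
    # Greedy action for each fear level.
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[s][i])]
            for s in range(N_FEAR)]


policy = learn_policy()
for level, action in enumerate(policy):
    print(f"fear level {level}: {action}")
```

In this toy model the learned policy maps each fear level to one interrogation move, which mirrors, in a much simplified form, the abstract's idea of choosing a dialogue strategy according to the suspect's emotional context.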

