透過您的圖書館登入
IP:18.118.226.26
  • 學位論文

用於漫畫式影片摘要之對話角色定位

Speaker Localization for Comic-Styled Film Summarization

指導教授 : 莊永裕
共同指導教授 : 陳炳宇
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


漫畫是一種靜態呈現動態跟語音資訊的方法。漫畫式電影摘要是一個新而有用的方法可以在二維的平面空間中呈現影片中動態跟語音的資訊。然而,要將一部電影轉換成一本漫畫書有很多挑戰需要被克服。其中之一就是我們需要將漫畫式的對話框放在畫格中正確的位置。我們提出一個可用於漫畫式電影摘要的半自動說話角色定位系統使得我們可以在最少的使用者提示下找出對話框放置的位置。在一部影片中,演員們可能會做出很複雜的動作或互動,諸如打光之類的環境因素也可能變化多端。為了得到可靠的結果,我們的系統是基於一個貪多式人臉分類演算法,而且獲得了足夠好的結果可以在實際的漫畫式影片摘要系統中使用。

並列摘要


Comics is a static presentation but has temporal and speech information. A comic-styled film summarization is a new and useful method to summarize a movie in 2D. But, there are many challenges when we try to transform a movie into a comic book. One of them is that we should place dialog balloons at correct positions. We propose a semi-automatic speaker localization system for comic-styled film summarization such that we can locate speakers with very less user hints. In a film, actors may do many complex motions and interactions, and environment such as lighting may change very much. For robustness, this work is based on a greedy face clustering algorithm, and has good enough performance in practice to be used in a comic-styled film summarization system.

參考文獻


[Can86] J Canny. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell., 8(6):679–698, 1986.
[CD00] R. Cutler and L. Davis. Look who’s talking: speaker detection using video and audio correlation. In Proceedings of IEEE International Conference on Multimedia and Expo, volume 3, pages 1589–1592, 2000.
[Lie99] Rainer W. Lienhart. Comparison of automatic shot boundary detection algorithms. In SPIE, 1999.
[LTB96] J. Luettin, N.A. Thacker, and S.W. Beet. Learning to recognise talking faces. In Proceedings of the 13th International Conference on Pattern Recognition, volume 4, pages 55–59, 1996.
[PBC05] S.L. Phung, Sr. Bouzerdoum, A., and Sr. Chai, D. Skin segmentation using color pixel classification: analysis and comparison. In Processings of IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005.

延伸閱讀