透過您的圖書館登入
IP:18.220.187.178
  • 學位論文

自動建構電影中角色的社群網路方法之研製

An Automatic Approach to Construct Roles’ Social Networks in Movies

指導教授 : 葉梅珍
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


人臉偵測與辨識的研究發展至今已累積了相當多的方法,傳統方法大多數著重於開發各種不同的低階臉部特徵進行分析,但以這個觀點出發的研究成果逐漸趨於極限,尤其用於情況複雜的真實影像中(如照片或電影資料),其面臨著難以將辨識準確率向上提升的瓶頸。檢視近幾年電腦視覺與圖形辨識以及多媒體領域的研究趨勢,許多文獻引進了潛藏語義情境(Latent Semantic Context)或是概念分析(Concept Analysis)的高階特徵來輔助低階的特徵,期望可以藉此突破傳統方法的瓶頸以增進辨識效能,並試圖縮短電腦機器與人類認知之間的語義間隙 (Semantic Gap)。本研究以具有角色互動關係的電影為實驗對象,著手於發展一個非監督式(Unsupervised)自動化的方法在電影中建立角色的社群網路(Roles’ Social Network)這類型的高階資料結構。利用臉部的低階特徵來進行相似性傳遞(Affinity Propagation)分群演算法,將相同角色的人臉軌跡群聚在一起。並提出一個新的方式 - 基於鏡頭交錯切換的線索(Shot Alternation Cues)來量化角色之間的互動程度以完成社群網路的建立。這個方法在我們所建立的社群網路中,更適合用來描述角色之間的關係。最後,我們將極大集團(Maximal Clique)的概念應用於從自動化所建立的社群網路中找出群體(Community)。針對真實的電影資料進行實驗,驗證了我們所提出的方法之有效性。

並列摘要


A vast amount of researches have been conducted on the subject of face detection and face recognition in the past decades. Most existing methods develop low-level features to tackle these problems. However, approaches based on low-level audiovisual features can rarely achieve promising recognition performances given real-world, complex data. Reviewing the literatures in the fields of computer vision, pattern recognition and multimedia computing, many studies have introduced high-level features, such as latent semantic context and concept analysis, based on which complementary solutions have been proposed to the problem. These high-level feature based approaches are shown to be able to more effectively bridge the semantic gap between machine and human perspective, and, thus, improve the recognition performance. In this thesis, we develop an automatic, unsupervised method to construct roles’ social network in movies. The resulting social network is useful for extracting high level features (e.g. the leading roles and the roles’ communities) for the movie’s contents. We first apply the affinity propagation clustering algorithm on preprocessed face tracks and generate face clusters for the roles. Next, we propose a new method that quantifies the interactions between roles based on shot alternation cues. This method is more appropriate to describe the relationship between roles and based on which we construct the roles’ social network. Finally, we use the maximal clique concept to identify communities from our automatically constructed social network. Experiments on real-world data validate the effectiveness of the proposed method.

參考文獻


[02]Elmagarmid, Ahmed K., Managing and Mining Graph Data, Springer US, 2010.
[01]M. E. J. Newman and M. Girvan, “Finding and Evaluating Community Structure in Network”, Physical Reviwe E, vol. 69 no. 2, Aug., 2003.
[03]Chung-Yi Weng, Wei-Ta Chu and Ja-Ling Wu, “Movie analysis based on roles’ social network,” Proc. IEEE ICME, Beijing, China, 2007.
[04]Jae-Ho Lee and Whoi-Yul Kim, “Video Summarization and Retrieval System Using Face Recognition and MPEG-7 Descriptors,” Proc. ACM CIVR, pp.170-178, 2004.
[07]Chung-Yi Weng, Wei-Ta Chu and Ja-Ling Wu, “RoleNet: Movie analysis from the perspective of social network,” IEEE Trans. on Multimedia, vol.11, no. 2, pp.256-271, February, 2009.

延伸閱讀