透過您的圖書館登入
IP:3.238.142.134
  • 學位論文

整合頻繁樣式分析與多模隱藏式馬可夫模型的棒球影片事件分類架構

A Framework for Incorporating Frequent Pattern Analysis into Multimodal HMM Event Classification for Baseball Videos

指導教授 : 蔡文錦

摘要


影片的高階語意事件辨識已發展成為多媒體搜尋與索引領域最有趣的研究議題之一。因為影片低階特徵與高階事件在語意上距離遙遠,需要提出一個階層式的影片分析架構,利用中階特徵連結低階的聲音視覺特徵和高階語意事件。因此這篇論文提出一個使用中階時間區間特徵相互間時間前後關係的影片事件分類架構。在此架構中,我們提出了一個共同出現符號轉換方法來探勘隱藏式馬可夫模型機率事件分類架構中多種來源特徵間的完整時間關係。另外,資料探勘和頻繁樣式分析已成為從資料中發現新知識很受歡迎的方法,但是卻幾乎沒有被應用到影片語意分析領域,因此這篇論文提出兩種方法整合頻繁樣式分析和多模隱藏式馬可夫模型棒球事件分類,包含利用頻繁樣式訓練的隱藏式馬可夫模型和依照頻繁樣式設計的隱藏式馬可夫模型。除此之外,不同的時間序列編碼方法也被提出來和共同出現符號編碼法比較分類效果。實驗結果證明我們提出來的方法在棒球影片事件分類的優越性以及整合頻繁樣式分析至事件分類架構有助於提升事件分類的效能。

並列摘要


Semantic high-level event recognition of videos is one of most interesting issues for multimedia searching and indexing. Since low-level features are semantically distinct from high-level events, a hierarchical video analysis framework is needed, i.e. using mid-level features to provide clear linkages between low-level audio-visual features and high-level semantics. Therefore, this thesis presents a framework for video event classification using temporal context of mid-level interval-based multimodal features. In the framework, a co-occurrence symbol transformation method is proposed to explore full temporal relations among multiple modalities in probabilistic HMM event classification. Besides, data mining and frequent pattern analysis have recently become a popular way of discovering new knowledge from a data set. However, it is rarely applied to video semantic analysis. Therefore, this thesis introduces two methods: frequent-pattern trained HMM and frequent-pattern tailored HMM to incorporate frequent pattern analysis into multimodal HMM event classification for baseball videos. Moreover, different symbol coding methods including temporal sequence coding and co-occurrence symbol coding for multimodal HMM classification are compared. The results of our experiments on baseball video event classification demonstrate the superiority of the proposed approach and demonstrate that integration of frequent pattern analysis could help to improve event classification performances.

參考文獻


1. Lexing X, Sundaram H, Campbell M (2008), Event Mining in Multimedia Streams. Proceedings of the IEEE 96(4):623-647.
2. Ballan L, Bertini M, Bimbo AD, Seidenari L, Serra G (2011), Event detection and recognition for semantic annotation of video. Multimedia Tools and Applications 51(1): 279–302.
3. Yan WQ, Kieran DF, Rafatirad S, Jain R (2011), A comprehensive study of visual event computing. Multimedia Tools and Applications 55(3): 443–481.
5. Rehman A and Saba T (2014), Features extraction for soccer video semantic analysis: current achievements and remaining issues. Artificial Intelligence Review 41(3): 451–461.
6. Oskouie P, Alipour S, Eftekhari-Moghadam AM (2014), Multimodal feature extraction and fusion for semantic mining of soccer video: a survey. Artificial Intelligence Review 42(2): 173–210.

被引用紀錄


Chang, S. C. (2013). 無線環境的認證方法及其在電子商務應用之研究 [doctoral dissertation, National Chung Cheng University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0033-2110201613541599

延伸閱讀