利用使用者注意模型決定視訊之興趣區

User Attention Model in Region-of-Interest Determination on Videos

Advisor: 吳家麟 (Ja-Ling Wu)

Abstract


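With the rapid growth in the number of multimedia documents, people have become increasingly eager for concise representations of their essential content. One important technique for this is region-of-interest (ROI) determination. Conventional ROI analysis focuses on two types of multimedia documents: images and videos. However, research results for video lag far behind those for images. This stems from failing to properly account for the essential differences between images and videos, and from overlooking some characteristics unique to video. To address this challenging research problem, we propose an automatic video ROI determination framework based on a user attention model. In this work, video attention features and knowledge of applied media aesthetics are considered and used together. We divide visual attention features into three basic categories: intensity, color, and motion. Following aesthetic principles, these features are integrated according to the type of camera motion, on the basis of a newly proposed video analysis unit called the frame-segment. In the experiments, ROI analysis and user studies were carried out on several kinds of video data and demonstrated the effectiveness of the proposed framework. We regard this work as an important foundation for higher-level, semantically meaningful video analysis.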

Parallel Abstract


With the rapid growth in the number of multimedia documents, people have become eager for more concise and informative representations of these documents. One of the desired technologies is region-of-interest (ROI) determination. Conventional ROI analysis concentrates on two fundamental types of multimedia documents: images and videos. However, research results for videos lag far behind those for images. This gap arises from failing to properly consider the essential differences between images and videos, and from ignoring some characteristics specific to video. To address this challenging issue, we propose a framework for automatic ROI determination in videos based on a user attention model. In this work, we make use of both video attention features and knowledge of applied media aesthetics. We classify visual attention features into three fundamental categories: intensity, color, and motion. Following aesthetic principles, these features are combined according to the camera motion type, on the basis of a proposed video analysis unit, the frame-segment. We conducted experiments on several kinds of video data and demonstrated the effectiveness of the proposed framework. We view this work as a preliminary step towards high-level semantic video analysis.
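The abstract does not give the fusion rule, so the following is only a minimal sketch of the general idea: three attention feature maps (intensity, color, motion) are combined with weights that depend on the camera motion type, and an ROI is taken as the bounding box of the high-attention cells. The weight table, the threshold, the camera motion labels, and the function names `fuse_attention` and `roi_bounding_box` are all illustrative assumptions, not values or methods from the thesis, and the sketch works on a single frame-sized map rather than modeling the frame-segment unit.

```python
# Hypothetical per-camera-motion weights (intensity, color, motion).
# These numbers are illustrative assumptions, not taken from the thesis.
WEIGHTS = {
    "static": (0.2, 0.2, 0.6),
    "pan": (0.4, 0.4, 0.2),
    "zoom": (0.35, 0.35, 0.3),
}


def fuse_attention(intensity, color, motion, camera_motion):
    """Weighted sum of three same-sized feature maps (lists of rows)."""
    wi, wc, wm = WEIGHTS[camera_motion]
    return [
        [wi * i + wc * c + wm * m for i, c, m in zip(ri, rc, rm)]
        for ri, rc, rm in zip(intensity, color, motion)
    ]


def roi_bounding_box(attention, threshold=0.5):
    """Return (top, left, bottom, right) of cells >= threshold, or None."""
    hits = [
        (r, c)
        for r, row in enumerate(attention)
        for c, value in enumerate(row)
        if value >= threshold
    ]
    if not hits:
        return None
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    return (min(rows), min(cols), max(rows), max(cols))
```

With these assumed weights, a region that stands out only by its motion dominates the fused map when the camera is static, but contributes much less during a pan, where apparent motion is largely caused by the camera itself.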

