
使用核心模組與空間時間域的相似性做影像物件分

Video Object Analysis Using Kernel-based Models and Spatiotemporal Similarity

Advisor: 謝君偉

Abstract


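Object segmentation plays an important role in many advanced applications, such as human-computer interaction, vehicle surveillance, and video compression, all of which rely on this technique. This thesis uses temporal and spatial information together with region tracking to extract important objects from video. First, because different kinds of information can compensate for one another, we use color, edge detection, pixel motion vectors, and kernel-based models to extract the objects in each frame. Since these cues offset each other's weaknesses, segmentation remains good even when the camera shakes. In addition, similar pixels are merged into regions according to the degree of color difference, and a region adjacency graph is analyzed to merge regions with similar properties and thereby segment objects. For region merging, we apply Bayesian inference to find the best merging combination. Finally, the kernel-based analysis combines the Monte Carlo method with a probabilistic and statistical viewpoint to track how objects change across consecutive frames. Its advantage is that it locates objects accurately and provides useful object information that compensates for the shortcomings of the techniques above, thereby improving segmentation accuracy. The kernel-based analysis and its results are discussed in detail in Chapters 2 and 3. Experimental results show that the proposed method performs well for object segmentation.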
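The kernel-based tracking step can be illustrated with a minimal sketch. It is not the thesis's implementation: it assumes a grayscale frame stored as a list of rows, an Epanechnikov-weighted histogram as the kernel model, the Bhattacharyya coefficient as the similarity measure, and plain Gaussian sampling around the previous position as a stand-in for the Monte Carlo search. The names `kernel_histogram`, `bhattacharyya`, and `track`, and all parameter values, are hypothetical.

```python
import math
import random

def kernel_histogram(image, cx, cy, radius, bins=8):
    """Epanechnikov-weighted gray-level histogram of a circular patch (radius >= 1)."""
    hist = [0.0] * bins
    height, width = len(image), len(image[0])
    for y in range(max(0, cy - radius), min(height, cy + radius + 1)):
        for x in range(max(0, cx - radius), min(width, cx + radius + 1)):
            # Squared distance from the patch centre, normalised by the radius.
            r2 = ((x - cx) ** 2 + (y - cy) ** 2) / float(radius ** 2)
            if r2 >= 1.0:
                continue
            weight = 1.0 - r2  # pixels near the centre count more
            b = min(bins - 1, image[y][x] * bins // 256)
            hist[b] += weight
    total = sum(hist)
    return [h / total for h in hist] if total > 0 else hist

def bhattacharyya(p, q):
    """Similarity of two normalised histograms; 1.0 means identical."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def track(image, target_hist, prev, radius, samples=200, spread=2.0, seed=0):
    """Sample candidate centres around prev and keep the most similar one."""
    rng = random.Random(seed)
    height, width = len(image), len(image[0])
    best = prev
    best_score = bhattacharyya(
        kernel_histogram(image, prev[0], prev[1], radius), target_hist)
    for _ in range(samples):
        x = int(round(rng.gauss(prev[0], spread)))
        y = int(round(rng.gauss(prev[1], spread)))
        if not (0 <= x < width and 0 <= y < height):
            continue  # discard samples that fall outside the frame
        score = bhattacharyya(kernel_histogram(image, x, y, radius), target_hist)
        if score > best_score:
            best, best_score = (x, y), score
    return best, best_score
```

The target histogram would be built once from the frame in which the object is first segmented, and `track` would then be called on each later frame with the previous position as `prev`. Because the search starts from the previous position and only accepts strict improvements, the returned score is never lower than the score at `prev`.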

Parallel Abstract


Video object segmentation plays an important role in many advanced applications, such as human-computer interaction, video surveillance, and content-based video coding. This thesis proposes a semantic video object segmentation system that combines spatiotemporal video segmentation with region tracking to extract important semantic objects from videos. First, multiple cues are used to segment each video frame into regions: color, edges, motion, and kernel-based models. Because these features are complementary, the desired regions can be segmented well from the input frames even when they are captured by a non-stationary camera. Next, from the spatial information of each segmented region, we construct a region adjacency graph (RAG) that records the adjacency relations between regions. Based on the RAG, we propose a Bayesian classifier that groups regions by checking their spatial and temporal similarities, so that different regions are merged and associated to form a meaningful object. Because the classifier incorporates a kernel-based analysis, the desired semantic objects can be extracted well from video sequences. The kernel-based analysis provides rich information for segmenting semantic objects when they are still in the background and cannot be identified by other features such as motion. Experimental results demonstrate the superiority of the proposed method in object segmentation.
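The region adjacency graph and the merging step can be sketched as follows. This is an illustration under stated assumptions, not the method of the thesis: the label map is a list of rows of integer region ids, adjacency is 4-connected, and a fixed threshold on mean RGB distance stands in for the Bayesian classifier, whose actual form the abstract does not give. The names `build_rag`, `mean_colors`, and `merge_regions` are hypothetical.

```python
import math
from collections import defaultdict

def build_rag(labels):
    """Return {region: set of neighbouring regions} using 4-connectivity."""
    rag = defaultdict(set)
    height, width = len(labels), len(labels[0])
    for y in range(height):
        for x in range(width):
            a = labels[y][x]
            rag[a]  # touch the key so isolated regions still become nodes
            for ny, nx in ((y + 1, x), (y, x + 1)):
                if ny < height and nx < width:
                    b = labels[ny][nx]
                    if a != b:
                        rag[a].add(b)
                        rag[b].add(a)
    return dict(rag)

def mean_colors(labels, image):
    """Mean RGB colour of every region."""
    sums = {}
    for label_row, pixel_row in zip(labels, image):
        for lab, (r, g, b) in zip(label_row, pixel_row):
            s = sums.setdefault(lab, [0, 0, 0, 0])
            s[0] += r
            s[1] += g
            s[2] += b
            s[3] += 1
    return {lab: (s[0] / s[3], s[1] / s[3], s[2] / s[3])
            for lab, s in sums.items()}

def merge_regions(rag, means, threshold):
    """Merge adjacent regions whose mean colours differ by less than threshold.

    Assumption: a distance threshold replaces the Bayesian decision, and
    region means are not recomputed after each merge (single pass).
    """
    parent = {r: r for r in rag}

    def find(r):
        while parent[r] != r:
            parent[r] = parent[parent[r]]  # path halving
            r = parent[r]
        return r

    for a in sorted(rag):
        for b in sorted(rag[a]):
            if a < b and math.dist(means[a], means[b]) < threshold:
                ra, rb = find(a), find(b)
                if ra != rb:
                    parent[max(ra, rb)] = min(ra, rb)
    return {r: find(r) for r in rag}
```

A fuller version would add temporal cues, such as motion similarity between frames, to the edge test, since the abstract states that both spatial and temporal similarities are checked.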

