透過您的圖書館登入
IP:3.129.69.151
  • 學位論文

基於虛擬視角的立體影片合成

Virtual-view-based Stereo Video Composition

指導教授 : 杭學鳴

摘要


立體數位內容日益被重視,新型態的技術包含了自由視點視訊(FTV, Free Viewpoint Television)與擴增實境(AR, Augmented Reality),這些應用以任意視點合成技術為最主要的關鍵議題。有許多的任意視點合成演算法被提出,通常都是利用多重影像以及其對應的深度資訊圖來產生虛擬試點的影像以達到任意視點的效果。我們利用這種基於影像與深度的影像合成渲染(DIBR, Depth Image-based Rendering)來產生背景置換後的立體視訊內容。 輸入為兩組多個攝影機分別拍攝的兩組視訊,我們希望結合這些輸入來產生新的立體場景。此立體場景由其中一組輸入的前景物體,與另外一組的背景場景共同組成。為了這個目的,我們將以多組視訊間場景間不匹配(mismatch)的角度來觀察,在此論文中主要將討論包括攝影機參數以及攝影機定位的不匹配。當使用者在背景影像中選取了定位點(landing point),我們需要經由改變攝影機相關參數來合成出背景場景的對應虛擬視角(配合前景攝影機),以達成背景置換。這樣的方式可以大幅增加創作的自由度。 相較於傳統的影像創作(Image Composition),上述的過程需要利用到深度幾何的資訊。欲被合成的背景場景需要經由虛擬攝影機參數的計算。此外,為了保持場景物體間互相遮蔽的關係,在背景置換時深度競爭(Depth Competition)是另外一個被探討的議題。當我們將靜態影像延伸至視訊時,我們需要攝影機移動行為的資訊來補償不同場景間攝影機的移動不匹配問題。實驗結果顯示我們可以達成令人滿意的視覺觀感。

並列摘要


3D video is gaining its popularity recently. In addition to the conventional left-right view 3D pictures, new forms of 3D video such as free viewpoint TV (FTV) and augmented reality (AR) are introduced. The Depth Image-based Rendering (DIBR) technique is one enabling rendering technique behind these applications. Typically, it uses multiple views with depth information to generate the intermediate view at any arbitrary viewpoint. We can use the DIBR techniques to produce new stereo videos with background substitution. Given two sets of videos captured by two sets of multiple cameras, we like to combine them to create a new stereo scene with the foreground objects from one set of video and the background from the other set. We will study a few mismatch issues between two scenes such as camera parameter mismatch and camera orientation mismatch problems in this thesis. We propose a floor model to adjust the camera orientation. Once we pick up the landing point (of foreground) in the background scene, we need to adjust the background camera parameters (position etc.) to match the foreground object, which enriches the freedom of composition. In contrast to the conventional 2D composition methods, the depth information is used in the above calculation. Thus, the new background scenes may have to be synthesized based on the calculated virtual camera parameters and the given background pictures. The depth competition problem is another issue to maintain the inter-occlusion relationship in the composite new scene. If we extend this 3D composition form still pictures to motion pictures, we need the camera movement information too. The camera motion is estimated for individual scene to solve the mismatch of camera motion of two scenes. Plausible results are demonstrated using the proposed algorithms.

參考文獻


[1] P. Kauff, N. Atzpadin, C. Fehn, M. Muller, O. Schreer, A. Smolic, and R. Tanger, “Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability,” Signal Process: Image Communication, Special Issue on 3DTV, pp. 217-234, Feb. 2007.
[2] A. Kubota, A. Smolic, M. Magnor, M. Tanimoto, T. Chen, and C. Zhang, “Multiview Imaging and 3DTV,” IEEE Signal Processing Magazine, vol. 24, no. 6, pp. 10–21, Nov. 2007.
[6] P. Kauff, N. Atzpadin, C. Fehn, M. Muller, O. Schreer, A. Smolic, and R. Tanger, “Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability,” Signal Process: Image Communication, Special Issue on 3DTV, pp. 217-234, Feb. 2007.
[7] G. Chen, Y. Liu, and N. Max, “Real-time view synthesis from a sparse set of views,” Signal Process.: Image Commun., 22, (2), pp. 188–202, 2007.
[10] M. Tanimoto, “FTV: Free-viewpoint Television,” Signal Processing: Image Communication, Volume 27, Issue 6, July 2012.

被引用紀錄


黃品宏(2006)。甲狀腺與乳房細針穿刺細胞學檢查抹片充足性分析: 以臺灣某醫學中心為例〔碩士論文,國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2006.00890
陳冠諭(2016)。景深感應器的3D影像合成〔碩士論文,國立交通大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0030-0803201714323814

延伸閱讀