Stereoscopic 3D display technology is steadily becoming mainstream, yet most existing digital photos were captured in monocular 2D format. Converting them to a stereoscopic format therefore requires estimating depth from a single view. Existing 2D-to-3D methods can estimate the front-to-back ordering of objects, but they cannot provide reasonable relative depth between them. Approaches based on statistical training against a ground-truth database can approximate the true depth in some scenes, but they fail easily in others because it is hard to find training features that generalize well. This paper focuses on depth estimation for monocular images taken outdoors. We combine perspective projection with the pinhole camera model to estimate reasonable relative depth between objects. We classify the objects in a scene into four types (sky, ground, man-made structures, and natural objects) and assign depth to each type using different rules. The depth of natural objects is further corrected with the dark channel model. Experimental results show that our estimated depth maps approximate the ground truth without any training, and that the converted 3D images provide a satisfying stereoscopic effect.
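As a rough illustration of two building blocks named above, the following sketch shows how a pinhole camera model can assign depth to ground pixels and how a dark channel can be computed. It is not the method of this paper: the function names, the level-camera and flat-ground assumptions, and the parameters `horizon_row`, `focal_px`, `camera_height`, and `radius` are all hypothetical choices made for the example.

```python
import math


def ground_depth(row, horizon_row, focal_px, camera_height):
    """Depth of a ground-plane pixel under a pinhole camera model.

    Assumption (not from the paper): the camera is level and sits
    `camera_height` above a flat ground plane, with image rows increasing
    downward. A ground point at depth Z then projects to
    row = horizon_row + focal_px * camera_height / Z.
    """
    offset = row - horizon_row
    if offset <= 0:
        return math.inf  # at or above the horizon: no finite ground depth
    return focal_px * camera_height / offset


def dark_channel(image, radius=1):
    """Per-pixel minimum over RGB and over a square neighbourhood.

    `image` is a list of rows; each row is a list of (r, g, b) tuples
    with values in [0, 1]. The window is (2 * radius + 1) pixels wide and
    is clipped at the image border.
    """
    height, width = len(image), len(image[0])
    # Minimum across the three color channels at each pixel.
    channel_min = [[min(pixel) for pixel in row] for row in image]
    result = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            ys = range(max(0, y - radius), min(height, y + radius + 1))
            xs = range(max(0, x - radius), min(width, x + radius + 1))
            result[y][x] = min(channel_min[j][i] for j in ys for i in xs)
    return result
```

Under these assumptions, ground pixels closer to the horizon row receive larger depth. In outdoor scenes, haze tends to raise the dark channel value, which is why it can serve as a coarse depth cue for distant natural objects; how the paper maps it to a depth correction is not specified here.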