透過您的圖書館登入
IP:3.145.65.134
  • 學位論文

使用多次優化之區域為基礎的立體匹配方法

Region-Based Stereo Matching Method Using Multi-Step Refinements

指導教授 : 陳自強
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


影像處理在今天已經是多媒體領域的一門重要學問,應用在立體影像上的立體匹演算法亦扮演著重要的地位。本論文主要是先介紹與立體匹配演算法有關的數學模型以及相關假設,再來利用影像分割的原理,討論影像中深度平面與原始影像信號中相關性的區域,分析關聯性決定出區域,最後再合併區域找尋最佳解。在此使用區域正交相關匹配深度演算法(Local matching with Orthogonal Integral Segment)配合簡單樹狀動態演算法(Simple tree dynamic programming )來決定初始深度影像,並利用正向匹配與反向匹配找出可信點區域後,細部修正可信點區域使之雜訊大幅減少,並對原始影像進行顏色相關度切割決定出深度平面,再利用平面方程式與B-spline平面修正使預測的深度平面可以有不同曲度展現形式。接下來利用原始影像與經過深度修正後的影像進行色彩統計上的相似度比較,並佐以比對Census轉換後的紋理影像,找出最適合的影像分割與曲度集合,最後對深度影像不連續區域做以正交相關區域為基底,進行匹配能量重定修正藉以達到更好的表現。整體演算法於Middle Bury立體視覺影像研究資料庫所提供之四組影像對計算結果與真實深度之比較,錯誤率分別為Tsukuba 2.37%、Venus 1.10%、Teddy 7.83%、Cones 3.99% 整體錯誤排名為56名

並列摘要


Nowadays, 3D image processing has become a critical technique in multimedia applications. Particularly, stereo matching plays an essential role where a segmentation-based stereo matching approach is widely adopted in disparity computing, region clustering, and segmentation refining. This work first introduces the theoretical foundation with respect to segmentation-based stereo matching. Second, based on image segmentation, we explore the depths, disparity planes and relevance among segmented regions and then determine the association to find an optimized solution of region grouping. Particularly, the local matching with orthogonal integral segmentation and simple tree dynamic programming are used to determine initial depths of planes. Additionally, the B-spline surface, plane fitting function and majority voting are applied to predict reliable region depths where the cost function is based on statistical color similarity using the mutual information and texture similarity using the census transform. Third, the edge correction scheme based on the orthogonal integral segmentation is adopted to raise the performance. Finally, the obtained depths are used as an initial depth map for the next iteration. When the cost function cannot be lowered, the iteration is stopped. The four sample image sets from Middlebury are tested by using the proposed method to yield the error rates of bad pixels, 2.37%, 1.10%, 7.83% and 3.99% for Tsukuba, Venus, Teddy and Cones, respectively, where the overall ranking score is 56.

參考文獻


[1] A. Klaus, M. Sormann and K. Karner, “Segment-based stereo matching using belief propagation and a self-adapting dissimilarity measure,” Proceeding of International Conference on Pattern Recognition, pp. 15-18, Sep. 2006.
[2] Z. Wang and Z. Zheng, “A region based stereo matching algorithm using cooperative optimization,” Proceeding of Computer Vision and Pattern Recognition, pp. 23-28, June 2008.
[4] D. Comaniciu and P. Meer, “Mean shift: a robust approach towards feature space a analysis,” IEEE Trans.on Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp. 603-619, May. 2002.
[6] S. Birchfield and C. Tomasi, “Multiway cut for stereo and motion with slanted surfaces,” Proceedings of the Seventh IEEE International Conference on Computer Vision ,pp. 489-495,July 1999.
[8] M. Bleyer and M. Gelautz, “Simple but effective tree structures for dynamic programming-based stereo matching,.” International Conference on Computer Vision Theory and Applications, vol 2, pp. 415–422, June 2008.

延伸閱讀