透過您的圖書館登入
IP:3.137.187.233
  • 學位論文

基於視覺感知的影像分析原理應用於立體視訊編碼之位元率控制

Rate Control Algorithm for Multi-view Video Coding Using Visual Perception Based Stereo Video Analysis

指導教授 : 李佩君
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


立體影像技術的成長一日千里,使得三維立體影像逐漸普及,與傳統二維影像最大的不同在於,三維影像需要至少兩個以上的影像序列以產生立體影像。雖然多個影像序列可提昇三維視訊的觀賞品質但也造成儲存資料量的增加,面臨資料傳輸、壓縮的負擔及困難。為了解決資料量龐大的問題,聯合視訊專家群(JVT)提出以鄰近視角間的相似性,提升壓縮效能降低整體位元率。然而,如何在壓縮位元率及影像品質間達到平衡,有效的位元率控制在三維視訊資料壓縮扮演著關鍵性的角色。 為了兼具影像品質及壓縮率進而提出有效的控制位元率方法,本論文提出以視覺感知的心理學理論分析三維視訊影像,發現人眼對於動態以及邊緣變化性高的物體有較高的興趣程度,藉此發展出以視覺感知為基礎的演算法。利用此觀念,將畫面中吸引人眼注視的物體藉由提高位元率提升該區域的畫面品質,反之容易忽略的區域藉由降低該區域的位元率降低,在不影響觀看品質的條件下有效節省位元率,達到在降低壓縮位元率及提昇觀賞影像品質的有效控制。 本論文提出之有效位元率控制方法是以視覺感知來定義人眼觀賞畫面之感興趣區域,其中利用目前編碼畫面的Macro-block(MB)與參考畫面的MB之間的平均亮度差值,透過just noticeable difference(JND)方程式定義具有高動量特性的興趣區域,另外以目前編碼畫面MB與參考畫面MB的亮度直方圖變化程度,定義該MB具有低動量的興趣區域及邊緣保留法描述外型明顯獨立於周圍的物件興趣區域。在正確描述興趣區域之後,根據不同的興趣程度,藉由調整量化參數(QP)提高興趣程度較高區域的位元率用以提昇畫面品質。 實驗結果顯示以平均亮度差值與亮度JND的比較、直方圖變化及邊緣保留法可以正確描述具有高動量、低動量及物件特性的興趣區域。實驗數據顯示本論文所提出的方法在相近的位元率的情況下感興趣程度較高的區域在PSNR上比JMVC提升約0.86dB的影像品質,在興趣程度較低的區域bit-rate節省2.5(bit/MB)。因此,本論文以視覺感知為基礎,有效的以三種方法描述畫面中的感興趣區域,在相近的位元率條件之下,本論文所提出的方法相較於JMVC在興趣區域有更好的畫面品質,並且在不感興趣區域以不影響觀看品質的條件下,節省了位元率,達到兼具影像品質及壓縮效率的目的位元率控制。

並列摘要


Currently, thanks for the development of video technology, the major topic of TV change from 2D to 3D. Compare to the traditional 2D video contents and 3D video contents, that need view content for 2D video and at least two or more than views for 3D video, respectively. Huge information of 3D video contents causes the awkward situation during data compression and data transmission. To solve the problem, Joint Video Team (JVT) proposes a coding method that utilizes the content-similar relationship between neighboring views to increase the efficiency of multi-view video coding (MVC). Therefore, rate control (RC) algorithm is the key technique to tradeoff data compression ratio and video quality of 3D video. To control the bit-rate efficiently, considering video quality and compression ratio, the proposed algorithm utilizes the characteristic of visual perception theory from the analysis of 3D video that the region with high motion and individual object feature are interesting to viewer. Based on the concept, the proposed rate control algorithm gives the high bit-rate at the interesting region for preserve the image quality, decreases the image quality at the region of non-interesting for saving the bit-rate. The proposed algorithm determines interesting region with high motion feature by using the relationship between luminance difference and Just Noticeable Distortion (JND) function at the current and reference macro-block (MB). To enhance the ability of describing motion feature, the proposed algorithm uses luminance histogram bin variation of the two MBs to describe low motion feature, edge strength (ES) describes the object with individual shape and obvious texture. By the interesting region determination, the rate-control algorithm is used by quantization parameter (QP) defining to control bit-rate at different levels of interesting region. Simulation result of “PSNR of similar bit-rate” shows the proposed algorithm has better image quality than JMVC at the interesting region with high motion feature that increase 0.86dB of PSNR and save 2.5(bit/MB) at the non-interesting region.

參考文獻


[1] A.Vetro, Yea Sehoon, M. Zwicker, W. Matusik and H. Pfister, “Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays,” IEEE International Conference on Image Processing, vol 1, pp. 17-20, 2007.
[2] H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the Scalable Video Coding Extension of the H.264/AVC Standard,” IEEE Trans. Circuits Syst. Video Technology, vol. 17, no. 9, pp. 1103-1124, Sept. 2007.
[3] P. Merkle, A. Smolic, K. Muller, and T. Wiegand, “Comparative Study of MVC Prediction Structures,” ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, June 2007.
[4] C. Ha, W. Lee, S. Jin, and J. Jeong, “Human Perception of Asymmetrical 3-D Inputs,” 3DTV Conference, pp. 1-4, 2007
[5] G-L Wu, T-H Wu, Y-J Fu, S-Y Chien, “Perpcetual-Aware H.264/AVC Encoder with Hardware Perception Analysis, Conference on Multimedia and Expo (ICME), IEEE International , pp.790-795, 2010

延伸閱讀