透過您的圖書館登入
IP:3.144.38.130
  • 學位論文

用於相依視訊編碼之影質控制策略

Picture Quality Control Strategies for Dependent Video Coding

指導教授 : 杭學鳴

摘要


傳統視訊位元率控制演算法通常追求影像失真總量最小化,然而往往付出影值大幅度變動的代價,特別是在視訊內容較激烈且經常性場景變換時。為了減輕影值變動所帶來的負面作用,許多演算法追求全部影值均等。如某些研究者指出,雖然現有演算法已能產生影值均等的視訊,但是這些演算法經常無法精確地使用分配的位元來減少失真總量。本論文嘗試一次達成三個目標。即平穩視訊品質、失真總量最小化、精準地使用位元預算等。我們共提供三個演算法,針對兩種不同應用,定速率與變速率通道,來完成這些目標。其中兩個演算法適用於定速率通道,如儲存應用;一個演算法適用於變速率通道,如網際網路傳輸應用。 第一個演算法使用籬柵圖(Trellis-Based)架構來達成具備一致性品質的視訊。我們第一個貢獻是推導出,失真最小化問題與位元預算最小化問題的等效條件。第二,籬柵圖狀態定義為失真量,方便於一致性品質控制。第三,只需在提出的演算法中調整一個參數,一個介於,最小失真總量與固定品質的視訊解,可以被求得。第二個演算法結合拉格蘭乘數(Lagrange Multiplier)、快速分支延展與最佳化程序。與第一個演算法比較,它的峰信雜比效能只有些微的降低,但是運算複雜度顯著地降低。模擬結果顯示,這兩個演算法都只比MPEG 所提JM位元率控制演算法的平均峰信雜比些微低。當與近期發表的MultiStage與LPF演算法比較,我們所提演算法能夠較準確地使用分配位元預算,且輸出最大的峰信雜比與很小的峰信雜比變動率。 第三個演算法在變速率通道追求優雅的品質變動。我們取代一致性品質限制,換成最大相鄰幀間影值變動限制。因為這個演算法在單獨GOP內運作,相鄰GOP品質控制需求也需要被考量。每個GOP通道位元率被設定成給定的頻寬晃動模型。模擬結果顯示,我們的峰信雜比曲線函數很平滑,且在每個GOP邊界並沒有品質突然掉落。我們所提演算法也能夠準確地利用分配位元預算值。 總結,我們發展出彈性的影值控制架構,提出三個演算法。這些解能滿足三個目標,品質變動最小化、失真總量最小化、精準地使用位元預算。此外,附錄A呈現通道編碼效能分析結果,未來可用於整合視訊與通道編碼研究。

並列摘要


A conventional video rate control algorithm typically minimizes the total distortion at the cost of large temporal quality variation, especially for videos with high motion and frequent scene changes. To alleviate the negative effect of video quality variation, a few algorithms have been proposed to target on the constant quality across the entire sequence. As being pointed out by some researchers, although the existing proposals can produce constant-quality videos, they often fail to accurately utilize the available bits to minimize the global distortion. In this thesis, we would like to achieve three goals simultaneously. They are (1) producing smooth video quality (2) minimizing the total distortion, and (3) meeting the bit budget strictly. Three algorithms are proposed to accomplish this set of goals for two application scenarios: constant bitrate channels and variable bitrate channels. Two algorithms are designed for the constant bitrate channels, which may be used on the storage applications. And one algorithm is designed for the variable bitrate channels, which is needed for, say, Internet transmission applications. The first algorithm uses the trellis-based structure to achieve the consistent quality video. Our first contribution is to derive an equivalent condition between the distortion minimization problem and the budget minimization problem. Second, the trellis state (tree node) is defined in terms of distortion, which facilitates the consistent quality control. Third, by adjusting one key parameter in our algorithm, a solution in between the minimum total distortion and the constant quality criteria can be obtained. The second algorithm combines the Lagrange multipliers together with the proposed fast branch expansion process and optimization procedure. Compared to the first algorithm, its PSNR performance is degraded slightly but the computational complexity is significantly reduced. Simulation results show that our two algorithms produce a much smaller PSNR variation at a slight average PSNR loss as compared to the MPEG committee JM rate control. When they are compared to the recently published MultiStage and LPF algorithms, our proposed algorithms can meet the bit budget more accurately and produce the largest average PSNR at a small PSNR variation. The third algorithm aims at graceful quality variation for time-varying channels. We replace the consistent quality constraint in the second algorithm by a maximal inter-frame quality variation constraint. Because this algorithm operates on individual GOP’s, the quality variation across GOP boundaries has also to be considered. In our experiments, the channel bit rate for each GOP is set to follow the given bandwidth fluctuation pattern. Simulation results show that our PSNR curve has a smoother shape and has no sudden drop at the GOP boundaries. Also, the proposed algorithm meets the budget bits very accurately. In summary, we develop a flexible quality control framework that leads to 3 separate algorithms. They are nearly optimal solutions that achieve the triple goal: minimizing quality variation, minimizing global distortion, and satisfying the bit budget constraint. In addition, a channel coding study is presented in Appendix A for solving combined source-channel coding in the future.

參考文獻


[4] “Video coding for low bitrate communication,” ITU-T Recommendation H.263 Version 1, 1995. Version 2, Sept. 1997.
[7] T.Wiegand, G.-J. Sullivan, G. Bjontegarrd, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 560-576, Jul. 2003.
[8] A.-E. Mohr, “Bit allocation in sub-linear time and the multiple-choice knapsack problem,” in Proc. IEEE. Data Compression Conf., pp. 352-361, Mar. 2002.
[9] K. Ramchandran, A. Ortega, and M. Vetterli, “Bit allocation for dependent quantization with application to multi-resolution and MPEG video coders,” IEEE Trans. Image Processing, vol. 3, no. 5, pp. 533-545, Sep. 1994.
[10] Y. Sermadevi and S.-S. Hemami, “Efficient bit allocation for dependent video coding,” in Proc. IEEE. Data Compression Conf., pp. 232-241, Mar. 2004.

延伸閱讀