透過您的圖書館登入
IP:3.133.160.156
  • 學位論文

以全域移動補償為基礎之視訊編碼方式及其在提高視訊解析度上之應用

A Global-Motion-Compensation Based Video Coding Scheme and Its Application to Video Resolution Enhancement

指導教授 : 吳家麟

摘要


全域移動預測 (global motion estimation) 在許多應用中被廣泛使用,而全域移動補償 (global motion compensation) 的概念在過去十年中蓬勃發展,至今已達成熟並運用於視訊編碼標準。另一方面,為提高視訊影像之解析度 (video resolution enhancement),除了硬體製造技術改良求新之外,運用多張低解析度影像達到高解析度影像重建之技術,已被驗證其高效能並廣泛地使用在各種領域中。 本論文針對MPEG-4全域移動補償工具提出技術改良,以期在更複雜的影片場景中能有更高的利用效能。我們也將全域移動補償的觀念引至最新的視訊編碼標準,H.264/AVC。由實驗結果證實,對於含有明顯相機移動 (camera motion) 的影片,採用我們所提出的技術與先前的標準相比,其壓縮結果更為有效率。 此外,本論文提出一視訊解析度提升之架構。此架構考慮了常見的相機移動和前後景動作之不一致 (inconsistent moving foreground),在提升視訊影像銳利度的同時並能防止雜訊的產生。經由主觀實驗測試證實本論文所提出的方法擊敗了常見的畫素內插法 (pixel interpolation) 及前人所提出的文獻探討。 最後,我們將全域移動補償及視訊解析度之提升結合為一項新的應用。在此應用中,運算最為複雜耗時的全域移動預測工作將被移至編碼端 (encoder),所產生的移動參數 (motion parameter) 將由視訊串流傳遞至解碼端 (decoder)。如此一來,解碼端只需進行簡單的畫素註冊動作 (pixel registration),對於有速度考量之應用便可在有限的時間內完成。實驗結果驗證了我們的方法只需引入極少的額外頻寬,即可有效率地達到解析度之提升。

並列摘要


Global motion estimation helps in many applications. The concept of global motion compensation (GMC) has been developed in the last decade, and reaches maturity for being applied to video coding recently. On the other hand, super-resolution (SR) image reconstruction is widely used to build a high spatial resolution image through referencing a series of low-resolution (LR) images of the same scene. In this thesis, we propose some refinements to the global motion compensation in MPEG-4 Advanced Simple Profile (ASP) for higher utilization in complex scenes. We also introduce the GMC concept into the latest video coding standard, H.264/AVC. Experiment results show that by applying our scheme, videos with apparent camera motion, i.e., pan, rotate, or zoom, are coded in a more efficient way. On the other hand, a scheme of video resolution enhancement is proposed. It considers common camera motion and precise outlier removal which enhance the video sharpness while avoid unfavorable noise interference at the same time. Through subjective quality test, our proposed scheme outperforms the simple interpolation method and previous SR approaches. Finally, a framework to combine GMC and SR reconstruction for video decoding applications is designed. In the proposal, most computation-intensive task is shifted to an offline encoding process. All GMC parameters are generated and embedded in the video bitstreams. Then, the major task for the decoder is simply doing the registration, whose computation is acceptable in time-constrained applications. In our experiments, the proposal can produce a high quality video up to 4 times large in each spatial dimension while just introducing an unnoticeable bitrate increase.

參考文獻


[2] “Information Technology – Generic Coding of Audio-Visual Objects,” ISO/IEC IS 14496, International Organization for Standardization, 1998.
[5] S. C. Park, M. K. Park, and M. G. Kang, “Super-resolution image reconstruction: A technical review,” IEEE Signal Processing Mag., vol. 20, pp. 21–36, May 2003.
[6] S. Borman and R.L. Stevenson, “Super-resolution from image sequences—A review,” in Proc. 1998 Midwest Symp. Circuits and Systems, 1999, pp. 374-378.
[7] J. J. Clark, M R. Palmer, and P.D. Laurence, “A transformation method for the reconstruction of functions from nonuniformly spaced samples,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 1151-1165, 1985.
[8] S.P. Kim and N.K. Bose, “Reconstruction of 2-D bandlimited discrete signals from nonuniform samples,” Proc. Inst. Elec. Eng., vol. 137, pt. F, pp. 197-204, June 1990.

延伸閱讀