數位信號與影像之任意放大與縮小演算法

比例縮放（scaling）一直是數位信號處理中重要的課題，主要原因是其應用非常廣泛，舉凡圖形的放大縮小扭曲變形、信號取樣頻率的升降、以及近來討論熱烈的小波轉換等，都離不開比例縮放的範疇。而比例縮放中最重要的參數就是比例縮放參數（scaling factor），其決定信號放大縮小的倍率。一直以來，大家對於處理有理數的比例縮放參數較有共識，相反的，對於無理數則是各持己見，沒有定論。而討論所有實數參數之比例縮放演算法我們稱之為廣義比例縮放（generalized scaling）。在本篇論文中，我們利用信號頻域與時域的關係，在頻譜上作處理以達到廣義比例縮放的目的。相較於別人的方法，這個演算法具有線性的優點，因此可以應用在完整重建濾波器串（perfect reconstruction filter bank）的設計上，無論是一維還是二維的完整重建濾波器串皆有新突破。此外，上述的方法也可以應用在數位信號的變形（warping）上。所謂變形與比例縮放的不同在於變形是非線性的取樣，而比例縮放則是線性的取樣。也就是說，透過這個方法，我們可以對一個數位信號做任意的扭曲變形，這在數位圖像的處理上有廣泛的應用，例如多投影機的校準便是一個例子。論文的最後，鑑於近年來區塊離散餘弦轉換（block-based DCT）被廣泛地應用在JPEG與MPEG上，我提出一個快速轉換區塊大小的架構，這對於以區塊離散餘弦轉換為基礎的壓縮標準有快速比例縮放的效果。

關鍵字

任意變形；內插；完整回復濾波器串；快速離散餘弦轉換

並列摘要

Scaling has always been an important topic in the field of digital signal processing, since its applications are so widespread and extensive. For example, interpolation, decimation and warping of a digital image, sampling rate conversion of a digital signal, and wavelet transform based filter banks are all closely related to scaling. The operation of scaling entirely depends on the scaling factor, which controls the ratio between the original signal and the scaled signal. In general, scaling factor can be either rational or irrational numbers. We named the operation on both rational and irrational factors, i.e. real factors, the generalized scaling. For a long time, there is no steadfast definition for generalized scaling, since everyone could justify his opinions from different points of view. In this paper, I propose a new algorithm to realize the generalized scaling using linear operations in the frequency domain. Compared to previous methods, this algorithm is entirely linear such that it can be well applied to the design of Perfect Reconstruction Filter Banks (PR FB). Furthermore, both one dimensional and two dimensional FB cases are explored in this paper. In addition, after a fine modification of the above algorithm, it can be utilized in image warping, which is a common problem in multi-projection applications. The term “warping” is different from scaling in that the former one is a non-linear operation while the later is linear. Through image warping, we can arbitrarily change the shape of a digital image. At the end of this paper, I put more emphases on DCT compressed domain manipulations, which are similar to generalized scaling in main concepts. I propose a new scheme to efficiently implement the block size conversion between different DCT’s. This scheme can be useful in image resizing for which the image is compressed in the DCT domain, such as JPEG and MPEG.

並列關鍵字

nonuniform filter bank ； interpolation ； decimation ； perfect reconstruction ； direct DCT domain computation ； radix-3 dct ； irrational scaling

參考文獻

[2.1] Seungsin Lee, Wei Zhao, R. Narasimha, and R. M. Rao, “Discrete-time models for statistically self-similar signals,” IEEE Transactions on Signal Processing, Volume: 51, Issue: 5, May 2003, Pages: 1221 – 1230

[2.5] Seungsin Lee, R. Rao, and R. Narasimha, “Characterization of self-similarity properties of discrete-time linear scale-invariant systems,” Proceedings. of IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume: 6 , 7-11 May 2001, Pages:3969 - 3972

[2.6] Gianpaolo Evangelista and Sergio Cavaliere, “Audio Effects Based on Biorthogonal Time-varying Frequency Warping,” EURASIP Journal on Applied Signal Processing, 2001:1, Pages: 27-35

[2.7] L. Cohen, “The scale representation,” IEEE Transactions on Signal Processing, Volume: 41, Issue: 12, Dec. 1993, Pages: 3275 - 3292

[3.1] R. N. Bracewell, K. -Y. Chang, A. K. Jha, Y. -H. Wang, “Affine theorem for two-dimensional Fourier transform,” Electronics Letters , Volume: 29 , Issue: 3 , 4 Feb. 1993, Pages:304

國際替代計量

數位信號與影像之任意放大與縮小演算法

全文下載

主題瀏覽