在GPU上實作平行處理Bzip2資料壓縮演算法

科技資訊的發展日新月異，隨著網路頻寬以及儲存設備的不斷演進，人們對於數位資料的儲存需求也日益提昇，因此資料壓縮成為一個重要而且無法避免的課題。而無損壓縮具有確保資料正確性的能力，在考量編碼效率、編碼延遲以及編碼複雜度的情況下，如何取得一個平衡值是相當重要的研究方向。　　本論文將針對無損壓縮的Bzip2演算法改寫為平行處理的方式，首先介紹Nvidia CUDA的平行程式開發環境，利用顯示卡上的GPU來達成3D圖形顯示以外的運算用途，泛稱GPGPU，藉由繪圖晶片的強大運算能力來進行壓縮編碼的工作。由於CUDA支援C語言的使用，所以對於開發GPU程式的門檻降低，是一個相當適合的實驗環境，除了介紹目前無損壓縮編碼的演進，也順便介紹CUDA的程式架構以及硬體設備。　　對於無損編碼的改良，本論文會在壓縮編碼之前先執行Burrows-Wheeler Transformation以及Move-To-Front的轉換，此方式可以改善無損編碼的壓縮率，而我們的平行程式也著重在這兩者的操作上。除了探討程式分配的概念，本論文也會將GPU與CPU的Bzip2程式執行結果作比較與分析，最後討論壓縮演算法平行化對於此系統實作上所帶來的影響。

關鍵字

資料壓縮；顯示晶片；演算法；平行處理；平行程式

並列摘要

The development of information technology is rapid. With the network bandwidth and storage devices continue to evolve, require for digital data storage demand is rising. Data compression has become an important and unavoidable issue. The lossless compression has the ability to ensure data accuracy. In consideration of coding efficiency, coding delay and complexity of coding, how to strike a balance between the values is important research direction. 　　In this paper, we rewrite the lossless compression Bzip2 algorithms in the way for the parallel processing, first introduced the Nvidia CUDA the parallel programming environment. Using GPU on the graphics cards to achieve more operations besides 3D graphics computing, GPGPU, by a powerful graphics computing power to carry out the work of compression. As the CUDA support to the use of C language, so the threshold get lower for the development of GPU programming is a very suitable experimental environment. Apart from the evolution of the current lossless compression, but also the way introduce CUDA programming architecture and hardware. 　　For lossless coding improvements, this paper will execute Burrows-Wheeler Transformation and the Move-To-Front transformation before the compression entropy coding. This method can improve the lossless compression ratio, and our program also focuses on the parallel operation on both transform. In addition to the concept of distribute the program, we will compare the performance about CUDA GPU program and Bzip2 CPU program. This paper checks the results for comparison and analysis, finally discuss the impact of parallel compression algorithm implemented on this system.

並列關鍵字

GPGPU ； Bzip2 ； BWT ； MTF ； CUDA ； Nvidia ； Burrows-Wheeler Transformation ； Move-To-Front ； Parallel ； Data Compression ； Huffman code ； GPU

參考文獻

[1] Julian Seward's original reference implementation available under a BSD license.

The Bzip2 home page http://www.bzip.org/

[4] D.A. Huffman, “A method for the construction of minimum-redundancy codes”, Proceedings of the I.R.E., Sept 1952.

[5] Jorma J. Rissanen, "Generalized Kraft Inequality and Arithmetic Coding" , IBM Journal of Research and Development 20 (3): pp. 198–203. May 1976.

[6] C.E. Shannon, "A Mathematical Theory of Communication", Bell System Technical Journal 27: pp. 379–423 ,July 1948.

被引用紀錄

孫嘉駿（2012）。基於GPU平行DPCM視訊壓縮編碼法〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2012.00639

蔡榮哲（2013）。以圖形運算單元加速機率型神經徑路追蹤演算法〔碩士論文，中山醫學大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0003-2108201321342000

國際替代計量

在GPU上實作平行處理Bzip2資料壓縮演算法

全文下載

主題瀏覽