透過您的圖書館登入
IP:18.119.138.123
  • 學位論文

運用於嵌入式訊號處理器之向量暫存器架構設計與模擬

Vector Register Architecture Design and Simulation on Embedded DSP Processor

指導教授 : 吳仁銘
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


Single Instruction Multiple Data (SIMD) is powerful in multimedia processing. Usually, for a conventional 32-bit machine, if one data unit is 8-bit in width, one SIMD instruction can operate on four units at a time and thus reach data parallelism to four. These data units are often be regarded as subwords in SIMD processing. However, performance of SIMD is restricted by ill subword permutation in register file. Therefore, we propose a new architecture of register file named Vector Register (VR) architecture. With Vector Register, subwords can be well permuted in register file without bringing heavy traffic between memory and register file. We have designed three benchmarks, matrix transposition, deblocking filter, and discrete cosine transform (DCT) based on H.264/AVC, and set up a deliberating simulation flow on Starfish DSP (digital signal processor) simulator. The simulation results shows that, in average, we can improve cycle count, instruction count of these benchmarks to 30.865%, 30.606%, respectively.

關鍵字

暫存器架構

並列摘要


在多媒體處理的領域中,由於資料的特性,單一指令操作於多重資料( Single Instruction Multiple Data, SIMD )的運算處理技術是有效及廣泛被使用的。通常,對於一台32-bit的機器來說,假如一個運算資料單位是8-bit的話,一條SIMD的指令可以同時操作於4個資料單位,因此也能將運算的平行度提升到4。這些運算資料單元在SIMD運算處理技術中,時常被稱之為子字符( subword )。然而,SIMD運算的效能常常受限於這些subwords在暫存器( register )之間的排列狀況。因此,為了解決subwords的排列問題,我們提出了一種新的暫存器架構,稱之為向量暫存器架構( Vector Register Architecture )。藉由向量暫存器架構,我們可以更自由地在暫存器間,排列、重組這些subwords,而不需要在暫存器跟記憶體之間,製造大量的資料流量。為了模擬與驗證向量暫存器的效能,我們基於新一代的影像壓縮技術─H.264/AVC,設計了三組標準測試程式( benchmark ),這些程式分別是矩陣轉置( matrix transposition),去方塊效應濾波器( deblocking filter),離散餘弦轉換 ( discrete cosine transform)。我們並設計了一套清楚的模擬流程去進行向量暫存器架構的模擬。透過這套流程,我們的模擬結果顯示:向量暫存器架構能有效地降低指令所消耗的週期數( cycle count ),以及所需要的指令數( instruction count )。平均而言,透過向量暫存器架構,我們能分別改善cycle count達到30.865%,instruction count達到30.606%。

並列關鍵字

HASH(0x1dab5b90)

參考文獻


[1] ”Analog Devices - Embedded Processing and DSP - Blackfin Processor Home”.
[3] Iain E. G. Richardson. ”H.264 and MPEG-4 Video Compression Video Coding for
Next-generation Multimedia”. John Wiley and Sons, 2003.
13(7):560–576, July 2003.
’03., 1:345–348, July 2003.

延伸閱讀