透過您的圖書館登入
IP:216.73.216.202
  • 學位論文

在特定應用的超長指令集處理器上產生及使用可重組態的客製化功能單元

Generating and Exploiting Reconfigurable Custom Functional Unit in Application Specific VLIW Processors

指導教授 : 單智君

摘要


為了提升針對特殊應用而設計之處理器的效能,可將可重組態的客製化功能單元﹙reconfigurable custom functional unit, RCFU﹚ 附加於超長指令字﹙VLIW﹚處理器架構中。此技術是藉由常出現的運算序列﹙operation segment﹚產生可重組態的客製化功能單元,並將可以在可重組態的客製化功能單元上執行的運算序列包裹成客製化指令﹙customized instructions﹚。接下來在程式編譯階段中,指令排程﹙instruction scheduling﹚演算法利用指令的可平行化執行來提升效能表現。此篇研究中,我們不只提出產生緊密附著於超長指令字處理器的可重組態之客製化功能單元的方法,同時也提出一個使用此硬體的演算法。我們假設在原本處理器中的功能單元和附加的可重組態的客製化功能單元可同時執行,且將在過去研究中分成兩個不同步驟的包裹客製化指令及指令排程整合成一個步驟,以便獲得更多的效能提升及硬體使用率。比較具可重組態的客製化功能單元處理器及不具可重組態的客製化功能單元處理器的效能表現,整體來說,在採取我們提出的使用可重組態的客製化功能單元的演算法下產生可重組態的客製化功能單元的方式相較於過去的產生方式在效能上有大幅的提升;針對如何善用已存在可重組態的客製化功能單元的架構所提出的演算法也比傳統將找客製化指令及指令排程分別討論的方式有明顯效能提升。

並列摘要


To improve the performance of processors, a customized accelerator, reconfigurable custom functional unit (RCFU), may be appended to a very long instruction word (VLIW) processor architecture. The technique is to generate RCFU by those frequent operation segments and collapse operation segments which could be executed on the RCFU as customized instructions. Then, instruction scheduling is done to elaborate instruction-level parallelism for performance improvement at compile time. In this research, we propose not only a tightly-coupled RCFU design on the VLIW processor, but also an algorithm is also proposed to exploit the processor augmented with RCFU. We assume that FUs in the processor pipeline and RCFU could execute simultaneously, and independent operation mapping and instruction scheduling algorithms are integrated into a single phase to get more performance gains and higher hardware usability. We had comparisons between the processors with RCFU and without RCFU. Overall, our proposed RCFU design while using our proposed exploitation algorithm still achieves giant speedup on average over previous generating algorithms. Furthermore, the algorithm for exploiting RCFU also achieves obviously speedup on average over previous methods, separating algorithms.

參考文獻


[1] Hamid Noori, Farhad Mehdipour, Morteza Saheb Zamani, Koji Inoue, and Kazuaki Murakami, “Handling Control Data Flow Graphs for a Tightly Coupled Reconfigurable Accelerator”, The Institute for Computational Earth System Science, California USA, 2007: 249-260.
[2] Hamid Noori, Farhad Mehdipour, Kazuaki Murakami, Koji Inoue, Maziar Goudarzi, Interactive presentation, “Generating and executing multi-exit custom instructions for an adaptive extensible processor, Proceedings of the conference on Design”, The Design, Automation, and Test in Europe, Germany, 2007: 325–330.
[5] Hartej Singh, Ming-Hau Lee, Guangming Lu, Fadi J. Kurdahi, Nader Bagherzadeh, “Morphosys: an integrated reconfigurable system for data-parallel and computation intensive applications”, Computers, IEEE Transactions on Volume 49, Issue 5, Heidelberg, Berlin, 2000: 465–481.
[6] Bingfeng Mei, Serge Vernalde, Diederik Verkest, Hugo De Man, Rudy Lauwereins, “ ADRES: An Architecture with Tightly Coupled VLIW Processor and Coarse-Grained Reconfigurable Matrix”, Field-Programmable Logic and Applications, Heidelberg, Berlin, 2003: 61–70.
[7] Hamid Noori, Farhad Mehdipur, Kazuaki Murakami, Koji Inoue, Maziar Goudarzi, “Generating and Executing Multi-Exit Custom Instructions for an Adaptive Extensible Processor”, The Design, Automation, and Test in Europe, Nice, France, 2007: 1–6.

被引用紀錄


陳怡芳(2013)。國中英語教師使用電子書輔助教學對學生學習動機及學習成就之影響研究〔碩士論文,中原大學〕。華藝線上圖書館。https://doi.org/10.6840/CYCU.2013.00441
何金燕(2012)。數位電子教科書學習滿意度與學習績效之探討:科技、動機與領導的整合性觀點〔碩士論文,國立臺中科技大學〕。華藝線上圖書館。https://doi.org/10.6826/NUTC.2012.00082
胡佳禎(2013)。屏東縣國中技藝教育專班學生技能學習態度、課程滿意度與職業自我概念之研究〔碩士論文,國立屏東科技大學〕。華藝線上圖書館。https://doi.org/10.6346/NPUST.2013.00274
黃韻伊(2010)。臺北市國中生表演藝術課程學習滿意度之研究〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-2207201017364700
陳文玲(2013)。新北市國民中學技藝教育專案編班學生學習滿意度之研究〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-0407201309524500

延伸閱讀