
Abstract


Sparse matrix multiplication (SMM) is widely used in many vital scientific and engineering computations, such as least-squares problems, eigenvalue problems, partial differential equations, and image reconstruction. However, it is very time-consuming, and the irregular structure of sparse matrices usually causes general-purpose processors to perform poorly and to suffer severe cache misses. In this paper, we develop an SMM system based on network-on-a-chip (NoC) technology to parallelize the required computations. To improve load balancing and the efficiency of packet distribution on the proposed SMM system, we also propose a method for mapping and partitioning a large matrix onto the system. In addition, the proposed SMM system is fully parameterizable, so it has maximum flexibility to fit the hardware resources at hand. The proposed SMM system has been verified with a variety of network sizes, including 2 × 2, 2 × 4, 4 × 4, 4 × 8, and 8 × 8, on a Xilinx Virtex 5 device (XC5VLX110T) operating at 100 MHz. A number of random and real-application matrices are used to evaluate the performance of the proposed SMM system, and the effects of network size, matrix size, and sparsity on system performance are considered. The results show that the proposed SMM system can achieve up to 40× and 2× speedup over MicroBlaze and Intel processors, respectively. The proposed SMM system is also realized with a TSMC 0.18 µm cell library. The core area of the 4 × 4 system is 1,986.5 µm × 1,985.4 µm, equivalent to 259,026 gates. The average power consumption is 417 mW at an operating frequency of 166 MHz.
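The abstract does not spell out the mapping and partitioning method, so the following is only a minimal software sketch of the general idea, not the paper's scheme. It assumes that rows of a sparse matrix A are handed out to the processing elements of a mesh so that each element receives a similar number of nonzeros, and that each element then computes its rows of C = A × B row by row. The dict-of-rows storage, the greedy heaviest-row-first heuristic, and the names `partition_rows`, `multiply_rows`, and `smm` are all assumptions made for illustration.

```python
import heapq
from collections import defaultdict


def partition_rows(a_rows, mesh_rows, mesh_cols):
    """Greedily assign rows of A to processing elements (PEs).

    Assumption: load is measured by the nonzero count of each row.
    Returns a dict mapping PE index -> list of row indices.
    """
    n_pe = mesh_rows * mesh_cols
    heap = [(0, pe) for pe in range(n_pe)]  # (current load, PE index)
    heapq.heapify(heap)
    assignment = defaultdict(list)
    # Place the heaviest rows first, always on the least-loaded PE.
    for i in sorted(a_rows, key=lambda r: (-len(a_rows[r]), r)):
        load, pe = heapq.heappop(heap)
        assignment[pe].append(i)
        heapq.heappush(heap, (load + len(a_rows[i]), pe))
    return assignment


def multiply_rows(a_rows, b_rows, row_ids):
    """Row-wise sparse product for the rows owned by one PE."""
    out = {}
    for i in row_ids:
        acc = defaultdict(float)
        for k, a_ik in a_rows[i].items():
            # Only rows of B that match a nonzero column of A contribute.
            for j, b_kj in b_rows.get(k, {}).items():
                acc[j] += a_ik * b_kj
        if acc:
            out[i] = dict(acc)
    return out


def smm(a_rows, b_rows, mesh_rows=2, mesh_cols=2):
    """Compute C = A x B by simulating each PE sequentially."""
    c = {}
    for rows in partition_rows(a_rows, mesh_rows, mesh_cols).values():
        c.update(multiply_rows(a_rows, b_rows, rows))
    return c


# Matrices are stored as {row: {col: value}} with zeros omitted.
A = {0: {0: 1.0, 2: 2.0}, 1: {1: 3.0}, 2: {0: 4.0}}
B = {0: {1: 5.0}, 1: {0: 6.0}, 2: {2: 7.0}}
C = smm(A, B)
```

In this sketch the PEs are simulated one after another in software; on an actual NoC, each element would work on its share concurrently and the rows of B would travel as packets, which the sketch does not model.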

Cited by


吳庭昌 (2010). 應用於晶片網路之可調整傳輸通道設計 [master's thesis, National Taipei University of Technology]. Airiti Library. https://doi.org/10.6841/NTUT.2010.00265
Lin, Y. C. (2009). 雙向晶片網路架構之效能感知程序映射 [master's thesis, National Taiwan University]. Airiti Library. https://doi.org/10.6342/NTU.2009.10448
