最近這幾年來,愈來愈多的多媒體應用在消費性電子商品上。發展蓬勃的手持式裝置如智慧型手機,除了基本的通話服務之外,還支援多種多媒體應用,如相機、錄影、網路、GPS導航、MP3隨身聽等等,大大地影響到人類的生活。而在手持式裝置上配備三維繪圖處理器(GPUs)的情況也越來越普遍。由於螢幕是人機之間最重要的介面,使用者對畫面上通常都會要求高畫質,這也意味著三維繪圖處理器所需要處理的運算也越來越複雜;除了高畫質的訴求,高解析度的螢幕不可或缺。高畫質及高解析度的螢幕要求,都直接的顯示出在三維繪圖處理器中,處理運算的核心單元也必須跟著變多。而這對資源極有限的手持式裝置來說,不失為一大挑戰。如何在固定的資源下,讓多個運算核心達到比較高的使用率以及讓各個核心中需要用來運算的資料使用最佳化,將會是一大考驗。 在這篇論文中,我們提出了一個手提式多核心繪圖處理器(Multi-Core Mobile GPU):利用動態配置的工作排程器(Task Scheduler)來分配各個需要運算的工作給各個運算核心單元,來達到各個運算核心之間的負擔平衡(Load balance);也利用資源排程器(Resource Scheduler),來提前搬移各個運算核心單元所需要的資料,使得各筆資料,能夠更有效率的被搬移並使用,進而提高每個著色處理器的管線效率, 綜合以上技術,我們把原本只有兩個運算核心的手提式繪圖處理器,擴充到了具有八個運算核心,並加入了所提出的工作排程器和資源排程器,並將之實現成一個系統晶片平台,晶片利用台積電65nm 技術製成,面積為4.0×4.0mm2,其工作頻率為200MHz,最大消耗功率為212.84mW
Recently, the multimedia application in consumer electronics has become more and more prosperous. Among all, the smart phones are not just phones. There are many multimedia applications such as games, camera, video, internet, GPS, MP3 etc embedded into the smart phones. Usually, there is a 3D graphics processing unit (GPU) which is embedded into mobile devices to enhance the processing capability of the multimedia applications. Since screen is the most important interface between the users and mobile devices, the high resolution and high quality screens are indispensible for users that use the mobile devices. However, higher resolution screen and higher quality display both mean that the processing power of the GPU must be increased which means the number of processing cores must be increased. This is a challenge since there are many limitations on mobile devices. How to maximize the utilization of the multi-core processor and how to make the efficient usage of the resource for processing will be a design challenge. In this thesis, we proposed a multi-core mobile GPU for mobile devices. We propose a dynamic task scheduler to dynamically assign the subtask to each processor to make the whole system more load balance; besides, we also propose a resource scheduler to efficiently prefetch the resource that is required by processor to make the whole processing pipeline more efficient. Based on the above techniques, we extend our original two processing shader cores GPU to a eight core unified shader GPU with task scheduler and resource scheduler added. We also realize these to a SOC platform, our chip is fabricated by TSMC 65nm technology. The chip area is 4mmx4mm and the chip working frequency is at 200 MHz. The maximum power consumption is 210mW. The maximum processing capabilities is 800MVtx/sec and 1600MPxl/sec.