
Improving the Performance of Wide-Area Cloud Storage Networks Using Real-Time Feedback Control and Data De-duplication

Improving Accessing Efficiency of Cloud Storage by De-duplication and Feedback Scheme

Advisor: 吳庭育

Abstract


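In a cloud storage environment, files are distributed and stored on physical storage devices that the provider owns or rents from a third party. After centralized management and virtualization, these devices are integrated into usable storage resources that give users the corresponding access services. Common storage protocols, such as Internet Small Computer System Interface (iSCSI), Fibre Channel, and Common Internet File System (CIFS), transfer and store data in block-based or file-based form. Because a cloud network covers a very wide range of use and many network domains, the content that different users write to the storage devices is sometimes highly similar. Because of the sheer quantity, administrators cannot ensure that every storage node stays in its optimal state, and once the number of files grows substantially, hardware resources are wasted and the data center becomes more complex to manage, further reducing the overall performance of the cloud storage system. In view of this, to reduce the burden that duplicate data places on the system architecture, this thesis proposes a new data center architecture that uses data de-duplication and real-time feedback control: the Index Name Server (INS), which integrates de-duplication, node optimization, and related mechanisms to improve the performance of the overall cloud storage architecture. INS manages the storage nodes and optimizes them according to each client's transmission conditions. The INS system can keep every storage node working in its optimal state and, as far as possible, give clients node resources that match their bandwidth for their transfers. This improves the efficiency of the cloud storage network and also effectively distributes and reduces the load on the storage nodes.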

Parallel Abstract


In a cloud storage environment, files are distributed and stored on physical storage devices that the provider either owns or rents from third-party companies. Through centralized management and virtualization, these devices are integrated into resources that users can access. Common storage protocols include iSCSI, Fibre Channel, and CIFS, which transmit and store data in block-based or file-based form. Moreover, because the cloud network spans a wide range and many domains, files saved by different users on the same storage device are often highly similar. Because of the large number of files, the administrator also cannot guarantee that each storage node stays in its optimal state. As the number of files grows, hardware resources are wasted and the data center becomes more complex to control, which further degrades the performance of the cloud storage system. To reduce the workload caused by duplicated files, this thesis proposes a new data management structure, the Index Name Server (INS), which integrates data de-duplication with node optimization to enhance the performance of the cloud storage system. INS manages and optimizes the nodes according to client-side transmission conditions. With INS, each node can be kept working in its best state and, as far as possible, matched to suitable clients. In this way, the performance of the cloud storage network improves and files are distributed so as to reduce the load on each node.
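The two ideas named in the abstract, de-duplication through an index and node selection driven by client conditions, can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the class names, the SHA-256 fingerprint, the `bandwidth_mbps` and `load` fields, and the selection rule are all assumptions made for illustration, since the abstract does not specify them.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class StorageNode:
    """Hypothetical storage node tracked by the index server."""
    name: str
    bandwidth_mbps: float  # assumed metric: bandwidth the node can serve
    load: int = 0          # assumed feedback value: number of stored objects


@dataclass
class IndexNameServer:
    """Toy index: maps a content fingerprint to the node holding that content."""
    nodes: list
    index: dict = field(default_factory=dict)  # fingerprint -> node name

    def fingerprint(self, data: bytes) -> str:
        # SHA-256 is an assumption; the abstract does not name a hash function.
        return hashlib.sha256(data).hexdigest()

    def pick_node(self, client_mbps: float) -> StorageNode:
        # Prefer nodes that can match the client's bandwidth; fall back to all
        # nodes if none can. Among candidates, take the least loaded one.
        able = [n for n in self.nodes if n.bandwidth_mbps >= client_mbps]
        candidates = able or self.nodes
        return min(candidates, key=lambda n: (n.load, n.bandwidth_mbps))

    def store(self, data: bytes, client_mbps: float):
        """Return (node name, was_duplicate) for a write request."""
        fp = self.fingerprint(data)
        if fp in self.index:
            # Duplicate content: reuse the existing copy, nothing is written.
            return self.index[fp], True
        node = self.pick_node(client_mbps)
        node.load += 1
        self.index[fp] = node.name
        return node.name, False
```

In this sketch, a second `store` call with identical bytes returns the node that already holds the data and flags it as a duplicate, while new content goes to the least loaded node that can match the client's bandwidth.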

Parallel Keywords

Cloud Storage; DHT; INS; Deduplication

