透過您的圖書館登入
IP:3.145.186.6
  • 學位論文

資料倉儲實體化視域選取之研究─以資料方體之建置為例

STUDY OF MATERIALIZED VIEWS SELECTION IN A DATA WAREHOUSE ─ EXAMPLE BY IMPLEMENTING DATA CUBES

指導教授 : 林志麟
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


決策者常以多維度(multidimensional)的方式來檢視資料倉儲(data warehouse)內的彙總資料,這種多維度的資料檢視結構稱之為資料方體(data cubes)。在關聯式資料庫(relational databases)中,我們可以把這些資料方體視為是某些視域(views)的集合。為了有效提升資料倉儲系統的彙總查詢效能,將資料方體內相關的視域予以實體化(materialized)是常用的一種方式。然而一旦視域被實體化,系統管理者對其建置與維護的成本就必須要加以考量。由於受到儲存空間的限制,如何選取適當的實體化視域(materialized views),以有效地提升系統查詢效能(降低總查詢回應時間),同時降低維護視域的成本,便成為資料倉儲系統設計的重點。 本研究調查了目前有關選取實體化視域的相關研究,並設計了一個逆向式貪心演算法,在儲存空間的限制下,解決視域選取的問題。有別於之前演算法的觀點,我們從傷害性(damage)的角度去評估實體化視域,除了能夠提供系統管理者不同的實體化視域選取策略,以滿足管理者不同的需求;對某些資料方體,與之前演算法比較,逆向式貪心演算法會有較佳的選取。最後提出本研究演算法與之前演算法的結合應用,使資料方體的視域選取更為完善與周延。

並列摘要


Decision makers often view aggregate data in a data warehouse via multidimensional data cubes. In relational databases, we refer a data cube as a set of views. In order to improve the query performance against the data cube, the common technique used is to materialize some of the views in the data cube. Once a view is chosen to be materialized, the system manager must consider its implementation and maintenance cost. Because of space limit, it is important to select the right set of views in the data cube to materialize that improve query performance and reduce the maintenance cost. In this thesis, we investigate previous works on the selection of materialized views in a data warehouse, and design a backward greedy algorithm which solve the problem of selecting materialized views in data cubes under space constraint. Unlike previous algorithms, we evaluate each view by calculating its damage to the overall performance. In addition to provide a different selection strategy for system managers to satisfy their need, in some cases backward greedy algorithm provides a better view selection than previous algorithms. We also combine our algorithm with previous algorithms to further improve the results of view selection.

並列關鍵字

data warehouse data cube materialized view

參考文獻


[10] Amit Shukla, Prasad Deshpande, and Jeffrey F. Naughton, “Materialized View Selection for Multidimensional Datasets,” Proceedings of 24rd International Conference on Very Large Data Bases (VLDB 1998), pp. 488-499, August 1998.
[3] Kenneth A. Ross, Divesh Srivastava, and S. Sudarshan, “Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time,” Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data (SIGMOD''96), pp. 447-458, June 1996.
[6] Himanshu Gupta, Venky Harinarayan, Anand Rajaraman, and Jeffrey D. Ullman, “Index Selection for OLAP,” Proceedings of the 13th International Conference on Data Engineering, pp. 208-219, April 1997.
[7] C. I. Ezeife, “A Uniform Approach for Selecting Views and Indexes in a Data Warehouse,” Proceedings of the International Database Engineering and Applications Symposium (IDEAS 1997), pp. 151-160, August 1997.
[8] Jian Yang, Kamalakar Karlapalem, and Qing Li, “Algorithms for Materialized View Design in Data Warehousing Environment,” Proceedings of 23rd International Conference on Very Large Data Bases (VLDB 1997), pp. 136-145, August 1997.

被引用紀錄


劉文義(2005)。應用基因演算法設計資料倉儲與效能分析〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2005.00024
Wang, C. (2001). 物件關連式資料倉儲實體設計與綱要分割之研究 [master's thesis, Yuan Ze University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611305465
林聖斌(2002)。物件關聯式資料倉儲之實體化視域選擇〔碩士論文,元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611361746
陳佳揚(2002)。範圍查詢近似總和之動態資料方體〔碩士論文,元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611365835
郭光軒(2006)。應用粒子群最佳化演算法設計資料倉儲與效能分析〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-1207200613201300

延伸閱讀