透過您的圖書館登入
IP:18.220.160.216
  • 學位論文

改良型關聯規則探勘演算法之效能分析

A Performance Analysis of Improved Association Rule Mining Algorithms

指導教授 : 曾明性

摘要


由於進行多屬性且多紀錄之大型資料庫的資料探勘關聯分析執行的效能不彰,本研究為了有效找到符合之關聯規則,藉由整合Column-wise、Vector-based、Data Cutting and Sorting Method、Bitwise Operation等計算方式來減少整體計算量、加速搜尋高頻項目的集合,並使用目標屬性探勘為主,因此提出五種改良型Apriori演算法,針對兩個密集型及兩個稀疏型資料庫做效能測試,並同時和BitTableFI、Index-BitTableFI、Apriori等三種前人演算法進行效能比較。本論文的案例應用係採用內政部2008年新移民女性資料庫,首先進行了描述性統計和卡方檢定,繼之使用改良關聯規則探勘演算法進行相關參數值設定的不同(資料量、屬性量和支持度)以及記憶體使用量進行效能分析,目的就是為了快速找出人口屬性、社經條件與醫療健康和生活輔助需求之關聯性,最後並對應用案例的關聯規則結果進行相關探討。

並列摘要


Many real world applications of association rule mining from multi-attribute and multi-record of large databases help users make better decisions. However, previous studies have produced big numbers of irrelevant patterns and much time is wasted for finding meaningful rules in large and sparse datasets. To efficiently discover interesting rules that connotes causality between antecedent and consequence in a target pattern, we propose five improved algorithms by integrating the Column-wise, Vector-based, Data Cutting and Sorting Method, and Bitwise Operation in this study. Experimental results show that the proposed algorithms can remove imprecise patterns, can fast discover target rules, and outperforms other three previous algorithms including Apriori, BitTableFI, and Index-BitTableFI, especially for lower supports in mining two intensive and two sparse databases. In addition, the proposed five improved algorithms were applied to find associations between socio-economic characteristics and life care needs for new immigrant women in Taiwan. The results also confirm that our approach is practical and effective with good performance for mining class-association rules in large databases.

參考文獻


2. 吳瑞堯、周駿賢,運用資料探勘技術於六大死因慢性疾病之研究,資訊管理學報, 18(1),87-211,2011年。
9. Agrawal, R., Imieliński, T., and Swami, A., (1993). Mining association rules between sets of items in large databases.
14. Dunkel, Brian; Soparkar, Nandit. Data organization and access for efficient data mining. In: Data Engineering, (1999). Proceedings, 15th International Conference on. IEEE, 1999. p. 522-529.
15. Fayyad, U., Piatetsky-Shapiro, G., and Smyth, P., (1996). From data mining to knowledge discovery in databases. AI magazine, 17(3), 37-54.
16. Feyyad, U., (1996). Data mining and knowledge discovery: Making sense out of data. IEEE expert, 11(5), 20-25.

延伸閱讀