透過您的圖書館登入
IP:18.221.187.121
  • 期刊

針對重要稀少性資料之一種有效率關聯式探勘方法設計

An Efficient Method for Mining Association Rules on Significant Rare Data

摘要


關聯法則(Association Rules)廣泛應用於資料探勘研究方法,於過往研究中,大都針對支持度(Support)較高之高頻項目集(Frequent ItemSets)進行探勘,然而卻無法迅速且有效探勘出支持度小但卻擁有重要關聯性之重要稀少性資料(Significant Rare Data),亦即所謂之半高頻項目集(Semi-frequent ItemSets)。現今有部份研究針對具備重要關連法則之稀少性資料,進行相關探勘方法設計,其方法大都採用由下而上(Bottom-Up)搜尋方式,但往往無法有效率探勘出最大半高頻項目集(Maximal Semi-frequent ItemSets)。針對上述問題,本研究提出與設計專門針對重要稀少性資料之最大半高頻項目集探勘演算法(Maximum Semi-frequent Itemsets Algorithm, MSIA),MSIA可有效整合分群(Cluster)與分解(Decomposition)探勘概念,並結合篩選法(Filter)與相對支持度(Relative Support)分析方法,採由上而下(Top-Down)之搜尋機制進行高效率最大半高頻項目集探勘。由效能實驗結果可知,MSIA於探勘過程中可以有效降低原始來源資料庫(Source Database)讀取掃描次數,提升探勘效能以節省探勘時所花費之時間成本,進而有效且快速取得重要稀少性資料中之最大半高頻項目集。

並列摘要


Mining out the association rules is the popular research issue in data mining research. In recent years, many studies have focused on discovering the important association rules based on the criteria of maximum support and confidence for frequent itemsets. The significant rare data, i.e., the semi-frequently itemsets, are not easily to mine out the important association rules using traditional mining methods. Some mining methods based on the bottom-up policy can not efficiently mine out association rules from longer length of semi-frequent itemsets. The time complexity of mining process is very high due to the generation of large candidates by repeatedly scanning source database. This research proposed the maximum semi-frequent itemsets algorithm (MSIA), which quickly and efficiently mining out the association rules on the significant rare data. MSIA is a top-down approach by combining the techniques of clustering, decomposition, filtering, and relative supports to efficiently search the source database. From the performance of experiment results, the MSIA can decrease the time complexity of scanning database and thus significantly reduce the number of candidate itemsets. MSIA efficiently mines out the useful association rules from the maximum semi-frequent itemsets.

參考文獻


蔡玉娟、張簡雅文、黃彥文(2003)。快速反向關聯法則與調整緊密規則-促銷商品組合之應用。資訊管理學報。10(1),181-204。
Agrawal, R.,Imilienski, T.,Swami, A.(1993).Mining Association Rules between Sets of Items in Large Databases.ACM SIGMOD International Conference on Management of Data.(ACM SIGMOD International Conference on Management of Data).
Agrawal, R.,Srikant, R.(1994).Fast algorithms for mining association rules.VLDB Conference.(VLDB Conference).
Agrawal, R.,Srikant, R.(1995).Mining Sequential Patterns.(IEEE ICDE).
Ali, K.,Manganaris, S.,Srikant, R.(1997).Partial Classification using Association Rules.3rd International Conference on Knowledge Discovery in Databases and Data Mining.(3rd International Conference on Knowledge Discovery in Databases and Data Mining).:

被引用紀錄


李晏華、黃冠凱、吳信宏(2022)。探討醫院異常事件通報病患發生跌倒事件之分析-以中部某區域教學醫院為例品質學報29(2),99-117。https://doi.org/10.6220/joq.202204_29(2).0001

延伸閱讀