透過您的圖書館登入
IP:3.136.154.103

摘要


資料探勘技術應用於萃取大型資料庫中有用之資訊,以輔助決策之參考。關聯法則是資料探勘技術中最被廣為研究與使用的方法,其在發掘高頻項目集之瓶頸為需多次掃描資料庫並逐層產生大量需比對的候選項目集。本研究提出一個新的矩陣為基礎之關聯法則MBAR(Matrix-Based Association Rule),以克服上述缺點。MBAR之執行步驟為:(1)發掘L1並建立矩陣TMatrix-只需掃描原始資料庫一次以發掘L1,並建立以L1之項目為列(row)與交易記錄編號TID為欄(column)之矩陣TMatrix;(2)發掘高頻項目集L2-將矩陣TMatrix之列兩兩進行AND運算以直接發掘L2,無需產生C2,並建立L2之布林上三角矩陣SMatrix;(3)產生所有長度≧3之候選項目集-利用L2之布林上三角矩陣SMatrix特性配合項目集間遞移性,產生所有長度≧3之候選項目集;(4)發掘長度≧3之高頻項目集-利用TMatrix之列間的AND運算發掘所有長度≧3高頻項目集。實驗結果顯示,MBAR之執行效率與穩定性優於FP-Growth關聯法則。

關鍵字

資料探勘 關聯法則 矩陣 遞移性

並列摘要


Algorithms of association rules in techniques of data mining are used to find associations of products or bundles. Association algorithms proposed in the present studies are hedged about with starting at level 1. It costs a lot of time to find suitable bundles via steps of reiterating combination and calculation. In order to overcome the bottlenecks of association rule, we proposed a new Matrix-Based Association Rule (MBAR), finding out large 2-itemsets by intersecting each two rows of the matrix which was made up of large 1-items and transaction numbers without scanning the database, and using transitivity and the property of the upper triangular matrix which was made up of the pairs of large 2-itemsets to discover all potential candidate k-itemsets, where k≧3 Finally, finding out large k-itemsets, where k≧3, by intersecting each two rows of the upper triangular matrix. Experiments with real-life databases show that MBAR outperforms FP-Growth, a well-known and widely used association rule.

並列關鍵字

Data Mining Association Rules Matrix Transitivity

參考文獻


蔡玉娟、張簡雅文、黃彥文(2003)。快速反向關聯法則與調整緊密規則-促銷商品組合之應用。資訊管理學報。10(1),181-204。
Agarwal, R. C.,Aggarwal, C. C.,Prasad, V. V. V.(1999).A Tree Projection Algorithm For Generation of Frequent Itemsets.(In Proceedings of High Performance Data Mining Workshop, Puerto Rico).
Agrawal, R.,Imielinski, T.,Swami, A.(1993).Database Mining: A Performance Perspective.IEEE Transactions on Knowledge and Data Engineering.5,914-925.
Agrawal, R.,Imielinski, T.,Swami, A.(1993).Mining Association Rules between Sets of Items in Large Databases.(Proceedings of ACM SIGMOD Conference on Management of Data).
Agrawal, R.,Srikant, R.(1994).Fast Algorithms for Mining Association Rules.(Proceedings of the 20th International Conf. on Very Large Databases).

被引用紀錄


林永翔(2010)。資料切割排序法在關聯規則搜尋之應用-以台電事故維修系統為例〔碩士論文,國立屏東科技大學〕。華藝線上圖書館。https://doi.org/10.6346/NPUST.2010.00066
鄭又銘(2011)。資料分割排序法在代謝症候群與生活型態關聯性法則搜尋之應用〔碩士論文,國立屏東科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0042-2202201313562690

延伸閱讀


國際替代計量