透過您的圖書館登入
IP:18.216.124.8
  • 學位論文

MapReduce架構下循序樣式探勘演算法之效能分析

Performance Comparison of Sequential Pattern Mining Algorithms Based on Mapreduce Framework

指導教授 : 陳世穎
共同指導教授 : 陳弘明(Hung-Ming Chen)

摘要


由於雲端科技的普及和巨量資料的累積,如何更有效率地縮短時間處理分析大量資料,成為一個重要的研究方向,尤其是大量資料分析所使用的資料探勘技術有很多種,其中也包含了關聯式規則探勘演算法與循序樣式探勘演算法。本研究的目的是透過MapReduce架構平行化設計並分析兩種不同循序樣式探勘演算法之間效能分析,包括循序樣本探勘演算法中的AprioriAll演算法與GSP演算法,以平行運算有效處理大型資料庫並進行兩者的效能分析。實驗結果顯示,GSP平行化演算法較AprioriAll平行化演算法,有較佳的效能。

並列摘要


Because that the popularity of cloud technology and the accumulation of large amounts of data, it is very important direction of research to reduce time for processing large amounts of data efficiently. Besides, there are many kinds of data mining technique which are used in analyzing of huge amounts of data, which contains the association rule mining algorithms and sequential pattern mining algorithms. In this study, two sequential pattern mining algorithms, GSP algorithm and AprioriAll algorithm, are parallelized through the MapReduce framework. Also, we design and study the different efficiency between the two kinds of sequential pattern mining algorithms, and analyze the different efficiency between GSP algorithm and AprioriAll algorithm. The results show that the parallelized GSP algorithm is better than the parallelized AprioriAll algorithm.

參考文獻


[1] A. Abraham, “Artificial neural networks,” handbook of measuring system design, 2005.
[3] R. Agrawal, T. Imielinski, and A. Swami, “Mining association rules between sets of items in large database,” IN ACM SIGMOD Record, Vol. 22, No. 2, pp.207-216, June., 1993.
[6] R. Agrawal and R. Srikant, “Mining sequential patterns: Generalizations and performance improvements,” In 5th Intl. Conf. Extending Database Technology, pp.3-17, March., 1996.
[9] E. Y. Chang, H. Li, Y. Wang, and M. Zhang, “Pfp: parallel fp-growth for query recommendation,” in the ACM Conference Series on Recommender Systems, pp.107-114, 2008.
[12] C. M., Fonseca, & P. J. Fleming, “Genetic Algorithms for Multiobjective Optimization: FormulationDiscussion and Generalization,” ICGA. Vol. 93, 1993.

被引用紀錄


蘇育群(2015)。應用平行關聯演算法於中式速食連鎖餐廳之套餐設計〔碩士論文,淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2015.00934
李卓勳(2014)。基於Hadoop叢集之具關聯式規則探勘雲端系統設計與效能之研究〔碩士論文,國立臺中科技大學〕。華藝線上圖書館。https://doi.org/10.6826/NUTC.2014.00114

延伸閱讀