透過您的圖書館登入
IP:3.144.161.116
  • 期刊

結合K-means及階層式分群法之二階段分群演算法

A New Two-Phase Clustering Algorithm Based on K-means and Hierarchical Clustering with Single-Linkage Agglomerative Method

摘要


本文提出一個二階段分群演算法:階層式K-means分群法(HKC, Hierarchical K-means Clustering)。在分割階段,HKC以K-means將資料集合分割成多個群聚。在此增加群聚的數量是爲了降低雜訊及離群值對K-means的影響。在合併階段則採用單一連結聚合演算法來彌補K-means無法探索任意形狀群聚的缺點,並且還能提供樹狀的分群結果。由於K-means將所有要處理的資料減化成數個群聚,所以HKC可以快速的產生樹狀的分群結果。實驗結果顯示,HKC的準確率相當的良好,並且能更有效率地產生樹狀分群結果。

並列摘要


We propose a new clustering algorithm: hierarchical K-means clustering algorithm (HKC), in this paper. HKC consists of two phases. In the first phase, HKC employs K-means clustering algorithm to split the original data into some groups. The purpose of the first phase is to handle the outliers and noises. In the second phase, HKC employs single-linkage agglomerative algorithm, which can discover the arbitrarily shaped clusters and produce a clustering tree, to merge the groups. Since the processed data are simplified to some groups by K-means, the clustering tree could be obtained quickly. In this paper, the accuracy of HKC is evaluated and compared with those of K-means and hierarchical clustering. The experimental results indicated that the accuracy of HKC is better than K-means and hierarchical clustering. Hence HKC could assist the researchers to quickly and accurately analyze data.

被引用紀錄


呂哲嘉(2010)。建構太陽能發電系統之三階段轉換效率預測模型〔碩士論文,國立交通大學〕。華藝線上圖書館。https://doi.org/10.6842/NCTU.2010.00455
許聿慎(2014)。應用混合切割進行分散式資料庫配置〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2014.00729
蘇俊杰(2012)。運用微群集策略於階層式分群法〔碩士論文,中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu201200891
張加憲(2010)。使用N組連結平均法的階層式自動分群〔碩士論文,中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu201000705
郭竹晏(2009)。國小學童在數學直覺法則表現之相關與分群探討〔碩士論文,中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu200900668

延伸閱讀