透過您的圖書館登入
IP:3.133.157.12
  • 期刊

使用N組連結平均法的階層式自動分群

A Novel Algorithm Using N-link Average for Hierarchical Automatic Clustering

摘要


本研究提出以N組連結平均法的階層式自動分群演算法,其具備任意形狀的群聚探索能力,並有效避免鏈結效應的影響而提升分群結果的正確率。與相關文獻比較,對於自動分析群聚數量能更加精確。本研究實驗採用人工合成的二維資料集,分別與分割式分群演算法(k-means and PAM)、階層式分群演算法(Single-link, Complete-link, Group average, and Centroid)與結合k-means 及階層式分群法之二階段分群演算法(HKC)比較,獲得對於任意形狀資料集有更正確的分群結果。另以CHAMELEON的資料集比較文獻在自動分群的正確性,獲得更具精確性的群數判斷。

並列摘要


This study proposed a novel method of using N-link average for hierarchical automatic clustering, which has the ability to explore arbitrary shapes and can improve the accuracy of clustering to avoid chaining effect efficiently. Comparing with relevant literature, this method is more correct for the data of automatic clustering analysis. The experiment uses two-dimensional synthetic data to compare separately with Partitional Clustering Algorithm (k-means and PAM), Hierarchical Clustering Algorithms (Single-link, Complete-link, Group average, and Centroid) and a Two-Phase Clustering Algorithm based on K-means and Hierarchical Clustering with Single-Linkage Agglomerative Method and the results shows the new method we proposed can generate the clustering effect more correct for the data set of arbitrary shapes. Besides, comparison with the accuracy of automatic clustering in other relevant literature, adopting the data set of CHAMELEON can obtain more precise judgment of the number of clusters.

參考文獻


林正芳(2002)。以重力理論為基礎的二階段階層式資料分群演算法特性之研究(碩士論文)。國立台灣大學資訊工程研究所。
陳同孝、陳雨霖、劉明山、許文綬、林志強、邱永興(2006)。結合K-means及階層式分群法之二階段分群演算法。電腦學刊。17(1),65-75。
曾憲雄、蔡秀滿、蘇東興、曾秋蓉、王慶堯(2008)。資料探勘Data mining。台北市:旗標。
Almeida, J. A. S.,Barbosa, L. M. S.,Pais, A. A. C. C.,Formosinho, S. J.(2007).Improving hierarchical cluster analysis: A new method with outlier detection and automatic clustering.Chemometrics and Intelligent Laboratory Systems.87(2),208-217.
Downs, G. M.,Barnard, J. M.(2002).Clustering methods and their uses in computational chemistry.Reviews in computational chemistry.18(Summary),1-40.

被引用紀錄


謝佩鈞(2017)。相似分群方法在風場風機故障檢測的應用研究〔碩士論文,國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU201703624

延伸閱讀