隨著資訊科技的進步發展,資料庫的儲存資訊與日俱增,研究如何從大型數據資料庫內快速擷取出訊息的資料探勘技術,便成為當今資料分群研究的顯學。 本研究提出一個新的密度式分群演算法,名為QIDBSCAN (Quick IDBSCAN),基於密度式分群演算法架構,改良其擴張程序,以求時間成本之降低。本方法除了與基於先前學者所提出之密度式演算法架構,且在不增加架構的複雜性之下,兼具非線性合併之密度式演算法的優點,使得本論文提出之演算法能確實縮減時間成本。經由實驗結果的驗證,本論文所提出的QIDBSCAN演算法不但可正確的執行資料分群,且確實的減少分群所需花費的時間成本,在分群正確率與雜訊濾除率均可達到先前學者所提出之演算法的水準。經由實驗結果得知,本研究所提出的QIDBSCAN資料分群演算法不但有效率,且可行性極高。
Of the many data clustering algorithms proposed in recent years, the most effective are the density-based clustering algorithms, DBSCAN and IDBSCAN. Although density-based clustering method is effective for identifying graphs, filtering out noise, and obtaining good clustering results, it is extremely time consuming. The IDBSCAN is faster than DBSCAN but is still unsatisfactory. This thesis therefore developed QIDBSCAN (Quick IDBSCAN), a new data clustering algorithm based on IDBSCAN that uses MBOs (Marked Boundary Objects) to expand computing directly without an actual data points selection. The experimental results in this study confirm that QIDBSCAN is substantially faster than IDBSCAN, DBSCAN, and other density-based algorithms.
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。