  • Thesis

基於字詞關係動態建立階層分群

Dynamic Hierarchical Clustering Based on Taxonomy

Advisor: 林熙禎

Abstract


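With the arrival of the information-explosion era, more and more users search the Internet for relevant material to read. The goal of this study is to apply hierarchical clustering to a large collection of documents and to use the relationships among terms to build a taxonomy with parent-child containment relationships, which then serves as the labels of the hierarchical clusters. In practice, this lets users quickly see which topics a document collection covers and promptly select documents on the topic they need. The system architecture proposed in this study effectively improves on five issues in hierarchical clustering research: high-dimensional vectors, dynamic feature selection and document clustering, document processing order, cross-domain document clustering, and the relationships among cluster labels.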

Parallel Abstract


With the popularity of the Internet, the World Wide Web now contains a vast amount of information, and finding relevant information in a large number of texts has become a challenge for users. Hierarchical clustering is one way to address this problem, because it lets users browse topics step by step and find the documents most relevant to their interests. However, hierarchical clustering still faces several challenges that must be addressed: the high dimensionality of the data, dynamic data sets, sensitivity to input order, documents that cover several concepts, and the relationship between clusters and tags. In this paper, we propose an approach to dynamic hierarchical clustering based on taxonomy to address these challenges. The experimental results show that our method is not only suitable for constructing hierarchical clusterings over dynamic data sets, but also offers a structure that is easier to browse than those of the traditional algorithms BKM and UPGMA. In addition, the clusters are labeled with meaningful tags linked by containment relationships, which lets users quickly grasp the overall concept of each cluster.
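For readers unfamiliar with UPGMA, one of the baseline algorithms named in the abstract, the following is a minimal sketch of average-linkage agglomerative clustering. It is not the method proposed in the thesis. The toy term-count vectors, the choice of cosine distance, and the function names `cosine_distance` and `upgma` are all assumptions made for illustration.

```python
import math
from itertools import combinations


def cosine_distance(a, b):
    """Cosine distance between two equal-length term-count vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty document as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def upgma(vectors):
    """Average-linkage clustering; returns the merge history.

    Each entry is (members_a, members_b, distance), where the members
    are tuples of document indices.
    """
    clusters = [(i,) for i in range(len(vectors))]
    merges = []
    while len(clusters) > 1:
        best = None
        for ca, cb in combinations(clusters, 2):
            # UPGMA: mean of all pairwise distances between the two clusters
            total = sum(
                cosine_distance(vectors[i], vectors[j]) for i in ca for j in cb
            )
            dist = total / (len(ca) * len(cb))
            if best is None or dist < best[0]:
                best = (dist, ca, cb)
        dist, ca, cb = best
        clusters = [c for c in clusters if c not in (ca, cb)]
        clusters.append(tuple(sorted(ca + cb)))
        merges.append((ca, cb, dist))
    return merges


# Hypothetical toy corpus: four documents over a four-term vocabulary.
docs = [
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 3, 1],
    [0, 0, 1, 3],
]
merges = upgma(docs)
for members_a, members_b, dist in merges:
    print(members_a, members_b, round(dist, 3))
```

The merge history forms a binary tree in which documents 0 and 1 join first, then 2 and 3, and finally the two pairs. Such a tree has no labels on its internal nodes, which is the gap the abstract's taxonomy-based cluster tags are meant to fill.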


Cited by


李佩儒 (2014). 利用自建Ontological User Profile應用於文字文件推薦 [Master's thesis, National Central University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0031-0412201511590667
江欣鴻 (2015). 以自建本體進行使用者興趣偵測與文件推薦 [Master's thesis, National Central University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0031-0412201512073363
