透過您的圖書館登入
IP:3.137.171.121
  • 學位論文

應用階層式概念關聯索引典於Web目錄整合之研究

Learning to Integrate Web Catalogs with Conceptual Relationships in a Hierarchical Thesaurus

指導教授 : 楊正仁
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


目錄整合在Web資源使用以及電子商務領域中已成為重要的研究議題。過去的研究曾經指出,如果能夠利用來源目錄所隱含的目錄資訊以及採用階層式結構的整合方法,目錄整合的精確性將更為改善。 然而,在過往目錄整合的研究中,雖然有利用了來源目錄的階層式結構資訊,但在大多數仍採用攤平式的目錄整合方法,缺乏對階層式目錄整合的討論。因此本論文針對階層式Web目錄整合機制進行探討, 提出加強式目錄整合方法,運用階層式目錄的層次關係,擷取出概念關聯索引典應用於目錄整合工作,並結合目前在文件分類上具有良好表現的支援向量機進行實作。實驗結果顯示加強式目錄整合方法能有效的提昇攤平式目錄整合工作和階層式目錄整合工作的整合效能, 而且加強式目錄整合方法應用在階層式目錄整合工作上比用在攤平式目錄整合工作上有更明顯的效能提昇。

並列摘要


Web catalog integration has been addressed as an important issue in current digital content management. Past studies have shown that exploiting a flattened structure with auxiliary information extracted from the source catalog can improve the integration results. Although earlier studies have also shown that exploiting a hierarchical structure in classification may bring better advantages, the effectiveness has not been studied in catalog integration. In this thesis, we propose an enhanced catalog integration (ECI) approach extracting the conceptual relationships from the hierarchical Web thesaurus to improve the accuracy performance. We have conducted experiments with real-world catalogs in a flatten format and in a hierarchical format. The results show that the ECI scheme effectively boosts the integration accuracy of both the flattened scheme and the hierarchical scheme with the Support Vector Machine (SVM) classifiers.

參考文獻


[4] J. K. Dongkyu Kim and S.-G. Lee,. Reducing Multiclass to Binary by Coupling Probability Estimates,. in Proceedings of the 12th International Workshop on Research Issues in Data Engineering: Engineering E-Commerce/E-Business Systems (RIDE'02), February 2002, pp. 28.33.
[8] T. Joachims,. SVMlight - An Implementation of Support Vector Machines (SVMs) in C,. http://svmlight.joachims.org/.
[10] M. F. Porter,. The Porter Stemming Algorithm,. http://www.sigir.org/resources.html.
[11] G. Salton and C. Buckley,. Term Weighting Approaches in Automatic Text Retrieval. Pergamon Press, Inc., 1998, vol. 4.
[16] C.-W. Wu, T.-H. Tsai, and W.-L. Hsu,. Learning to Integrate Web Taxonomies with Fine-Grained Relations: A Case Study Using Maximum Entropy Model,. in Proceedings of the Asia Information Retrieval Symposium 2005 (AIRS2005), October 2005, pp. 190.205.

延伸閱讀