透過您的圖書館登入
IP:3.142.174.55
  • 學位論文

應用增長層級式自我組織映射圖於相似專利文件搜尋系統之研究

Use of GHSOM for recommending patent documents

指導教授 : 陳大正
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


美國專利文件為目前世界上涵蓋範圍最廣、類別數量最多的專利制度,但也因為其類別數量眾多且類別間又具有高度階層性質,導致相似專利文件搜尋不易與一般分類演算法無法呈現出專利文件類別所原有的階層關係,為了解決此一問題,本研究應用以增長層級式自我組織映射網路為基礎的分群架構搭配擷取SOM神經元權重值形成階層式文件分類器的方式,建構一相似專利文件搜尋系統,透過對使用者文件的歸類,縮小相似文件搜尋範圍並進行相似文件搜尋,提供相似專利文件予使用者參考。

並列摘要


The patent system of United States has the most extensive coverage and the largest number of categories in the world. Due to the large number of patent categories with highly hierarchy, it is hard to search similar patent document for a specified patent. Also, the hierarchical relationships among these patent documents are hardly expressed by using the general classification algorithms. In order to solve the difficulties, the GHSOM has been trained as a classifier to construct the patent documents suggestion system.

並列關鍵字

Patent Documents Classification GHSOM

參考文獻


[1] D. Sullivan, " Document Warehousing and Text Mining ", Wiley Computer Publishing, 2001.
[2] G. Salton, A. Wong and C.S. Yang, " A Vector Space Model for Information Retrieval ", Journal of the American Society for Information Science, vol. 18(11), pp. 613-620, 1975.
[3] G. Salton, " Automatic Text Processing: the Transformation, Analysis, and Retrieval of Information by Computer ", Reading, MA: Addison-Wesley, 1989.
[4] S. Chakrabarti, B. Dora, R. Agrawal and P. Raghavan, " Using taxonomy, discriminants, and signatures for navigating in text databases ", Proceedings of the 23rd international Conference on Very Large Data Bases (VLDB), pp. 446-455, 1997.
[5] S. Chakrabarti, B. Dora, R. Agrawal and P. Raghavan, " Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies ", Proceedings of the 23rd international Conference on Very Large Data Bases (VLDB), pp. 163-178, 1998.

延伸閱讀