Title

數位文件之資訊組織與主題分析自動化之技術與應用

Translated Titles

Automatic Information Organization and Subject Analysis for Digital Documents

Authors

曾元顯(Yuen-Hsien Tseng)

Key Words

索引 ; 檢索 ; 索引典 ; 分類 ; 自動化 ; Indexing ; Retrieval ; Thesaurus ; Classification ; Automatic

Publication Name

臺北市立圖書館館訊

Volume or Term/Year and Month of Publication

Vol. 20, No. 2 (2002/12/01)

Page #

29 - 43

Content Language

Traditional Chinese

Chinese Abstract

資訊組織與主題分析是圖書館學的理論與實務中最主要的課題之一,其目的在探討如何分析並組織文件,以提供使用者便捷、有效的資訊服務。資訊組織與主題分析需要高度的知識加工處理,傳統上有賴於訓練有素的圖書館人員進行此項工作。由於資訊科技的持續進步,使得資訊組織與主題分析探討的很多課題有自動化處理的作法。本文便是在介紹筆者發展的一些自動化的方法,並探討其中的觀念、技術、應用、與其未來的影響。文中特別介紹自動化索引、索引典自動建構以及自動分類,並展示其應用範例,期使讀者能具備自動化作法的概念,以便有效運用現有的資訊科技,更有效率的進行資訊組織與主題分析的工作。

English Abstract

Information organization and subject analysis (IOSA) is one of the main concerns of library science and library services. Its goal is to analyze and organize documents so that users have effective and efficient access to information. IOSA requires intensive intellectual processing and has traditionally depended on well-trained librarians. With continuing advances in information technology, many IOSA tasks can now be handled automatically. This article introduces several automatic approaches developed by the author and explores their concepts, techniques, applications, and future impact. Specifically, it describes automatic indexing, automatic thesaurus construction, and automatic text categorization. Application examples are demonstrated to help readers understand these approaches, so that future IOSA work can be carried out more efficiently by combining human effort with automatic methods.

Topic Category

Humanities > Library and Information Science

Reference
  1. Chan, Lois Mai(1994).Cataloging and Classification: An Introduction.New York:McGraw-Hill.
  2. Gerard Salton(1989).Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer.Addison-Wesley.
  3. Hsinchun Chen,Tak Yim,David Fye,Bruce Schatz(1995).Automatic Thesaurus Generation for an Electronic Community System.Journal of the American Society for Information Science,46(3),175-193.
  4. Hwee Tou Ng,Wei Boon Goh,Kok Leong Low(1997).Feature Selection, Perceptron Learning, and a Usability Case Study for Text Categorization.Proceedings of the 20th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval
  5. Olson, Hope A.,John J. Boll(2001).Subject analysis in online catalogs.Englewood, Colorado:Libraries Unlimited.
  6. Olson, Hope A.,Kathleen de la Pena McCook (ed.)(1998).Global reach/Local touch.Chicago:American Library Association.
  7. Rila Mandala,Takenobu Tokunaga,Hozumi Tanaka(1999).Combining multiple evidence from different types of thesaurus for query expansion.Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
  8. Web World of Authority Control
  9. Thorsten Joachims(2001).A Statistical Learning Model of Text Classification for Support Vector Machines.Proceedings of the 24th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval
  10. Wai Lam,Kwok-Yin Lai(2001).A Meta-Learning Approach for Text Categorization.Proceedings of the 24th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval
  11. William W. Cohen,Yoram Singer(1996).Context-Sensitive Learning Methods for Text Categorization.Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval
  12. Yiming Yang,Tom Ault,Thomas Pierce,Charles W. Lattimer(2000).Improving Text Categorization Methods for Event Tracking.Proceedings of the 23rd Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval
  13. Yuen-Hsien Tseng(2001).Automatic Cataloguing and Searching for Retrospective Data by Use of OCR Text.Journal of the American Society for Information Science and Technology,52(5),378-390.
  14. Yuen-Hsien Tseng(2002).Automatic Thesaurus Generation for Chinese Documents.Journal of the American Society for Information Science and Technology,53(13),1130-1138.
  15. Yuen-Hsien Tseng,Douglas W. Oard(2001).Document Image Retrieval Techniques for Chinese.Proceedings of the Fourth Symposium on Document Image Understanding Technology,Columbia, Maryland
  16. 美國資訊科學學會臺北分會編(1994)。索引典理論與實務。台北市:美國資訊科學學會臺北分會。
  17. 胡述兆、吳祖善(1989)。圖書館學導論。漢美圖書有限公司。
  18. 莊雅蓁(1999)。資訊檢索之索引典研究。中國圖書館學會會報,63,77-89。
  19. 曾元顯(2001)。共現索引典之自動建構、評估與應用。台灣大學圖書資訊學系四十週年系慶學術研討會
  20. 曾元顯(2001)。回溯性資料數位化服務之規劃與建置。二十一世紀資訊科學與技術國際學術研討會
  21. 曾元顯(2002)。文件主題自動分類成效因素探討。中國圖書館學會會報,68,62-83。
  22. 曾元顯、林瑜一(1998)。模糊搜尋、相關詞提示與相關詞回饋在OPAC系統中的成效評估。中國圖書館學會會報,61,103-125。
  23. 黃慕萱(1996)。資訊檢索。台北市:臺灣學生。
  24. 蔡明月(1991)。線上資訊檢索-理論與應用。台北市:臺灣學生。
Times Cited
  1. 陳慧倫(2006)。大學音樂系教師對音樂性非書資料館藏檢索需求之研究-以東吳大學為例。臺灣師範大學社會教育學系在職進修碩士班學位論文。2006。1-145。
  2. 谷佳臻(2007)。電腦輔助分析軟體運用於質性研究訪談稿內容分析之探討。臺灣師範大學圖書資訊學研究所學位論文。2007。1-107。
  3. 林佳怡(2011)。人文社會科學引文索引資料庫之系統結構與欄位設計研究。政治大學圖書資訊與檔案學研究所學位論文。2011。1-234。
  4. 趙婉婷(2011)。應用文字探勘分析網路團購商品群集之研究 -以美食類商品為例。政治大學資訊管理研究所學位論文。2011。1-54。