資訊的組織與擷取

綱際綱路的發展使得資訊檢索的研究進入更具挑戰性的環境，然而資訊檢索系統通常僅僅告訴使用者有哪些相關的文件，而非真正提供使用者所需要的資訊。資訊擷取的研究則是進一步分析文件，依據預先定義的樣版取出特定的資訊。參照於國畫館以機讀編目格式描述藏品，資訊擷取系統所稱的樣版與機讀編目格式都可視為一種無資料格式，亦即是用於描述資料的資料。本文說明元資料與資訊擷取的關係，並討論如何藉由自然語言處理的語言分析技術有效協助使用者擷取所需要的資訊。

關鍵字

資訊檢索；資訊擷取；元資料

並列摘要

The development of Internet makes the researches on information retrieval more changeable. Actually, the so-called ”information retrieval” is ”text retrieval.” It is necessary for users to find out the needed information from the retrieved texts. A higher-level task is information extraction, which extracts the information based on pre-defined templates. From the viewpoint of Library Science, these pre-defined templates are the metadata, which describes the collection of libraries in common. This paper discusses the relationships between metadata and information extraction and how natural language processing helps the task of information extraction.

並列關鍵字

Information Retrieval ； Information Extraction ； Metadata

被引用紀錄

黃婉筑（2007）。無線電視台發展數位影音內容加值服務規劃之研究〔碩士論文，元智大學〕。華藝線上圖書館。https://doi.org/10.6838/YZU.2007.00346

沈嘉慧（2008）。應用文字挖掘技術以整合文件與GIS—以陽明山國家公園研究報告為例〔碩士論文，國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2008.01546

劉千里（2011）。網頁探勘技術應用於論壇用戶文章-以mobile01電影版為例〔碩士論文，國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-2206201123480600

國際替代計量

資訊的組織與擷取

全文下載

主題瀏覽