事實上,很多企業不是沒有知識庫或資料倉儲,而是知識庫太繁雜,以致在需要時無法適當地取得資料;再加上網際網路的興起,網路上龐大的、未經組識與分類的、及高重複性的資料特性使得資料擷取問題更加複雜。透過一般常用的搜尋引擎(如:google)會搜尋到上千筆的資料。對於使用者而言,瀏覧超過數百萬個網頁來尋找相關的資料是一項沉重的負擔,而目前已開發的搜尋系統並無法確切地滿足使用者的需求。資料搜尋,有必要利用資訊技術來尋找相關且高品質的資訊。然而,僅藉由搜尋引擎來尋找知識是不足的,因爲即使目前大部份的搜尋引擎都有提供依相關性排序及本文摘要的功能。通常使用者還是得透過搜尋引擎尋找,數次、瀏覧許多不必要的網頁之後才能找到所需的資料,而非一次就能完成。因此本研究的主要目的,在於介紹如何利用文字探勘來發現蘊藏在大量中文文件中的知識。本文也將深入探討此技術的各項主要元件。透過主題地圖的實證研究,我們將制作兩類的主題地圖,分別是顯性知識(臺灣證券暨期貨法令資料)及隱性知識(王永慶思想哲學)。藉由這兩個地圖的比較來探討顯性知識與隱性知識在主題地圖的呈現上所發現的問題。
Knowledge management (KM) has received much attention from both academics and practitioners in the past few years. Following the KM trend, many organizations have built their own knowledge repositories or data warehouses. However, information or knowledge is still scattered everywhere without being properly managed. The rapid growth of the Internet accelerates the creation of unstructured and unclassified information and causes the explosion of information overload. The effort of browsing information through general-purpose search engines turns out to be tedious and painstaking. Hence, an effective technology to solve this information retrieval problem is much needed. The purpose of this research is to explore the application of text mining technique in organizing knowledge stored in unstructured natural language text documents. Major components of text mining techniques required for topic map in particular will be presented in detail. Two sets of unstructured documents are utilized to demonstrate the usage of SOM for topic categorization. The first set of documents is a collection of speeches given by Y.C. Wang, Chairman of the Taiwan Plastics Group, and the other is the collection of all laws and regulations related to securities and future markets in Taiwan. We also try to apply text mining to these two sets of documents to generate their respective topic maps, thus revealing the differences between organizing explicit and tacit knowledge as well as the difficulties associated with tacit knowledge.
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。