透過您的圖書館登入
IP:3.142.198.129
  • 學位論文

利用主題地圖建構知識礦掘網站之研究

Study on the Application of Topic Maps in Knowledge Mining

指導教授 : 丁肇隆 蔡進發

摘要


現今的網際網路是一個超大型的資料庫,任何人只要能夠利用電腦連上網路就可以在這個大型資料庫中取得所需的資訊或知識。而形成網際網路蓬勃發展的重要因素是Tim Berners-Lee於1990年所提出的HTML標籤語言。然而,經過十幾年的演變,要在網際網路上找到有價值的資訊或知識卻越來越困難,主要的原因在於網路上的資源絕大多數都沒有適當的內容描述方式。最明顯的例子就是HTML,其標籤內容缺乏嚴謹的資料描述方式,使得目前的搜尋引擎大多僅能透過關鍵字比對的方式,搜尋可能相關的網頁;使用者經常得面對上萬篇的搜尋結果,但只能找到極少有用的資訊或知識。 語意網(Semantic Web)是下一代網際網路主要發展的方向,目標是希望網路上的所有資源透過適當的內容描述方式,使得搜尋引擎或使用者代理程式(User Agent)能夠提高搜尋結果的精確度,但這種遠景尚無法在短時間內完成。因此是否能夠透過目前現有的搜尋引擎、資料探勘技術和語意描述方式,讓使用者能夠容易找到有用且有組織的資訊或知識,是本論文研究的方向。 本研究以主題地圖(Topic Maps)與延伸式標籤語言(Extensible Markup Language, XML)為基礎所制訂的主題地圖標籤語言(XML Topic Maps, XTM)為組織知識架構的方法,以及相關的規範與軟體技術為學習與應用的對象,並以WWW為知識探勘的資料來源建構一個知識礦掘(Knowledge Mining)之樣版架構。

並列摘要


Internet is the biggest, unstructured database in the world today. It's easy for everyone to get information or knowledge from Internet. The Hyper-Text Markup Language (HTML) created by Tim Berners-Lee makes this situation come true. Everyone can publish their web pages on Internet, but no well-structured content description language can be used. That makes it harder and harder to get useful information or knowledge from Internet. The obvious example is HTML. Tags have less content description mechanism. That's why modern search engine such as Google or Yahoo! can only use keyword matching to find out lots of web pages, but useful web pages is very few. The Semantic Web is next generation technology to solve this problem. It's focus on content description. Every shared resource should be given semantic description such that search engine or user agent can "understand" what the resource is and improve the precision of search results, but it's not an easy job. For getting more useful information or knowledge, it’s a possible way to combine recommended content description language we have now, data-mining technologies to find information or knowledge, and structure a more semantic result for user. In this paper, XTM (XML Topic Maps) is a content description language to describe found data or information. Some specifications, mining technologies and software are learned also. We combine these to create a knowledge mining template to mine the useful, well-organized information or knowledge from Internet.

參考文獻


[5] "ebXML Technical Architecture Specification V.1.0.4", OASIS and UN/CEFACT, 14 May 2001.
[6] "Universal Description Discovery and Integration of e-business for Webservice Specification 2.0", UDDI.org, 7 June 2001.
[10] "Wireless Application Protocol Wireless Markup Language Spec. Version 1.2", WAP Forum, WAP WML Version, 4 Nov 1999.
[11] "Voice Extensible Markup Language (VoiceXML), Version 2.0", W3C Recommendation, 16 March 2004.
[15] Schmidt and Ingrid, "A World to Discover: A Topic Map for Thomas Mann", XMLEurope 2001, Berlin, Germany.

被引用紀錄


侯偉富(2007)。應用Topic Map及Google Search API 建構個人化九年一貫教學資源之研究〔碩士論文,國立臺灣師範大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0021-2910200810551496

延伸閱讀