Title

建構一個半自動化Semantic Web的RDF/XML產生與展示系統

Translated Titles

Building a semi-automatic Semantic Web based RDF/XML generation and presentation system

Authors

王學治

Key Words

Semantic Web ; RDF ; Ontology ; Dublin Core ; XML ; Pseudo-RDF ; Metadata ; Semantic Web ; RDF ; Ontology ; Dublin Core ; XML ; Pseudo-RDF ; Metadata

PublicationName

元智大學資訊管理學系學位論文

Volume or Term/Year and Month of Publication

2003年

Academic Degree Category

碩士

Advisor

陸承志

Content Language

繁體中文

Chinese Abstract

由於網際網路的普及與社會的變遷,利用網路查找資訊已是非常普遍。不過,現代人在茫茫網海中搜尋的資訊量不但過多,還必須由檢索者親自耗費許多時間過濾搜尋結果;因此,Tim Berner's Lee 提出「Semantic Web」概念,希望將Web內容加上語意結構,以機器可以處理的方式來定義與連結網頁資料,使得資料得以自我解釋,方便被索引與搜尋。 目前建構Semantic Web的方法是使用資源描述框架(RDF,Resource Description Framework)以XML格式來儲存語意資訊,不過RDF所能表達的資料較為原始並且缺乏大量產生RDF/XML的工具;此外,使用者無法利用專家對特定領域的知識與經驗來改善搜尋精準度,使得Semantic Web理想遲遲無法普及。 為解決上述困境,本研究提出一分階段的「擷取、分析、展示」架構,利用專家協助建構的Ontology來半自動產生包含語意的RDF/XML文件,並實作一個系統來驗證本架構的可行性:首先在專家協助下建構目標領域的Ontology,接著由輸入資料中擷取出語意資料,並先以本研究提出的Pseudo RDF格式儲存,然後再轉換成符合標準語法的RDF/XML,最後以網頁介面展示系統建構所得。經實驗成果顯示,本研究架構不僅可行,也可有效加速產生符合精確語意的Semantic Web。

English Abstract

Using search engine to find information is common to all internet users. However, most search engines return too many results than what users want. Users then are required to spend a lot of efforts to filter out what they really want. The Semantic Web concept, proposed by Tim Berner’s Lee, is intented to add semantic description to web contents so that they can be easily indexed and searched by machines. RDF is a tool used to describe Semantic Web contents. Writing RDF on web contents is not only tedious, but also is hard to leverage expert’s experience on special domains. This becomes an obstacle to wide adoption of Semantic Web. A three staged “extraction, analysis and presentation” framework is proposed in this research to remedy the problem stated above. This framework is intended to semi-automatically generate RDF/XML for web contents with ontology of a special domain. We build a prototype system to validate this framework. First, we build ontology of Cihu poetry published at Sung Dynasty with help from domain experts. Then we extract semantic data from some sample poetry into Pseudo-RDF form. Next, we transform Pseudo-RDF into standard-compatible RDF/XML and provide a web interface to demonstrate the results in graphic form. Our results show that generator can be easily to re-generate Semantic RDF/XML once either the domain ontology or web contents are changed.

Topic Category 資訊學院 > 資訊管理學系
社會科學 > 管理學
Reference
  1. 〔1〕 Google Search Engine , http://www.google.com
    連結:
  2. 〔2〕 Open Directory Project , http://www.dmoz.org
    連結:
  3. 〔3〕 Sergey Brin, Lawrence Page , ”The Anatomy of a Large-Scale Hypertextual Web Search Engine” , Computer Networks and ISDN Systems , volume 30 no 1-7 , pp. 107-117 , 1998
    連結:
  4. 〔6〕 Sean B. Palmer , ”The Semantic Web: An Introduction” , http://infomesh.net/2001/swintro/
    連結:
  5. 〔10〕 Semantic Web Definition , http://www.w3.org/2001/sw/
    連結:
  6. 〔14〕 IsaViz:A Visual Authoring Tool for RDF, http://www.w3.org/2001/11/IsaViz/
    連結:
  7. 〔15〕 Hendler, J., “Agents and the Semantic Web. IEEE Intelligent Systems Journal Special Issue on the Semantic Web” , 16( 2) , pp. 30-37 , 2001.
    連結:
  8. 〔18〕 L. Stojanovic, N. Stojanovic, R. Volz , “Migrating data-intensive Web Sites into the Semantic Web” , ACM Symposium on Applied Computing SAC 2002 , Madrid Spain , 10 March 2002
    連結:
  9. 〔20〕 Resource Description Framework , http://www.w3.org/RDF
    連結:
  10. 〔24〕 Michael Denny ,”Ontology tool survey” , http://xml.coverpages.org/Denny-OntologyEditorSurveyTable20021111.html
    連結:
  11. 〔26〕 L. Stojanovic, N. Stojanovic, R. Volz , “Migrating data-intensive Web Sites into the Semantic Web” , ACM Symposium on Applied Computing SAC 2002 , Madrid Spain , 10 March 2002 , pp4
    連結:
  12. 〔30〕 Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Eve Maler , “Extensible Markup Language (XML) 1.0 (Second Edition) “ , 6 October 2000 , http://www.w3.org/TR/REC-xml
    連結:
  13. 〔32〕 吳政叡,「從都柏林核心集看未來資料描述格式的發展趨勢」,圖書館學刊 26 期,(民 86 年 6 月),頁 11-18,網址http://dimes.lins.fju.edu.tw/pub/fju-lins-26/dublin1.htm
    連結:
  14. 〔47〕 Scalable Vector Graphics (SVG) , http://www.w3.org/Graphics/SVG/
    連結:
  15. 〔48〕 PNG (Portable Network Graphics) , http://www.w3.org/Graphics/PNG/
    連結:
  16. 〔4〕 AskJeeve , http://www.askjeeve.com/
  17. 〔5〕 How is the ODP different from other directories such as Yahoo! or LookSmart? , http://dmoz.org/help/geninfo.html#search
  18. 〔7〕 The Semantic Web in breadth , http://logicerror.com/semanticWeb-long
  19. 〔8〕 Tim Berners-Lee, James Hendler and Ora Lassila , “The semantic Web“, Scientific American issue 284(5) , pp. 34-43 , May 2001 , also available at http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC588EF21
  20. 〔9〕 Ying Ding, Dieter Fensel, Michel Klein and Borys Omelayenko , “The semantic web: yet another hip?” , Data & Knowledge Engineering , Volume 41 , Issues 2-3 , June 2002 , also available at http://www.cs.vu.nl/~mcaklein/papers/DKE41.pdf
  21. 〔11〕 Dublin Core Metadata Element Set Reference Description (Version 1.1) , http://www.dublincore.org/documents/dces/
  22. 〔12〕 Engels, R.H.P., B.A. Bremdal, and R. Jones , “CORPORUM: a workbench for the semantic Web” , Workshop on Semantic Web Mining , 12th European Conference on Machine Learning , Freiburg Germany , pp. 1-10 , Sept 2001
  23. 〔13〕 Engels, R.H.P., B.A. Bremdal, and R. Jones , “CORPORUM: a workbench for the semantic Web” , Workshop on Semantic Web Mining , 12th European Conference on Machine Learning , Freiburg Germany , pp. 2 , Sept 2001
  24. 〔16〕 Peter Murray-Rusta and Henry S. Rzepa , ”Towards the Chemical Semantic Web. An introduction to RSS“ , http://www.ch.ic.ac.uk/rzepa/rss/
  25. 〔17〕 A. Maedche, M. Ehrig, S. Handschuh, R. Volz, L. Stojanovic , “Ontology-Focused Crawling of Documents and Relational Metadata” , Proceedings of the Eleventh International World Wide Web Conference WWW-2002 , Hawaii , 30 May 2002
  26. 〔19〕 Engels, R.H.P., B.A. Bremdal, and R. Jones , “CORPORUM: a workbench for the semantic Web” , Workshop on Semantic Web Mining , 12th European Conference on Machine Learning , Freiburg Germany , pp. 3 , Sept 2001
  27. 〔21〕 Horrocks, I. etc al., “The Ontology Interchange Language, OIL” , Technical Report , Free Univ. of Amsterdam , 2000 , also available at http://www.ontoknowledge.org/oil/.
  28. 〔22〕 Grand B. Le, and M. Soto, “XML Topic Maps and Semantic Mining” , Workshop on Semantic Web Mining , 12th European Conference on Machine Learning , Freiburg Germany , pp.67-83 , Sept 2001
  29. 〔23〕 John R. Punin and M. Krishnamoorthy , “Describing Structure and Semantics of Graphs Using an RDF Vocabulary” , http://www.cs.rpi.edu/~puninj/RGML/EXTREME/TALK/rgml/all.html
  30. 〔25〕 José Kahan, Marja-Riitta Koivunen, Eric Prud'Hommeaux, and Ralph R. Swick , ”Annotea: An Open RDF Infrastructure for Shared Web Annotations” , in Proc. of the WWW10 International Conference , Hong Kong , May 2001 , ACM 1-58113-348-0/01/0005. http://www10.org/cdrom/papers/488/index.html
  31. 〔27〕 E. Pietriga , “IsaViz: a Visual Environment for Browsing and Authoring RDF Models” , WWW 2002, the 11th World Wide Web Conference (Developer's day) , Honolulu Hawaii USA , 7-11 May 2002
  32. 〔28〕 Weibel, Stuart, Jean Godby, Eric Miller and Ron Daniel. , “OCLC/NCSA Metadata Workshop Report”, available at http://www.oasis-open.org/cover/metadata.html
  33. 〔29〕 王新民,數位典藏技術彙編 第一冊,數位典藏國家型科技計畫,第三章5節 頁9,臺北市,2002年版
  34. 〔31〕 吳政叡,「都柏林核心集的發展現況與其在圖書館的應用」,網際網路與圖書館發展研討會論文集(台北市 : 中國圖書館學會,民 88 年 12 月 4 日)頁 113-135 , http://dimes.lins.fju.edu.tw/pub/dc-lib1/dc-lib.htm
  35. 〔33〕 Ora Lassila,Ralph R.Swick , ”Resource Description Framework(RDF) Model and Syntax Specification” , http://www.w3.org/TR/REC-rdf-syntax/ , 22 February 1999
  36. 〔34〕 Ora Lassila,Ralph R.Swick , ”Resource Description Framework(RDF) Model and Syntax Specification” , http://www.w3.org/TR/REC-rdf-syntax/ , 22 February 1999, section 1 , paragraph 4
  37. 〔35〕 RDF Schema , http://www.w3.org/TR/rdf-schema/
  38. 〔36〕 DAML , http://www.daml.org
  39. 〔37〕 RDF/XML , http://www.w3.org/TR/REC-rdf-syntax/
  40. 〔38〕 Notation 3 , http://www.w3.org/2000/10/swap/Primer
  41. 〔39〕 N-Triple , http://www.w3.org/2001/sw/RDFCore/ntriples/
  42. 〔40〕 Kalyanpur Aditya , Jennifer Golbeck , Michael Grove ,Jim Hendler, “An RDF Editor and Portal for the Semantic Web” , Position Papers of the Semantic Authoring , Annotation & Knowledge Markup Workshop at ECAI 2002, Lyon, France , 22-26 July 2002
  43. 〔41〕 XML SPY , http://www.xmlspy.com/
  44. 〔42〕 Sergey Melnik , ”Storing RDF in an relational Database” , December 2001 , http://www-db.stanford.edu/~melnik/rdf/db.html
  45. 〔43〕 Stefan Kokkelink, Roland Schwa"nzl , “Expressing Qualified Dublin Core in RDF/XML” , DCMI proposed recommendation , 14 April 2002 , http://dublincore.org/documents/2002/04/14/dcq-rdf-xml/
  46. 〔44〕 JENA Semantic Web toolkit , http://www.hpl.hp.com/semweb/index.html
  47. 〔45〕 RDFSuite , http://athena.ics.forth.gr:9090/RDF/
  48. 〔46〕 Emmanuel Pietriga , http://www.w3.org/People/Emmanuel/
  49. 〔49〕 RDF Validation Service , http://www.w3.org/RDF/Validator/
  50. 〔50〕 Owl Converter , http://www.mindswap.org/2002/owl.html