透過您的圖書館登入
IP:3.139.239.118
  • 期刊

一個以Ontology為基礎的Web-Mining技術應用於供應鏈競爭分析之研究

Research on an Ontology-based Web-Mining Technique for Supply-Chain Competitive Analysis

摘要


本研究的目的是開發一個以ontology為主的產業供應鏈知識探勘平台,並提出一種利用文件探勘(text mining)與XML文件技術整合的方法來提供企業在供應鏈競爭分析上的應用;為測試所開發的文件探勘演算法,本研究在文件探勘的對象是與產業活動相關的新聞語料庫為主,並以RosettaNet作為系統的重要內容來源,以進行系統模型架構的實現及個案研究。根據這個探勘演算法,應用網路探勘(Web Mining)的技術從大量半結構性(semi-structured)以及非結構性(unstructured)的網頁文件中萃取出文件內容中相關的概念與知識,以產生用來描述文件的資訊(i.e. metadata)與階層式的知識分類目錄架構,再將原有網頁轉化為XML文件存於這個分類演算法的On-tology架構為基礎的XML文件庫中。本研究將文件探勘的技術應用於分類目錄的自動建立與維護,並開發一個能達到自動知識分類目的之XML文件資料庫系統。本研究主要採用的研究是以Web content mining的方法為主,亦即以文件探勘(Text mining)技術針對存在於WWW網頁中的文件资訊內容加以分析處理,並運用類神經網路機器學習的技術來實現。

並列摘要


In this research we propose a novel approach to develop a platform for discovering supply-chain competitive analysis on an ontology-based web-mining technique. Also, by integrating a text mining approach with a XML document technique, in the developed platform we provide a way to allow businesses tackle difficulties in knowledge management for the supply-chain related information. To testify the developed web-mining algorithm, in this research a corpus associated with industrial information collected from specific news web sites (e.g. CNA News), with the RosettaNet standard framework, is employed as the major information source for conducting system implementation and case study. By applying the developed web-mining algorithm, in this work we attempt to extract concepts and knowledge from a huge semi-structured and unstructured HTML-document collections. The extracted concepts and knowledge can then be used to produce metadata and ontology to describe the contents in the original web documents. As such, the original web documents can be transformed into XML documents and stored in the XML document database based upon the ontology based ”knowledge template”. The research applies a text-mining approach to automating the construction and maintenance of a concept-hierarchy, in order to establish a XML document database based on the extracted metadata and ontology. The approach for knowledge extraction in this research is mainly using a Web-content mining method. That is, the existing WWW pages can be analyzed to generate a set of metadata to describe their content and produce an ontology for the XML document database through a text-mining technique, incorporated with a neural-net machine learning method for implementation.

參考文獻


李俊宏、吳志鴻()。
李俊宏、吳志鴻()。
Benjamins, V. R.(1999).(KA)2: building ontologies for the internet: a mid-term report.International Journal of Human-Computer Studies.51(3),687-721.
Braga, R. M. M.,C. M. L. Werner,M. Mattoso(2000).Proceedings of the 11th International Workshop on Database and Expert Systems Applications.London. UK:
Chakrabarti, S.,B. E. Dom,S. R. Kumar,P. Raghavan,S. Rajagopalan,A. Tomkins,D. Gibson,J. Kleinberg(1999).Mining the Web`s link structure.IEEE Computer.32(8),60-67.

被引用紀錄


張榮祥(2015)。學術研討會資訊推薦系統建置〔碩士論文,國立交通大學〕。華藝線上圖書館。https://doi.org/10.6842/NCTU.2015.00633

延伸閱讀