企業運用網路傳遞資料以節省成本及提高效率,而被傳遞的資料是否能夠正確的被接收方所解析將會是資料交換中最重要的問題。綱要(Schema)用以描述資料的結構及名稱,因此資料交換存在著綱要對映的問題。本研究以XML(eXtensible Markup Language)文件作為資料轉換的格式,為了簡化資料交換的複雜度,首先對綱要元素進行語意配對(Linguistic matching),以通用字及詞庫來幫助使用者對接收到的異質綱要與企業內資料庫綱要名稱進行配對並找出同義的部分。接著運用XSLT(Extensible Stylesheet Language Transformation)進行綱要結構的轉換,依照企業的資料綱要將接收的資料進行結構的轉換並儲存至資料庫。其中本研究考慮了資料交換標準的格式,並且討論綱要元素間的對映關係如何影響資料交換的複雜程度,依此設計與建置XML文件對映及轉換的雛型系統,透過此系統來將資料轉換並儲存至資料庫。藉著將依循標準所訂定的資料或使用自訂格式的資料轉以XML文件表示,並以此系統將XML文件進行對映及轉換,將接收到的資料回存至資料庫,解決異質資料交換中格式不一致的問題。本研究以XML為資料傳遞格式,所提出的轉換架構可應用於XML-enabled的電子化環境,讓外部資訊能藉由綱要配對與轉換來完成資料交換,並保持資料的完整性,以及提高資訊的再利用性。
Data exchange between information systems of different enterprises needs to fulfill requirements of both schema integration and message translation. Since information system with relational database system are developed independently, schema conflicts between databases and exchanged data is a common problem for schema integration. Thus, enterprises need to preserve the data integrity of transacting message with respect to the database of information systems. The core technology of data exchange is matching XML documents to relational database schemas. An interactive tool was designed to simplify the integration and implementation efforts of data exchange between database systems. The matching methods of this interactive tool provided mapping suggestions, and the system relied on users with specific schema knowledge to provide final verification of the matching results. XML schema and XSL transformation were applied to translate message and solve schema conflicts between XML documents and the database system. The interactive tool used repository of synonyms and common words dictionary as references to match XML documents schemas and target database schemas. According to the matching results, XSLT would translate the element names of the XML documents to the database schemas. XSLT transformations were also applied to transform schema structures of the documents to comply with the database schema structures. This tool provided an environment for enterprises to exchange data and it is a prototype of solving semantic conflict problems in data integration.