近年來,由於網際網路的普及與知識的爆發,造成數位資訊的激增,面臨這知識經濟的時代,人們無不尋求最佳資訊科技工具,改善資料蒐集及整理方面的問題,以期在知識爆炸的洪流中,確實掌握資訊,謀資料之共享。在資訊檢索科技發展中,本研究擬以「關鍵頁」代理搜尋引擎進行搜尋工作,其中配合乏晰理論進行文章相似度比對,以平行處理增進檢索速度。以對文章進行標誌,輔助擴充與建置繁體中文、簡體中文與英文同義資訊詞彙庫,解決因編碼方式不同與表達方式不同所造成的隔閡。 所以本研究之代理搜尋引擎,透過繁、簡體中文與英文同義資訊詞彙資料庫的輔助,使用者可跨語言與跨編碼方式對繁、簡體中文與英文資料輸入關鍵頁進行含同義資訊的檢索。關鍵頁代理搜尋引擎功能與Web畫面配合應用,兩者相輔相成,確實發揮個人化資訊服務與資料檢索功能整合。
In recent years, the amount of digitized information grows vastly due to the wide spreading of Internet and Web services. In the era of knowledge economy, it is important to make the best use of information. To achieve this, it is essential to utilize the emerging information technologies to solve the problem of information retrieval, information filtering, and information clustering. Traditional search engines retrieve information based on keywords which usually bring back more information than what users needed. In this study, the focus is on the design and implementation of a keypage-based search engine, which attempts to further process the information retrieved and compare them with the users target contents. The information with the higher similarity will eventually be present to the user. In this study, a parallel keypage-based search engine is designed and implemented. Users may take their desired contents, called keypage, as the input to the system. The multi-lingual search engine supports various encoding which includes English, Traditional Chinese (Big5 encoding), Simplified Chinese (GB encoding). In order to expand the search space and solve some representation problems, thesauruses for each encoding are also constructed inside the engine. Parallel processing techniques are used to improve the search performance. Finally, the keypage-based search engine and Web interface are tightly integrated which makes it a great improvement for personal information service and information retrieval.