網際網路搜尋過濾系統  [一個「關鍵頁」超搜尋智慧型代理引擎]

網際網路上的電子文件數量極為龐大，如何快速有效的進行網際網路資訊檢索以獲得多且高度相關的網頁，已是一項重要的研究課題。而目前網路上的資訊檢索系統皆是以「關鍵字/詞」來進行檢索，所回應獲得之網頁位址是數以千計，其數量之驚人已達到「資訊過量」之負擔，以致需耗費使用者大量的時間來逐一檢視網頁，但是，因為單憑「關鍵字/詞」所獲得的網頁位址，其中大部份的網頁和使用者所需求的資訊之相關度極低，甚至是毫無相關-「檢索失敗」，尤以前者「資訊過量」問題比後者之「檢索失敗」更為嚴重。以一個傳統的網路資源檢索系統，所獲得的資源資訊可知是如何的繁多，單憑使用者本能的過濾功能，縱使耗費巨量的時間，亦是不足以應付如此巨量的資訊。因此本研究計劃由使用者提供一篇「關鍵頁 (文件或網頁位址)」，整合乏晰理論、資訊檢索、資訊過濾、超搜尋、智慧型代理引擎、平行處理、三層式架構等相關技術及理論，建構一個「關鍵頁」超搜尋智慧型代理引擎，以協助使用者來獲得與其資訊需求相關度高的網頁。實際測試的結果顯示，本研究雛形系統之資訊過濾功能具有相當的成效。

關鍵字

關鍵頁；資訊過濾；超搜尋；智慧型代理引擎；樣幟比對；平行處理

並列摘要

It has become a critical issue to effectively retrieve useful information from the Internet as the electronic documents available online has grown drastically fast during the last few years. Most of the currently available information retrieval systems on the Internet are designed to search desired information using keywords. The number of resulting documents provided by these systems are usually more than what is needed since most of them are not highly related, or even irrelevant to the user''''s needs. It creates the so-called "information overload" or "search failure" problems. Due to many of the powerful search engines available, it is more likely to encounter the problem of "information overload". In this study, a keypage-based intelligent meta-search agent is proposed which helps users to easily locate webpages that are more likely fit the users'''' queries. The user may simply provide a web address or an electronic document as the "keypage", the proposed system will then try to locate the candidate webpages and then provide a matching degree for all the candidate pages. Some preliminary results have shown that this system can greatly help users in finding their desired information in a more effective fashion. The proposed system integrates techniques such as Fuzzy theory, SimNet, Parallel Processing and Three-Tier architecture.

並列關鍵字

Keypage ； Information Filtering ； Meta-Search ； Intelligent Agent ； Pattern Matching ； Parallel Processing

參考文獻

[7] Lee-Feng Chien and Hsiao-Tieh Pu, "Important issues on Chinese Information Retrieval," Computational Linguistics and Chinese Language Processing 1 (Aug. 1996): 205-221.

[14] Lee-Feng Chien, et al., "尋易"(Csmart)-A High-performance Chinese Document Retrieval System. Proceedings of the 1995 Int. Conf. On Computer Processing of Oriental Languages, Hawaii, USA, Nov. 1995.

[1] Edmund F. Santa Vicca, "The Internet as a Reference and Research Tool: a Model for Educators", The Reference Librarian 41/42(1994):p228.

[4] Belkin N.j. and Croft W.B., "Information Filtering and Information Retrieval: Two Side of the Same Coin" , Communication of ACM, December, 1992

[15] Lee-Feng Chien, "PAT-Tree Based Keyword Extraction for Chinese Information Retrieval" ACM SIGIR(1997)

被引用紀錄

陳秋蓮（2006）。以無線網路為基礎之定位研究及其應用〔碩士論文，元智大學〕。華藝線上圖書館。https://doi.org/10.6838/YZU.2006.00205

蔡明志（2000）。神經網路應用於字元的不變性辨識〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611362863

謝超煒（2000）。網際網路資訊擷取過濾系統─中文關鍵頁超搜尋代理人〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611310685

邱顯正（2000）。網際網路上產品資訊擷取代理人雛形之設計與建置〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611313388

張如瑩（2001）。多語系平行關鍵頁搜尋引擎之設計與建構〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-0112200611302285

國際替代計量

網際網路搜尋過濾系統 [一個「關鍵頁」超搜尋智慧型代理引擎]

主題瀏覽