透過您的圖書館登入
IP:18.118.227.69
  • 學位論文

基於借閱目的之資料清理機制研究 -以興趣目的為例

A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose

指導教授 : 謝建成
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


研究人員經常使用現實社會中的資料進行研究分析,但這些資料通常存在些許問題,如此將可能降低資料分析的效率,甚至產生錯誤的結果。圖書館經常藉由分析讀者的歷史借閱紀錄作為提供各項服務之依據,但過去在分析前並未考量讀者的借閱目的進行清理。歷史借閱紀錄大多包含一個以上的借閱目的,若在分析前未依借閱目的進行清理,極可能產生錯誤的結果。 本研究透過考量讀者借閱目的中的興趣目的,設計啟發式清理機制,嘗試去除讀者歷史借閱紀錄中的非興趣紀錄,並透過F-Measure評估清理結果,歸納出合適的清理方法與屬性。此外,本研究透過調整各清理機制的參數,嘗試進行個人化清理,以瞭解個人化清理的步驟與流程。 由研究結果可知,讀者的歷史借閱紀錄無法輕易地依據興趣借閱目的進行清理,但可嘗試透過群集分析的E-M演算法,並使用「第三層分類號、借閱日、作者」屬性組合來進行清理。在個人化清理方面,透過調整參數可獲得更佳的清理結果。此外,若使用F-Measure評估清理結果,讀者的原始興趣比越高,其清理難度也越高。

關鍵字

資料清理 書目探勘 F-Measure

並列摘要


Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analyze the patron’s borrowing history in order to adjust and improve its services, but often does not consider the patron’s purpose behind borrowing his or her information from the library. Most patrons have several reasons behind their borrowings, and it is may create erroneous results if we don’t clean it before analyzing. In this paper we analyze the effectiveness of a heuristic data-cleaning approach to remove the areas of non-interest in the patron’s historical loan record. Meanwhile, we also use F-Measure analysis to evaluate the results in order to suggest suitable cleaning methods. In addition, personal cleaning processes for patrons is implemented by adjusting the parameters of the clean-up mechanisms. From the study results, the patron’s borrowing history cannot be easily cleaned based on interest purposes, but you can attempt to clean the data by the E-M algorithm using cluster analysis, and use the properties of third tier classification: number, loan date, and author. Using personal cleaning, it is concluded that adjustments in the parameters could produce more satisfying results. In addition, if use F-Measure, more interesting parts in the patron’s borrowing history, the cleaning process will be more difficult.

並列關鍵字

Data cleaning Bibliomining F-Measure

參考文獻


卜小蝶(2002)。使用者導向之圖書分類關聯分析研究。圖書資訊學刊,17,81-94。
陳垂呈(2008)。利用關聯規則發掘讀者適性化之書籍推薦。圖書與資訊學刊, 65,58-60。
柯皓仁、楊雅雯、吳安琪、戴玉旻、楊維邦 (2002)。個人化及群體化圖書館資訊服務初探。國家圖書館館刊,1,161-195。
謝建成、林湧順(2006)。書目探勘讀者使用圖書館之行為。教育資料與圖書館學,44(1),35-60。
陳垂呈(2005)。利用資料探勘技術發掘圖書館個人化之書籍推薦。教育資料與圖書館學,43(1),87-107。

被引用紀錄


李威毅(2012)。書目探勘資料之清理研究-以問卷資料為例〔碩士論文,國立臺灣師範大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0021-1610201315290782

延伸閱讀