古地契是研究臺灣歷史上土地開發及社會經濟活動的第一手資料,而同筆土地在不同時間關於土地權利的移轉、典賣、鬮分等行為使地契之間產生如上下手契、鬮分契多份的關係,這些地契之間的關係是利用古地契研究土地發展的重要依據。目前「台灣歷史數位圖書館」(THDL)共蒐集了30285件清代及日治時期的古契書,這些契書由不同的單位所數位化,並分布在34個大小不一的文件集,若想單靠人力來重建地契之間的關係是相當困難的。 因此本研究提出一個自動化的方法來幫助重建古地契之間上下手契、原契與契尾、鬮分契多份、契書內容相同四種關係。首先從契書的詮釋資料與全文中擷取契書特徵,利用已有的契書分類與關係人角色對應方法並加以修正。接著整理出每種契書關係須滿足的特徵條件,再根據所整理出的特徵條件,配合契書特性使用特徵模糊比對,兩兩比對THDL裡所有契書,並經人工檢查,最後找出上下手契2409對、原契與契尾92對、鬮分契多份878組、契書內容相同531組,其中包含許多跨文件集人力不易發現的契書關係。另外,我們也利用「神岡 : 筱雲呂玉慶堂典藏古文書集」所包含已經過人工整理較完整的契書關係來檢視重建方法的回收率。 將這些重建的契書關係都連結起來,可以幫助我們觀察土地發展的脈絡,有助於研究臺灣歷史上從土地關係所衍伸出的經濟社會等相關議題,而這些契書關係也都已加入THDL的檢索系統可供歷史研究者使用。
During the dominant time of the Ching Dynasty and Japan (1683-1945), the development and operation of lands was the main social and economic activity in Taiwan. Consequently, there are a large number of land deeds leaved, which are contracted by local resident in private. These land deeds in that time was the only proof of land ownership and today they become vital material to study the development history of Taiwan. The acquisition, transfer and division of lands over time have brought about the relationships among the land deeds. Using these relationships, we can better make use of the land deeds which are in big quantity. However, these land deeds scattered are collected and digitalized by different organizations into many corpuses. It’s very hard to reconstruct the relationships only depending on the manpower. So in this thesis, we propose an automatic method to reconstruct the relationships. We first extract features such as related person, contracted time, price, …etc, from metadata and full-text of land deeds and unify the category and person role of deeds. Second, we define conditions each relationship should meet based on the features and define fuzzy comparison methods of features. Finally, using the feature conditions, we design an algorithm to efficiently compare each pair of land deeds to find the relationships. As a result, in totally 30285 land deeds, we find “original deed and deed from the previous owner” 2409 pairs, “original deed and its government receipt of tax payment” 92 pairs, “allotment agreements” 878 sets and “same deeds” 531 sets. These relationships reconstructed have been accessible in THDL (Taiwan History Digital Library) to assist historian in Taiwanese land deeds research.