
基於馬可夫隨機場之表格文件擷取系統

Form Document Extraction System Based on Markov Random Field

Advisor: 郭天穎

Abstract


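In recent years, with the arrival of the digital era, many traditional newspapers, magazines, and paper documents have gradually been digitized for presentation and preservation. However, digital image formats are numerous and varied, and as software and hardware are retired and replaced, digital data stored in old image formats can become inaccessible. Format migration techniques have been developed to address this problem. Our previously proposed intelligent image classification system analyzes and classifies image content to select the best format for migration. In this thesis, we propose a form document extraction system based on a Markov random field that analyzes the content of form documents and removes their form features, and we integrate it into our previously proposed intelligent image classification system to improve its classification performance. The implementation results show that the proposed form document extraction system not only removes form features effectively but, when combined with the intelligent image classification system, also improves that system's classification accuracy across digital images overall.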

Parallel Abstract


With the arrival of the digital era, large numbers of traditional newspapers, magazines, and documents are being digitized for archiving. However, there are many digital image formats, and data can be lost when old formats become unreadable after software or hardware is updated. Much work has therefore adopted format migration to solve this problem. We previously proposed an intelligent image classification system that decides the best format for migration by analyzing and classifying image contents. In this thesis, we propose a form document extraction system based on a Markov random field, which analyzes form document contents and removes the form features. We integrate this form document extraction system into our intelligent image classification system to improve the classification performance. Experimental results show that our form document extraction system is effective at extracting the form features and improves overall image classification accuracy when combined with the intelligent image classification system.


