  • Thesis


Design of Computer-Assisted Reading System

Advisor: 謝景棠




This paper proposes a complete system that converts captured document images into a readable file. Document images captured by a camera or scanner often suffer from warping and distortion because of the curvature of bound volumes and complex ambient lighting. These effects reduce not only the readability of the document but also OCR performance. We propose a method that combines non-linear and linear compensation to correct distortions in document images. Before text rectification, page segmentation [19] and text extraction [10] are applied as preprocessing. First, because Otsu binarization yields broken text, an image processing method [20] is used to remove the effect of background light. Second, a dewarping method based on cubic polynomial fitting is proposed to find the best approximation of each text line for rectification in the vertical direction. Third, linear compensation is used for rectification in the horizontal direction. Finally, the system performs text-to-speech on the word or sentence that the user clicks.
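
The cubic fitting step can be illustrated with a minimal sketch. It is not the thesis implementation: the function names (`fit_cubic`, `eval_cubic`, `vertical_shifts`), the use of plain least squares, and the assumption that x coordinates are normalised to [0, 1] are all choices made for this example. Given sample points along one curved text line, it fits y = c0 + c1·x + c2·x² + c3·x³ and computes the per-column vertical shift that would move the fitted curve onto a straight horizontal line.

```python
from typing import List, Sequence, Tuple


def fit_cubic(points: Sequence[Tuple[float, float]]) -> List[float]:
    """Least-squares fit of y = c0 + c1*x + c2*x^2 + c3*x^3.

    Returns [c0, c1, c2, c3]. Needs at least four distinct x values.
    Assumption for this sketch: x is normalised to [0, 1] so that the
    normal equations stay well conditioned.
    """
    n = 4
    # Build the normal equations (A^T A) c = A^T y for the basis 1, x, x^2, x^3.
    ata = [[0.0] * n for _ in range(n)]
    aty = [0.0] * n
    for x, y in points:
        powers = [x ** k for k in range(n)]
        for i in range(n):
            aty[i] += powers[i] * y
            for j in range(n):
                ata[i][j] += powers[i] * powers[j]

    # Solve the 4x4 system by Gaussian elimination with partial pivoting.
    aug = [row + [rhs] for row, rhs in zip(ata, aty)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("degenerate fit: need four distinct x values")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]

    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (aug[i][n] - tail) / aug[i][i]
    return coeffs


def eval_cubic(coeffs: Sequence[float], x: float) -> float:
    """Evaluate the fitted cubic at x (Horner form)."""
    c0, c1, c2, c3 = coeffs
    return c0 + x * (c1 + x * (c2 + x * c3))


def vertical_shifts(coeffs: Sequence[float], xs: Sequence[float],
                    target_y: float) -> List[float]:
    """Shift per column that moves the fitted curve onto y = target_y."""
    return [target_y - eval_cubic(coeffs, x) for x in xs]


if __name__ == "__main__":
    # Hypothetical baseline samples (x normalised by page width, y in pixels).
    samples = [(0.0, 100.0), (0.25, 103.2), (0.5, 103.1), (0.75, 100.3), (1.0, 95.0)]
    c = fit_cubic(samples)
    print(vertical_shifts(c, [x for x, _ in samples], target_y=100.0))
```

In this sketch each pixel column of a text line would be moved by its own shift, which corresponds to the non-linear vertical compensation; the horizontal linear compensation is a separate step and is not shown.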


Keywords: document image, page segmentation, warping, text extraction


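[16] 王蕙君, “Real-Time Bidirectional People Counting System Based on Kinect,” Master's thesis, Department of Electrical Engineering, Tamkang University, 2012.
[17] 郭泰谷, “Implementation of Markerless Augmented Reality: Design of a Touch Human-Machine Interface Using Kinect,” Master's thesis, Department of Electrical Engineering, Tamkang University, 2012.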
[8] Z. Zhang and C. L. Tan, “Correcting document image warping based on regression of curved text lines,” in Int. Conf. on Document Analysis and Recognition (ICDAR), 2003, pp. 589-593.
[10] N. Stamatopoulos, B. Gatos, I. Pratikakis, and S. J. Perantonis, “Goal-Oriented Rectification of Camera-Based Document Images,” IEEE Trans. on Image Processing, vol. 20, no. 4, Sept. 27, 2010, pp. 910-920.
[11] L. Zhang, Y. Zhang, and C. L. Tan, “An improved physically-based method for geometric restoration of distorted document images,” IEEE Trans. on Pattern Anal. Mach. Intell., vol. 30, no. 4, Apr. 4, 2008, pp. 728-734.
