基於霍夫轉換之複雜名片文字行擷取

由於手持照相機的影像受到光源不均、投影扭曲和震動等外界干擾影響，圖像品質較掃描機所生成的影像為低，加上名片的設計也愈來愈多元，這些都是不利於光學字元辨識（optical character recognition）的因素。本研究目標專注於減少外界因素和名片設計本身的影響，取出名片內的文字部分，分析名片文字行的排列角度並準確切割出文字行。本研究為一名片影像分析之系統設計，藉由文字偵測和文字行的切割，擷取出單行文字影像。包括三大部份：第一部份為前處理，偵測出名片的文字部份；第二部份為名片文字行方向分析，採用Hough transform當基底，修改成針對特定區域檢測的方式，在名片中同時存在垂直或水平兩種排列方式的文字區塊時，偵測出不同區塊的文字行方向；第三部份為文字行建構，使用第二步驟得到的資訊，由下而上(bottom-up) 擷取完整文字行，最後將得到的文字行影像輸出。實驗結果以三種OCR（optical character recognition）軟體為例，辨識率增進程度依序為67.87%增為87.52%，其次為62.91%增為72.84%，最後為28.74%增為77.06%，數據證明本研究擷取文字行的方法有效增加OCR軟體的辨識度。

關鍵字

文字偵測；文字行建構；霍夫轉換

並列摘要

Due to the development of cell phones with cameras, it is convenient to take pictures and capture business card images. Optical character recognition (OCR) is a very mature technique. The key issue is how to improve camera-based document image analysis and extract text information for OCR systems. Our research includes three major parts. The first part would be preprocessing which will detect characters in the business card. The second part would be layout analysis, here we modify Hough transform and apply it to the specified regions to detect text lines angle. The last part would be text line construction. Several text lines will be developed though the bottom up approach. We propose a system designed for Chinese business cards image analysis. By way of detecting characters and separating text lines, we can fetch some semantic consistent text lines. As the experimental results shows, our design can enhance the recognition rate of commercial OCR software when the business cards suffer from complex background, highlight regions or complex design problems.

並列關鍵字

text detection ； text-line construction ； Hough transform

參考文獻

[1] The Stanford Mobile Visual Search Dataset http://web.cs.wpi.edu/~claypool/mmsys-dataset/2011/stanford/

[4] H.-K. Kim, "Efficient automatic text location method and content-based indexing and structuring of video database," Journal of Visual Communication and Image Representation, vol. 7, pp. 336-344, 1996.

[6] Y. M. Y. Hasan and L. J. Karam, "Morphological text extraction from images," Ieee Transactions on Image Processing, vol. 9, pp. 1978-1983, Nov 2000.

[8] J. Canny, "A computational approach to edge detection," Pattern Analysis and Machine Intelligence, IEEE Transactions on, pp. 679-698, 1986.

[9] V. Wu, R. Manmatha, and E. M. Riseman, "Textfinder: An automatic system to detect and recognize text in images," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 21, pp. 1224-1229, 1999.

國際替代計量

基於霍夫轉換之複雜名片文字行擷取

主題瀏覽