高速公路上之道路牌文字偵測

自動地對影片中的文字做偵測，對於影片的檢索和瞭解是不可或缺的工作。而本論文主要是針對輔助駕駛系統做應用，利用自動對高速公路牌（指示牌、速限牌）上的文字做偵測，提供駕駛者在高速公路上的導航，例如：現在駕駛者所處的位置、方向、現在應該保持的速度。讓駕駛者可以專心地開車，而不會因為為了看道路上的路牌而造成駕駛不當，或者因為超速而造成車禍；且為了響應2008年北京奧運，在未來工作方面，除了對高速公路牌上的文字自動辨識之外，可以進階地將所攝影機擷取到的文字，自動地翻譯成各國語言，使各國來的人士能容易地了解各個路牌所代表的意思。本論文提出一個對擷取道路牌上強而有效的新特徵，使用顏色的資訊來對高速公路上的道路牌做定位，並結合邊的資訊，自動偵測道路牌上的文字。主要的架構可分為三個，第一個步驟，利用顏色和不同物質具有不同傳導係數的特性，配合類神經網路的訓練，將路牌和其他的物質分開；第二個步驟，加入仿射（affine）矯正，將照相機在不適當的拍攝角度，所拍得的變形道路牌復原回來，使道路牌上的文字正對著相機，增加框選文字的正確性；第三個步驟，利用 Canny 邊緣偵測來取得邊的資訊，藉此在每一個道路牌上框出文字的候選區。在實驗的部分，我們希望本論文提出的方法能適用在大多的情況下，所以從多段影片之中擷取出20段影片，其中包括了晴天、多雲和筆直的或彎曲的道路情況。在道路牌定位的部份，檢出率(recall)和精確率(precision)分別為 91.1%及 80.8%，在文字偵測上則是 93.6% 和 88.0%。

關鍵字

道路牌定位；文字偵測；仿射校正

並列摘要

With the advancement of scientific technologies, the cost of digital camera decreases rapidly. It is a trend to improve and uplift the living quality of people using image processing techniques. Automatic detection of text from video is one of the applications which is an essential task for understanding and indexing of video. In this thesis, a driver assistant system is designed by automatic detection of text on road sign (guid signs and limit signs) on highway to provide drivers information for navigation, such as location, direction, and speed limit. It may also alleviate the load of driver who may lose his/her attention looking at road signs while focusing on driving. For 2008 Olympic in Beijing, there will be many foreigner visiting China and not all of them understand Chinese language. Hence, the translation of text on road sign is another goal that can be accomplished. In this thesis, a set of feature is devised to detect road signs. The proposed system consists of three modules. The first module finds the constituting colors of road signs using the color transform model and locates road sign candidates. In the second module, affine transformation is performed to restore road signs which are captured by camera in different positions to let every road sign seems to be vertical to the camera optical axis. Moreover, affine transformation can improve the accuracy in detecting texts embedded in road signs. As to the third module, it performs the task of detecting texts on road signs. The method we adopt is canny edge detector to obtain clearer edge information. Experiments were conducted on a variety of situations. 20 video sequences (sunny*10 and cloudy*10) including light variations and straight or cursive road conditions were tested to verify the validity of the proposed method. The recall and precision rates in locating road sign are 91.1% and 80.8%, respectively. The recall and precision rates in detection text are 93.6% and 88.0%.

並列關鍵字

text detection ； sign location ； affine

參考文獻

[1] N. Otsu,“A threshold selection method from gray level histogram,”IEEE Trans. On Systems, Man, and Cybernetics, SMC-8, pp.62-66, 1978.

[3] K. Jung, K. In Kim and A. K. Jain, “Text information extraction in images and video: a survey,” Pattern Recognition, vol.37 pp.977-997, 2004.

[4] R. Lienhart and A. Wernicke, “Localizing and segmenting text in images, videos and web pages,” IEEE Trans. Circuits Syst. Video Technol., vol.12, no. 4, pp. 256-268, Apr. 2002.

[5] M. R. Lyu, J. Song and M. Cai, “A comprehensive method for multilingual video text detection, localization, and extraction,” IEEE Trans. Circuits Syst. Video Technol., vol.15, no. 2, Feb. 2005.

[6] J. P. Peters, C. Thillou and S. Ferreira, “Embedded reading device for blind people： a User-Centered Design,” in Proceeding of Imagery Pattern Recognition Workshop , 1550-529/04, 2004.

被引用紀錄

王宗任（2009）。交通標誌偵測與辨識〔碩士論文，淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2009.00570

李建勳（2010）。應用DSP實現交通限速標誌的偵測與辨識〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2010.00323

朱建興（2009）。運用H.264預測模式作畫面內的錯誤隱藏〔碩士論文，中原大學〕。華藝線上圖書館。https://doi.org/10.6840/CYCU.2009.00864

Yen, C. C. (2009). 利用倒傳遞網路搭配透視轉換不變性之廣義霍夫轉換做路標的偵測與定位 [master's thesis, National Taipei University of Technology]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0006-2108200919193900

國際替代計量

高速公路上之道路牌文字偵測

未授權

主題瀏覽