由於視訊媒介及網際網路的蓬勃發展與盛行,促使數位學習時代加速來臨,透過此一無遠弗屆的數位學習環境,讓所有使用者可在任何地點、任何時間經由視訊媒介,很容易的擷取各式學習資訊,但是如何提供一完善的檢索系統,讓使用者能夠很方便地、很有效地檢索教學視訊串列,以獲得所需學習資訊,是建構一完善數位學習環境不容忽視的課題。本論文提出針對複雜背景教學視訊串列擷取文字之新方法,以利教學視訊串列之文字關鍵字檢索。 因為教學投影片的背景複雜且多樣化,甚至與文字特性相近,所以設計針對教學視訊串列之前景切割法,以擷取前景文字區塊,是本論文的研究主題之一。一般而言,教學視訊串列的解析度偏低,所以如何提升文字品質,以利後續文字辨識,也是本論文的另一重要課題。 首先,針對教學視訊串列進行場景分析,切割出每張投影片對應之場景影像,並整合成一張主影像,使後續處理在主影像上進行,以降低計算時間。之後,對於每張主影像擷取影像分塊特徵,並依照其特徵值進行時間串列分析,以建置投影片的背景圖像,進而據此提取前景層。最後,將提取出的前景層進行文字品質的提升及二值化,以利後續文字辨識。文字辨識的正確性是評估本論文的依據,實驗證明本論文所提方法確實可行且有效。
In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method.