簡易檢索 / 詳目顯示

研究生: 顏百璋
論文名稱: 於教學影片上進行索引和教學重點探勘之研究
Indexing and Teaching Focus Mining of Lecture Video
指導教授: 李忠謀
學位類別: 碩士
Master
系所名稱: 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2009
畢業學年度: 97
語文別: 中文
論文頁數: 49
中文關鍵詞: 教學影片片段變化偵測遮蔽物體偵測
英文關鍵詞: lecture video, shot change detection, occlusion object detection
論文種類: 學術論文
相關次數: 點閱:132下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本研究提出一個適用於教學影片結構化與分析的方法,來提供使用者有效率地取得教學上的內容與知識,在不使用color-based 或 histogram-based的方法下,本研究提出一個edge-based的片段變化偵測演算法,來建立投影片切換時的結構,且可成功抵禦來自光線的影響,以及當影像中出現人或其他物體遮蔽投影片內容時,也可免於錯誤偵測的發生;當片段偵測階段完畢,本研究更進一步利用影片中講者的手勢以及聲音的資訊,來找尋講者欲強調內容在影像上的位置,分析探勘出教學重點,讓使用者更方便的學習。

    We present a lecture video structuring and analysis scheme to provide students an efficient way to access the lecture content. Instead of using color-based or histogram-based methodologies, we propose a new edge-based shot change detection algorithm to accurately rebuild the slide structure. The proposed approach can successfully resist the unwanted influences induced from the variant illumination condition and occlusions. Furthermore, the teaching focus is analyzed according to the location which the instructor points and the audio information so that this system becomes more useful for learning.

    附表目錄 III 附圖目錄 IV 第一章 緒論 1 1.1研究動機 1 1.2研究目的 1 1.3研究範圍 2 1.3.1 教學影片 2 1.3.2 階層式影片 3 1.3.3 換頁片段影像 4 1.3.4 研究限制 5 1.4論文架構 5 第二章 文獻探討 6 2.1 視訊切割技術探討 6 2.1.1 像素的比較 6 2.1.2 以區塊為基礎的比較 7 2.1.3 直方圖的比較 7 2.1.4 以叢集為基礎的比較 7 2.1.5 以模型導向的比較 8 2.1.6 以特徵為基礎的比較 9 2.1.7 整合及比較多種技術 10 第三章 研究方法 12 3.1 研究目標 12 3.2 系統流程 13 3.3 前處理 14 3.4 遮蔽物偵測 15 3.5 片段變化偵測 19 3.6 教學重點探勘 22 3.6.1 手勢偵測 22 3.6.2 版面結構分析 26 3.6.3 音訊分析 27 3.6.4 事件標記 28 第四章 實驗結果 29 4.1 實驗流程說明 29 4.2 實驗評估方式 29 4.3 實驗影片之環境說明 29 4.4 實驗結果與分析 34 第五章 結論 42 5.1 結論 42 5.2 未來研究 43 參考文獻 44 附錄ㄧ 48

    [1] S. Ammouri, and G. A. Bilodeau, “Face and Hands Detection and Tracking Applied to the Monitoring of Medication Intake,” Canadian Conference on Computer and Robot Vision, pp. 147-154, Canadian, May 2008.

    [2] C. Cotsaces, N. Nikolaidis, and I. Pitas, “Video Shot Detection and Condensed Representation a review,” IEEE Signal Processing Magazine, vol. 23, no. 2, pp. 28-37, Mar. 2006.

    [3] H. Fang, J. Jiang, and Y. Feng, “A Fuzzy Logic Approach for Detection of Video Shot Boundaries,” Pattern Recognition, vol. 39, no. 11, pp. 2092-2100, Nov. 2006.

    [4] A. M. Ferman, A. M. Tekalp, and R. Mehrotra, “Robust Color Histogram Descriptors for Video Segment Retrieval and Identification,” IEEE Trans. On Image Processing, vol. 11, no. 5, pp. 497-508, May 2002.

    [5] C. Fredembach, M. Schroder, and S. Susstrunk, “Eigenregions for Image Classification,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 26, no. 12, pp. 1645-1649, Dec. 2004.

    [6] X. Gao, and X. Tang, “Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 12, no. 9, pp.765-776, Sept. 2002.

    [7] U. Gargi, R. Kasturi, and S. H. Strayer, “Performance Characterization of Video-Shot-Change Detection Methods,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 10, no. 1, pp.1-13, Feb. 2000.

    [8] Y. Gong, “An Accurate and Robust Method for Detecting Video Shot Boundaries,” Proceedings of IEEE International Conference on Multimedia Computing and Systems, vol. 1, pp. 850-854, July 1999.

    [9] R. C. Gonzalez, R. E. Woods, “Digital Image Processing,” Prentice-Hall second edition, 2002.

    [10] C. Grana, and R. Cucchiara, “Linear Transition Detection as a Unified Shot Detection Approach,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 17,no. 4, pp. 483-489, Apr. 2007.

    [11] J. Ha, R. M. Haralick, and I. T. Phillips, “Recursive X-Y Cut Using Bounding Boxes of Connected Components,” Proceedings of the Third International Conference on Document Analysis and Recognition, vol. 2, pp. 952-955, Aug. 1995.

    [12] O. Ikeda, “Estimation of Speaking Speed for Faster Face Detection in Video-Footage,” International Conference on Multimedia and Expo, pp. 442-445, July 2005.

    [13] T. Kikukawa, and S. Kawafuchi, “Development of An Automatic Summary Editing System for the Audio Visual Resources,” IEICE Trans., vol. J75-A, no. 2, pp. 204-212, 1992.

    [14] R.A. Kirsch, “Computer Determination of the Constituent Structure of Biological Images,” Computers in Biomedical Research, vol. 4, pp. 315-328, 1971.

    [15] I. Koprinska, and S. Carrato, “Temporal Video Segmentation: A Survey,” Signal Processing: Image Communication, vol. 16, pp. 477-500, Jan. 2001.

    [16] C. M. Li, Y. S. Li, S. H. Wang, and X. Q. Zhang, “Moving Human Body Detection in Video Sequences,” Proceedings of the Sixth International Conference on Machine Learning and Cybernetics, vol. 4, pp. 2188-2192, Aug. 2007.

    [17] L. Liang, Y. Liu, H. Lu, X. Xue, and Y. P. Tan, “Enhanced Shot Boundary Detection Using Video Text Information,” IEEE Trans. on Consumer Electronics, vol. 51, no. 2, pp. 580-588, May 2005.

    [18] H. C. Liu, and G. Zick, “Automatic Determination of Scene Changes in MPEG Compressed Video,” IEEE International Symposium on Circuits and Systems, vol. 1, pp. 764-767, May 1995.

    [19] A. Nagasaka and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Proceeding of IFIP Second Workshop Conf. on Visual Database System II, Budapest, Hunary, pp.113-127, 1992.

    [20] W. Niblack, “An Introduction to Image Processing,” Prentice-Hall, Englewood Cliffs, NJ, pp. 115-116, 1986.

    [21] N. Otsu, “A Threshold Selection Method from Gray-Level Histogram,” IEEE Trans. on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62-66, Jan. 1979.

    [22] T. Peng, K. Zhao, and B. Li, “Video Abrupt Transition Detection Based on K-L Transform,” IEEE International Conference on Image and Graphics, pp. 845-848, Aug. 2007.

    [23] M. Piccardi, “Background Subtraction Techniques: a Review,” IEEE International Conference on Systems, Man and Cybernetics, vol. 4, pp. 3099-3104, 2004.

    [24] S. Salvador, and P. Chan, “Toward Accurate Dynamic Time Warping in Linear Time and Space,” Intelligent Data Analysis, vol. 11, pp. 561-580, Oct.2007.

    [25] B. Shahraray, “Scene Change Detection and Content-based Sampling of Video Sequences,” Proceeding of IS&T/SPIE conference on Digital Video Compression:Algorithms and Technologies, vol. 2419, pp. 2-13, 1995.

    [26] K. W. Sze, K. M. Lam, and G. Qiu, “A New Key Frame Representation for Video Segment Retrieval,” IEEE Trans. on Circuits and Systems for Video Technology, vol.15, no. 9, pp. 1148-1155, Sept. 2005.

    [27] K. W. Sze, K. M. Lam, and G. Qiu, “An Optimal Key Frame Representation for Video Shot Retrieval,” Proceedings of IEEE International Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 270-273, Oct. 2004.

    [28] F. Wang , C. W. Ngo ,and T. C. Pong, “Structuring Low-Quality Videotaped Lectures for Cross-Reference Browsing by Video Text Analysis,” Pattern Recognition, vol. 41, no. 10, pp. 3257-3269, Oct. 2008.

    [29] X. Yi, and N. Ling, “Fast Pixel-Based Video Scene Change Detection,” in Proceeding IEEE Int. Symp. on Circuits and Systems, pp. 3443-3446, May 2005.

    [30] J. Yuan, H. Wang, L. Xiao, W. Zheng, J. Li, F. Lin, and B. Zhang, “A Formal Study of Shot Boundary Detection,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 17, no. 2, pp.168-186, Feb. 2007.

    [31] R. Zabith, J. Miler, and K. Mai, “A Feature-based Algorithm for Detecting and Classifying Production Effects,” ACM Journal of Multimedia Systems, vol. 7, no. 2, pp.119-128, 1999.

    [32] H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, “Automatic Partitioning of Full-motion Video,” ACM Journal of Multimedia Systems, vol.1, no. 1, pp. 10-28, 1993.

    [33] Y. Zhuangt, Y. Rui, T. S. Huang, and S. Mehrotra, “Adaptive Key Frame Extraction using Unsupervised Clustering,” Proceeding of IEEE International Conference on Image Processing, vol. 1, pp. 866-870, Oct. 1998.

    [34] 王小川, “語音訊號處理,” 全華科技圖書股份有限公司, 2004

    下載圖示
    QR CODE