The goal of this work is to develop an automated lecture recording system that uses a Kinect to analyze the speaker's behavior, a PTZ camera to imitate the shots of a human cameraman, and network messaging to provide the communication between the cameraman and the director. To locate the speaker in the PTZ camera's view, the system detects the speaker's face with the AdaBoost algorithm. The detected region is then tracked with the mean shift algorithm, so that the PTZ camera continuously knows the position of the speaker's face. In addition, the Kinect's depth images, combined with Gaussian mixture models, are used to recognize the speaker's hand postures, and its color images are used to detect whether the speaker is holding a laser pointer or a pointing stick. To present the most suitable view to the audience, we define a set of camera action rules. After the system integrates all the information about the speaker, the PTZ camera automatically frames its shots according to these rules and sends a message to the director. Given this message, the director applies its own shot selection rules to decide whether the view from the speaker's PTZ camera should be shown to the audience.
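The mean shift tracking step described above can be sketched as follows. This is a minimal illustration, not the thesis's implementation: it assumes the face detector has already produced an initial window and that each frame has been converted into a per-pixel weight image (e.g., a color-histogram back-projection of the detected face region); the function then repeatedly moves the window to the weighted centroid of the pixels it covers until it stops shifting.

```python
import numpy as np

def mean_shift(weights, window, max_iter=20):
    """Shift a tracking window toward the mode of a weight image.

    weights : 2-D array of per-pixel likelihoods (e.g., a histogram
              back-projection of the detected face region).
    window  : (x, y, w, h) current face window.
    Returns the converged (x, y, w, h) window.
    """
    x, y, w, h = window
    H, W = weights.shape
    for _ in range(max_iter):
        patch = weights[y:y + h, x:x + w]
        total = patch.sum()
        if total == 0:  # target lost: leave the window where it is
            break
        ys, xs = np.mgrid[y:y + h, x:x + w]
        cx = (xs * patch).sum() / total  # weighted centroid inside window
        cy = (ys * patch).sum() / total
        nx = int(round(np.clip(cx - w / 2, 0, W - w)))
        ny = int(round(np.clip(cy - h / 2, 0, H - h)))
        if nx == x and ny == y:          # converged
            break
        x, y = nx, ny
    return x, y, w, h

# Synthetic demo: a bright disk stands in for the face back-projection.
weights = np.zeros((200, 200), dtype=float)
yy, xx = np.mgrid[0:200, 0:200]
weights[(xx - 115) ** 2 + (yy - 115) ** 2 <= 25 ** 2] = 1.0

# Start the window off-center; mean shift pulls it onto the disk.
tracked = mean_shift(weights, (80, 80, 40, 40))
```

In the full system, the window returned for each frame would drive the PTZ camera's pan/tilt commands so the face stays framed.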
We present a virtual cameraman system, a component of an automated lecture recording system. The proposed system consists of a Kinect device and a PTZ camera, which play the roles of the cameraman and his or her camera, respectively. The Kinect device comprises a color camera and a depth sensor. First, the PTZ camera locates the speaker using the AdaBoost algorithm, and the speaker is then tracked by the mean shift algorithm. During tracking, the speaker's postures are recognized from the data provided by the depth sensor together with a set of prebuilt posture models. The speaker's movement and posture then determine the action of the PTZ camera according to a collection of predefined action rules. The output of the virtual cameraman system is the image sequence acquired by the PTZ camera.
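The mapping from the speaker's state to a camera action can be sketched as a small rule table. The state names and actions below are hypothetical placeholders, not the thesis's actual rule set; the point is only that the recognized movement and posture index into a table of predefined actions, with a safe default (a wide shot) when no rule matches.

```python
# Hypothetical (movement, posture) -> PTZ action rules; the actual
# rule set used by the system is defined in the thesis, not here.
ACTION_RULES = {
    ("still",   "pointing_at_screen"): "zoom_out_to_include_screen",
    ("still",   "neutral"):            "medium_close_up",
    ("walking", "neutral"):            "pan_to_follow",
    ("walking", "pointing_at_screen"): "wide_shot",
}

def select_action(movement, posture):
    """Map the speaker's recognized state to a PTZ camera action.

    Falls back to a wide shot for any unrecognized combination.
    """
    return ACTION_RULES.get((movement, posture), "wide_shot")
```

A lookup table keeps the cinematographic policy declarative, so rules can be tuned without touching the tracking or recognition code.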