透過您的圖書館登入
IP:13.59.82.167
  • 學位論文

基於剪影細化與動態貝氏網路之立定跳遠評量系統

A Standing Long Jump Evaluation System Based on Silhouette Thinning and Dynamic Bayesian Networks

指導教授 : 許輝煌

摘要


依照體育專家的研究而知,一個人的四肢靈活度與肌力發展,和幼年的協調性訓練有很大的關係,可以藉由觀察立定跳遠的動作窺知一二。但是一個教師要觀看全部小朋友的姿勢是否正確需要花費很多時間,並且可能無法完全客觀的做判斷。因此我們希望發展出一套系統,可以自動地判斷受測者的姿勢哪裡不正確,進而根據不正確的姿勢給予受測者意見,使受測者可以了解自己動作的缺點而不需要專業的老師在側協助。 本篇論文大致可以分為四大部分: 第一部份是將取出影片中的受測者。我們將此階段分為四個細項來處理:第一個項目是根據輸入的圖片序列重新建造背景;第二個項目是取出每個影格中的前景物件(亦即受測者輪廓);第三個項目是去除雜訊;第四個項目是修補去除雜訊後的輪廓,使其破損程度降低。 第二部分是預測骨架。我們採用細化演算法搭配圖學基礎運算,對細化後的初步結果做更進一步的修飾,刪除雜訊,保留骨架。這部份也是分為四個細項:第一個項目是對輸入的物件輪廓做細化,取得初步的骨架;第二個項目將初步的骨架轉換成圖(Graph)儲存;第三個項目是去除迴圈;第四個項目是去除不正確的骨架雜訊。 第三部份是姿勢判斷。透過動態貝氏網路結構的建立與訓練,利用訓練好的網路對前面第二部份得到的骨架做識別,判斷出最可能的姿勢。這部份將分為三個細項:第一個項目是將骨架轉換成特徵向量;第二個項目是送入貝氏網路當中做訓練;第三個項目是使用訓練好的貝氏網路去找出使受測的特徵向量機率為最大的姿勢。 第四部份是動作改正建議。當判斷出來的姿勢中,有不合理或是屬於我們定義的「不正確」姿勢,我們可以將專家對這些錯誤姿勢的修正建議顯示給受測者,讓受測者可以即時知道該如何修正姿勢。這樣一來就如同教練就在身邊一樣,受測者可常常做自我練習而不用擔心練習了很久卻是錯誤的姿勢。 這篇論文主要的貢獻是在於受測者真的可以透過這套系統,知道某幾種姿勢的修正方式。雖然目前可以辨識出的動作數量不是很多,但是相信在增加更多的參考資訊(如:更多的參考點,更準確的參考點計算方式以及更多的區塊)後,必定可以得到更佳的結果。

並列摘要


From the research of sport experts, body controlling and development of a person’s muscle are related to coordinative training when he/she was a child. If there is any pose which is not good enough, it may influence his body controlling attributes in the future. We can observe this problem from the progress of standing long jump. But it takes a lot of time to see if each child’s pose is good enough or not when there are many children. So we want to develop a system to recognize the defined wrong poses while the human is doing standing long jump, and to give him suggestions about the wrong poses. This thesis can be divided into four parts. First of all, extract the silhouette of the user from each frame. There are 4 steps in the first part. Firstly, we have to re-build the background from the input image sequence. Secondly, the foreground object is extracted from each frame by subtracting the background from each frame. Thirdly, to remove noise from the result that was generated in the second step. The last step is to fill the holes of the extracted silhouette. The second part is to find out the skeleton of the extracted silhouette. We use the thinning algorithm and the graph theorem to complete this work. Using the basic graph operations based on the graph theorem to refine the raw skeleton generated by the thinning algorithm to remove noise. There are also four steps of this part. First step is to thin the input silhouette to obtain raw skeleton. Secondly, raw skeleton should be converted into a graph structure, and the structure of the graph is refined by removing the adjacent junction vertices. Then, remove the loops of the graph to make sure that a simple path between any two vertices is the only single path always. Fourthly, the redundant branches are pruned to obtain the final skeleton. The third part is judging the pose of the skeleton. By constructing and training the Dynamic Bayesian Network. We can then use this network to recognize the skeleton, finding out the most possible pose of each frame. There are three steps of this part. First, convert each skeleton into a feature vector. Then, train the Bayesian network with the feature vectors obtained in the previous step. Finally, we test the feature vectors converted from the test data by finding out the maximum probability of each feature vector in the trained network. The last part is to output suggestions for detected poses that are incorrect. If there is any pose recognized in previous part defined as “incorrect”, we will show the experts’ suggestion for this pose. This actually helps users to know how to adjust the pose just like there is a coach standing aside. The contribution of this research is the user actually can improve his poses by using this system. Although, we define only four different “incorrect” poses to give suggestions. In the future, we will add more information (e.g. more reference nodes, more precise equation of finding out the reference nodes, and more partitions) to this system to obtain better results.

參考文獻


[1] C.-L. Huang, H.-C. Shih, C.-Y. Chao, “Semantic Analysis of Soccer Video Using Dynamic Bayesian Network,” 749-760, Volume 8, Number 1, February 2006, IEEE Transactions on Multimedia (TMM), Vol. 8, 2006.
[2] Hui-Huang Hsu, Sheng-Wen Hsieh, Wu-Chou Chen, and Chun-Jung Chen, “Motion Analysis for the Standing Long Jump,” in Proc. the 26th IEEE Int’l Conf. on Distributed Computing Systems Workshops (ICDCS Workshops 2006), Lisboa, Portugal, July 4-7, 2006.
[3] Sangho Park, J.K. Aggarwal, “Recognition of Two-person Interactions Using a Hierarchical Bayesian Network,” IWV
[4] Ibrahim Karliga and Jenq-Neng Hwang, “Analyzing Human Body 3-D Motion of Golf Swing From Single-camera Video Sequenc
[5] Ying Luo, Tzong-Der Wu, and Jenq-Neng Hwang, “Object-based Analysis and Interpretation of Human Motion in Sports Video Sequences by Dynamic Bayesian Networks,” Computer Vision and Image Understanding, vol. 92 , Issue 2-3 , Pages: 196–

延伸閱讀