透過您的圖書館登入
IP:3.133.87.185
  • 學位論文

提升物件追蹤中連續框遺漏補償方法

A compensation method to improve the problem of missing continuous frames in object tracking

指導教授 : 張翠蘋

摘要


物件偵測是利用電腦視覺機器學習的技術,在圖像或影片中識別並分類物件。 隨著模型設計愈趨複雜,物件偵測的準確度和速度也逐漸提升。YOLOv4 由於其 較低的設備需求、快速運算速度以及在 MS COCO 測試集中的優異表現,成為許 多應用中物件追蹤的選擇。然而,在連續影片物件偵測與分類的研究領域中,通 常會採用更複雜的模型以提升偵測的準確度。這種方法將連續影片分解成多個圖 片進行偵測,但容易產生物件框無法連續被檢測的問題,且模型無法明確知道物 件的移動路徑。本研究以 YOLOv4 作為前端物件偵測的架構,再利用 DeepSORT 追蹤進行連續圖像中的物件追蹤。同時,我們提出了一種連續框物件補償方法 CFM,用以補償因圖像切割而導致的物件框遺漏,以實現物件的連續追蹤。我們 設計的四組實驗證實 CFM能夠有效地補償追蹤影片中的目標物件的連續物件框, 這四組實驗設計中,我們也分別以效能較佳的 YOLOv7-E6E 及 StrongSORT 替代 YOLO v4 及 DeepSORT 來檢視 CFM 效能,亦透過不同的參數設定來進行 CFM 效 能分析。此外,我們使用 HOTA 與 MOTchallenge 指標來評估連續影片的計算方 法,以驗證我們方法的有效性,並與其他物件追蹤模型進行比較。最終,實驗結 果證明了 CFM 能夠在多遮蔽物的環境中,提高物件追蹤和識別的效能。

並列摘要


Object detection is a computer vision machine learning technique used to recognize and classify objects in images or videos. With increasingly complex model designs, the accuracy and speed of object detection have significantly improved. YOLOv4, due to its lower hardware requirements, fast computation speed, and outstanding performance on the MS COCO test set, has become a preferred choice for object tracking in various applications. However, in the research field of continuous image object detection and classification, more sophisticated models are often adopted to enhance detection accuracy. This approach involves breaking down continuous images into multiple frames for detection, which can lead to issues such as non-continuous detection of object boxes and lack of information about the objects' movement paths.In this study, we employ YOLOv4 as the front-end object detection architecture and integrate it with DeepSORT for continuous object tracking in image sequences. Additionally, we propose a novel method called Compensation Frame Mechanism (CFM) to address the issue of missing object boxes caused by image segmentation, thus achieving continuous object tracking. We conduct four sets of experiments to demonstrate the effectiveness of CFM in compensating for missing object boxes. These experiments also compare CFM's performance with YOLOv7-E6E and StrongSORT, which replace YOLOv4 and DeepSORT, respectively. Furthermore, we perform CFM's performance analysis with different parameter settings. To evaluate the effectiveness of our method, we use HOTA and MOTchallenge metrics to assess the calculation method for continuous images, and we compare our proposed approach with other object tracking models. The experimental results conclusively demonstrate that CFM effectively improves object tracking and identification performance, especially in environments with multiple occlusions.

並列關鍵字

YOLOv4 Object detection DeepSORT Object tracking

參考文獻


[1] T. M. Mitchell, Machine learning. McGraw-hill New York, 2007.
[2] K. Pustu-Iren, J. Sittel, R. Mauer, O. Bulgakowa, and R. Ewerth, "Automated Visual
Content Analysis for Film Studies: Current Status and Challenges," DHQ: Digital
Humanities Quarterly, vol. 14, no. 4, 2020.
[3] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "Imagenet: A large-scale

延伸閱讀