
A Semi-Automatic Video Labeling Tool for Autonomous Driving Based on Multi-Object Detector and Tracker

Advisor: 金仲達

Abstract


In recent years, the development of autonomous cars has gained great momentum, driven by major advances in deep learning techniques for recognizing and tracking objects on the road. Applying these techniques normally requires a large set of properly annotated videos to train the neural network model. However, annotating videos is time-consuming and tedious, and it currently relies mainly on human annotators. Automating this process is a chicken-and-egg problem: we need a perfect object detection and tracking tool to annotate the videos in order to train a perfect object detection and tracking algorithm. A viable alternative is to annotate the videos with an imperfect tool and then correct the results manually. In this thesis, we introduce such a semi-automatic video labeling tool for autonomous driving. Our tool is built on the open-source video annotation system VATIC. A multi-object detector and tracker first annotates the video. Possible errors in the object labels are then identified and presented to human annotators, who produce the correct annotations. Experiments on labeling test videos show that our tool completes the annotation task faster while maintaining the same annotation quality as a human annotator. The tool, a modified version of VATIC, is available on GitHub.
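The abstract does not say how possible labeling errors are identified. As a minimal sketch only, the Python below assumes two simple heuristics that are not taken from the thesis: a box is flagged for human review when its detector confidence is low, or when it overlaps poorly with the same object's box in the previous frame. The names `TrackedBox` and `flag_suspect_frames`, and the thresholds `min_score` and `min_iou`, are hypothetical.

```python
from typing import List, NamedTuple, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


class TrackedBox(NamedTuple):
    """One box of a single tracked object (hypothetical structure)."""
    frame: int    # frame index in the video
    box: Box      # box proposed by the detector/tracker
    score: float  # detector confidence in [0, 1]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def flag_suspect_frames(
    track: List[TrackedBox],
    min_score: float = 0.5,  # assumed threshold, not from the thesis
    min_iou: float = 0.3,    # assumed threshold, not from the thesis
) -> List[int]:
    """Return frame indices whose boxes should be shown to a human annotator."""
    suspects: List[int] = []
    prev: Optional[TrackedBox] = None
    for cur in track:
        low_confidence = cur.score < min_score
        # A sudden drop in overlap with the previous box suggests a tracking slip.
        jumped = prev is not None and iou(prev.box, cur.box) < min_iou
        if low_confidence or jumped:
            suspects.append(cur.frame)
        prev = cur
    return suspects
```

Under these assumptions, only the flagged frames would need manual correction, while the remaining boxes are accepted as proposed; that is the sense in which the workflow is semi-automatic.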

References


[1] VATIC autolabel features, 2018. https://github.com/billy0059/VATIC_AutoLabelFeaturs.
[2] A. Ambardekar, M. Nicolescu, and S. Dascalu. Ground truth verification tool (GTVT) for video surveillance systems. pages 354–359, Feb 2009.
[3] K. Bernardin and R. Stiefelhagen. Evaluating multiple object tracking performance: The CLEAR MOT metrics. EURASIP Journal on Image and Video Processing, 2008(1):246309, May 2008.
[4] F. Comaschi, S. Stuijk, T. Basten, and H. Corporaal. A tool for fast ground truth generation for object detection and tracking from video. pages 368–372, Oct 2014.
[5] A. Geiger, P. Lenz, C. Stiller, and R. Urtasun. Vision meets robotics: The KITTI dataset. International Journal of Robotics Research (IJRR), 2013.
