SMARTANNOTATOR: 互動式室內 RGBD 場景標註系統

在場景認知(Scene Understanding)和影像操作(Image Manipulation)領域，包含高階語意標註的RGBD資料庫是非常有用的。因為我們可以從資料庫萃取出先備知識(Prior Knowledge)。現在由於深度感測器的普及化，RGBD 資料的收集已經變得容易，但是高階語意的標註工作仍是相當繁冗。在本研究中，我們設計了一個互動式的RGBD 資料標注系統SmartAnnotator。該系統可以自動的推測出場景中物件的名稱與幾何抽象表示(Cuboid) 以及物件之間的結構關係。使用者可以由系統產生的建議名稱列表，快速得確認標注。在標注過程中，根據使用者的輸入，系統便會自動修正並改善幾何表示與場景結構。此外隨著越多資料被標註，系統的推測也會越來越準確。本研究設計了四個實驗來分析此系統的效能，包括大量數據的標注效率、與簡易方法(Naive Method) 的比較、對於不同物件分割影響探討、以及系統計算速度分析。實驗結果顯示本系統可以有效改善傳統 RGBD 資料標注的效率，並產生高品質的RGBD 標註資料庫。

關鍵字

電腦視覺；電腦圖學；場景認知；標註

並列摘要

RGBD images with high quality annotations, both in the form of geometric(i.e., segmentation) and structural (i.e., how do the segments mutually relate in 3D) information, provide valuable priors for a diverse range of applications in scene understanding and image manipulation. While it is now simple to acquire RGBD images, annotating them, automatically or manually, remains challenging. We present SmartAnnotator, an interactive system to facilitate annotating raw RGBD images. The system performs the tedious tasks of grouping pixels, creating potential abstracted cuboids, inferring object interactions in 3D, and generates an ordered list of hypotheses. The user simply has to flip through the suggestions for segment labels, finalize a selection, and the system updates the remaining hypotheses. As annotations are finalized, the process becomes simpler with fewer ambiguities to resolve. Moreover, as more scenes are annotated, the system makes better suggestions based on the structural and geometric priors learned from previous annotation sessions. We test the system on a large number of indoor scenes across different users and experimental settings, validate the results on existing benchmark datasets, and report significant improvements over low-level annotation alternatives.

並列關鍵字

Computer Vision ； Computer Graphics ； Scene Understanding ； Annotation

參考文獻

[1] Bryan C. Russell, Antonio Torralba, Kevin P. Murphy, and William T. Freeman.

LabelMe: A database and web-based tool for image annotation. IJCV, 2008.

foreground extraction using iterated graph cuts. ACM Trans. Graph. (Proc.

[3] Ruiqi Guo and Derek Hoiem. Support surface prediction in indoor scenes. IEEE

spaces reconstructed using sfm and object labels. In IEEE ICCV, 2013.

國際替代計量

SMARTANNOTATOR: 互動式室內 RGBD 場景標註系統

主題瀏覽