We introduce a method for accurately estimating the 6DoF pose of novel objects from a single RGB image. Our approach combines 2D-3D keypoint correspondences with render-and-compare pose refinement. Specifically, we first detect the target object in the input image using an off-the-shelf object detector, then estimate an initial 6DoF pose via 2D-3D keypoint matching against a point-cloud model of the object. Finally, we refine the pose with an efficient differentiable 3D Gaussian renderer, comparing rendered images against the input image. Experimental results demonstrate the effectiveness of our approach on the LINEMOD, YCB-V, and OnePose-LowTexture datasets, particularly in real-world and indoor settings.
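The initialize-then-refine idea can be sketched in miniature. This is not the authors' implementation: the actual method matches learned keypoints and back-propagates through a differentiable 3D Gaussian renderer, whereas this toy assumes known 2D-3D correspondences and refines a 6DoF pose (axis-angle rotation plus translation) by minimizing mean squared reprojection error with numerical gradients and backtracking line search. All function and variable names here are illustrative.

```python
import numpy as np

def rodrigues(rvec):
    """Axis-angle vector -> 3x3 rotation matrix (Rodrigues' formula)."""
    theta = np.linalg.norm(rvec)
    if theta < 1e-12:
        return np.eye(3)
    k = rvec / theta
    Kx = np.array([[0.0, -k[2], k[1]],
                   [k[2], 0.0, -k[0]],
                   [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * Kx + (1.0 - np.cos(theta)) * (Kx @ Kx)

def project(pts3d, rvec, tvec, K):
    """Pinhole projection of Nx3 model points into pixel coordinates."""
    Xc = pts3d @ rodrigues(rvec).T + tvec      # camera-frame points
    uv = Xc @ K.T
    return uv[:, :2] / uv[:, 2:3]              # perspective divide

def reproj_error(params, pts3d, pts2d, K):
    """Mean squared reprojection error for a 6-vector [rvec | tvec]."""
    rvec, tvec = params[:3], params[3:]
    diff = project(pts3d, rvec, tvec, K) - pts2d
    return np.mean(np.sum(diff ** 2, axis=1))

def refine_pose(pts3d, pts2d, K, init, iters=200, eps=1e-6):
    """Refine an initial pose by gradient descent on reprojection error.

    Uses forward-difference gradients and backtracking line search, so the
    error is non-increasing; a real system would use analytic Jacobians
    (e.g. Gauss-Newton) or autodiff through the renderer instead.
    """
    p = init.copy()
    for _ in range(iters):
        e0 = reproj_error(p, pts3d, pts2d, K)
        g = np.zeros(6)
        for i in range(6):
            q = p.copy()
            q[i] += eps
            g[i] = (reproj_error(q, pts3d, pts2d, K) - e0) / eps
        step = 1e-2
        while step > 1e-12:                    # backtrack until error drops
            cand = p - step * g
            if reproj_error(cand, pts3d, pts2d, K) < e0:
                p = cand
                break
            step *= 0.5
    return p
```

In the paper's pipeline, the initial pose (here `init`) would come from 2D-3D keypoint matching, and the refinement objective would be a photometric comparison of the Gaussian-splatted rendering against the input image rather than keypoint reprojection.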