透過您的圖書館登入
IP:216.73.216.200
  • 學位論文

基於 RGB 圖像中新物體的 6D 姿態估計

6D Pose Estimation of Novel Objects Based on RGB Images

指導教授 : 陳祝嵩
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


我們提出了一種新方法,能夠從單張RGB圖像準確估計新物體的6自由度(6DoF)姿態。我們的方法巧妙地結合了2D-3D關鍵點的對應和透過渲染比較來優化姿態。具體來說,我們首先使用現有的物體檢測技術檢測輸入圖像中的目標物體,然後通過2D-3D關鍵點匹配來估計初始的6DoF姿態。最後,我們利用3D高斯渲染技術,通過比較渲染圖像與輸入圖像來精細化優化物體的姿態。我們的方法結合了基於點雲模型的2D-3D關鍵點對應和基於3D高斯點的渲染模型,並實現了高效的可微渲染技術。實驗結果顯示,我們的方法在LINEMOD、YCB-V和OnePose-LowTexture等數據集上表現出色,尤其適用於實景和室內場景中的應用。

並列摘要


We introduce a new method for accurately estimating the 6DoF pose of new objects from a single RGB image. Our approach cleverly integrates 2D-3D keypoint correspondences and utilizes rendering comparisons to optimize the pose. Specifically, we first employ existing object detection techniques to detect the target object in the input image. Next, we estimate the initial 6DoF pose using 2D-3D keypoint matching. Finally, we refine the object's pose using 3D Gaussian rendering techniques by comparing rendered images with the input image. Our method combines 2D-3D keypoint correspondences based on point cloud models and utilizes 3D Gaussian rendering models, implementing efficient differentiable rendering techniques. Experimental results demonstrate the effectiveness of our approach on datasets such as LINEMOD, YCB-V, and OnePose-LowTexture, particularly in real-world and indoor settings.

參考文獻


1. Y. An, D. Yang, and M. Song. Hft6d: Multimodal 6d object pose estimation based on hierarchical feature transformer. Measurement, 224:113848, 2024.
2. J. K. S. B. Bowen Wen, Wei Yang. FoundationPose: Unified 6d pose estimation and tracking of novel objects. CVPR, 2024.
3. Y. Bukschat and M. Vetter. Efficientpose: An efficient, accurate and scalable end-to-end 6d multi object pose estimation approach. ArXiv, abs/2011.04307, 2020.
4. M. Cai and I. Reid. Reconstruct locally, localize globally: A model free method for object pose estimation. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 3150–3160, 2020.
5. P. Castro and T.-K. Kim. Posematcher: One-shot 6d object pose estimation by deep feature matching. ICCVW, 2023.

延伸閱讀