Title: Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation

Mar 21, 2024

Francesco Di Felice, Alberto Remus, Stefano Gasperini, Benjamin Busam, Lionel Ott, Federico Tombari, Roland Siegwart, Carlo Alberto Avizzano

Abstract: Estimating the pose of objects through vision is essential for robotic platforms to interact with their environment. Yet it presents many challenges, often related to the limited flexibility and generalizability of state-of-the-art solutions. Diffusion models are a cutting-edge neural architecture transforming 2D and 3D computer vision, and they show remarkable performance in zero-shot novel-view synthesis. This use case is particularly attractive for reconstructing 3D objects. However, their use for localizing objects in unstructured environments remains largely unexplored. To this end, this work presents Zero123-6D, which demonstrates the utility of diffusion-model-based novel-view synthesizers in enhancing category-level RGB 6D pose estimation by integrating them with feature extraction techniques. The method uses such a novel-view synthesizer to expand a sparse set of RGB-only reference views for the zero-shot 6D pose estimation task. Experiments on the CO3D dataset are analyzed quantitatively and show increased performance over baselines, a substantial reduction in data requirements, and no need for depth information.
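
The idea of expanding a sparse set of reference views before matching can be illustrated with a deliberately simplified toy sketch. It is not the paper's implementation: the names `expand_references`, `estimate_azimuth`, `toy_synthesize`, and `toy_extract` are hypothetical, the "images" are plain numbers standing in for rendered views, and only a single rotation angle is recovered rather than a full 6D pose. In a real pipeline, the `synthesize` callable would presumably be a diffusion-based novel-view synthesizer and `extract` an image feature extractor.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def expand_references(references, synthesize, azimuth_offsets_deg):
    """Densify a sparse reference set with synthesized views.

    references: list of (image, azimuth_deg) pairs with known viewpoints.
    synthesize: hypothetical callable (image, delta_deg) -> new image,
                standing in for a novel-view synthesizer.
    """
    expanded = list(references)
    for image, azimuth in references:
        for delta in azimuth_offsets_deg:
            expanded.append((synthesize(image, delta), (azimuth + delta) % 360.0))
    return expanded


def estimate_azimuth(query_image, references, extract):
    """Return the azimuth of the reference whose features best match the query.

    extract: hypothetical callable image -> feature vector.
    """
    query_features = extract(query_image)
    best = max(references, key=lambda ref: cosine(query_features, extract(ref[0])))
    return best[1]


# Toy stand-ins (assumptions for illustration only): an "image" is just the
# viewing angle in degrees, and its "feature" is the matching unit vector.
def toy_synthesize(image, delta_deg):
    return (image + delta_deg) % 360.0


def toy_extract(image):
    return [math.cos(math.radians(image)), math.sin(math.radians(image))]


sparse = [(0.0, 0.0), (180.0, 180.0)]  # two reference views with known azimuth
dense = expand_references(sparse, toy_synthesize, [-60, -30, 30, 60])

coarse_estimate = estimate_azimuth(50.0, sparse, toy_extract)
refined_estimate = estimate_azimuth(50.0, dense, toy_extract)
```

With only the two original references, a query seen from 50 degrees can at best be matched to one of two widely spaced viewpoints; after expansion, a much closer synthesized viewpoint is available. Nearest-view retrieval is used here purely for simplicity and is an assumption, not a description of how Zero123-6D matches features.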

* 6 pages, 2 reference pages, 4 figures
