Title: Diffusion Features for Zero-Shot 6DoF Object Pose Estimation

Nov 25, 2024

Bernd Von Gimborn, Philipp Ausserlechner, Markus Vincze, Stefan Thalhammer

Abstract: Zero-shot object pose estimation enables the retrieval of object poses from images without requiring object-specific training. Recent approaches achieve this with vision foundation models (VFMs), pre-trained models that serve as general-purpose feature extractors. The characteristics of these VFMs vary with the training data, network architecture, and training paradigm. The prevailing choice in this field is self-supervised Vision Transformers (ViTs). This study assesses the influence of Latent Diffusion Model (LDM) backbones on zero-shot pose estimation. To compare the two families of models on common ground, we adopt and modify a recent approach, and present a template-based, multi-stage method for estimating poses in a zero-shot fashion using LDMs. The efficacy of the proposed approach is empirically evaluated on three standard datasets for object-specific 6DoF pose estimation. The experiments demonstrate an Average Recall improvement of up to 27% over the ViT baseline. The source code is available at: https://github.com/BvG1993/DZOP.
