Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Mar 09, 2025

Yang Zou, Zhaoshuai Qi, Yating Liu, Zihao Xu, Weipeng Sun, Weiyi Liu, Xingyuan Li, Jiaqi Yang, Yanning Zhang

Figure 1 for AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Figure 2 for AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Figure 3 for AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Figure 4 for AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Share this with someone who'll enjoy it:

Abstract:Object pose estimation, which plays a vital role in robotics, augmented reality, and autonomous driving, has been of great interest in computer vision. Existing studies either require multi-stage pose regression or rely on 2D-3D feature matching. Though these approaches have shown promising results, they rely heavily on appearance information, requiring complex input (i.e., multi-view reference input, depth, or CAD models) and intricate pipeline (i.e., feature extraction-SfM-2D to 3D matching-PnP). We propose AxisPose, a model-free, matching-free, single-shot solution for robust 6D pose estimation, which fundamentally diverges from the existing paradigm. Unlike existing methods that rely on 2D-3D or 2D-2D matching using 3D techniques, such as SfM and PnP, AxisPose directly infers a robust 6D pose from a single view by leveraging a diffusion model to learn the latent axis distribution of objects without reference views. Specifically, AxisPose constructs an Axis Generation Module (AGM) to capture the latent geometric distribution of object axes through a diffusion model. The diffusion process is guided by injecting the gradient of geometric consistency loss into the noise estimation to maintain the geometric consistency of the generated tri-axis. With the generated tri-axis projection, AxisPose further adopts a Triaxial Back-projection Module (TBM) to recover the 6D pose from the object tri-axis. The proposed AxisPose achieves robust performance at the cross-instance level (i.e., one model for N instances) using only a single view as input without reference images, with great potential for generalization to unseen-object level.

View paper on

Share this with someone who'll enjoy it:

Title:AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Paper and Code