Title: Cameras as Rays: Pose Estimation via Ray Diffusion

Feb 22, 2024

Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani

Abstract: Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparse views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera extrinsics, we propose a distributed representation of camera pose that treats a camera as a bundle of rays. This representation allows for a tight coupling with spatial image features, improving pose precision. We observe that this representation is naturally suited for set-level transformers and develop a regression-based approach that maps image patches to corresponding rays. To capture the inherent uncertainties in sparse-view pose inference, we adapt this approach to learn a denoising diffusion model, which allows us to sample plausible modes while improving performance. Our proposed methods, both regression- and diffusion-based, demonstrate state-of-the-art performance on camera pose estimation on CO3D while generalizing to unseen object categories and in-the-wild captures.
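To make the "camera as a bundle of rays" idea concrete, here is a minimal sketch of how a known pinhole camera could be converted into one ray per image patch. It is an illustration only, not the authors' implementation: the function name `camera_to_rays`, the grid size, the world-to-camera convention `x_cam = R @ x_world + t`, and the choice to store each ray as a unit direction plus an origin are all assumptions made for this example.

```python
import numpy as np

def camera_to_rays(K, R, t, image_size, grid=4):
    """Hypothetical helper: one ray per patch centre on a grid x grid layout.

    Assumes a pinhole camera with intrinsics K (3x3) and world-to-camera
    extrinsics x_cam = R @ x_world + t. Returns an array of shape
    (grid * grid, 6): unit direction (3) followed by ray origin (3),
    both expressed in world coordinates.
    """
    h, w = image_size
    # Pixel coordinates of the patch centres.
    us = (np.arange(grid) + 0.5) * w / grid
    vs = (np.arange(grid) + 0.5) * h / grid
    uu, vv = np.meshgrid(us, vs)
    pix = np.stack([uu.ravel(), vv.ravel(), np.ones(grid * grid)], axis=1)

    # Back-project pixels into the camera frame: d_cam = K^{-1} [u, v, 1]^T.
    d_cam = pix @ np.linalg.inv(K).T
    # Rotate into the world frame: d_world = R^T d_cam (row-vector form).
    d_world = d_cam @ R
    d_world = d_world / np.linalg.norm(d_world, axis=1, keepdims=True)

    # All rays of one camera start at its centre, c = -R^T t.
    center = -R.T @ t
    origins = np.broadcast_to(center, d_world.shape)
    return np.concatenate([d_world, origins], axis=1)
```

Under these assumptions, every patch carries its own ray, so a network that predicts rays per patch is implicitly predicting the camera. The camera centre can be read back as the point where the rays meet, and the rotation and intrinsics from how the directions vary across the grid.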

* To appear in ICLR 2024 (oral). Project webpage: https://jasonyzhang.com/RayDiffusion
