Title: Learning to Render Novel Views from Wide-Baseline Stereo Pairs

Apr 17, 2023

Yilun Du, Cameron Smith, Ayush Tewari, Vincent Sitzmann

Abstract: We introduce a method for novel view synthesis given only a single wide-baseline stereo image pair. In this challenging regime, 3D scene points are regularly observed only once, requiring prior-based reconstruction of scene geometry and appearance. We find that existing approaches to novel view synthesis from sparse observations fail due to recovering incorrect 3D geometry and due to the high cost of differentiable rendering that precludes their scaling to large-scale training. We take a step towards resolving these shortcomings by formulating a multi-view transformer encoder, proposing an efficient, image-space epipolar line sampling scheme to assemble image features for a target ray, and a lightweight cross-attention-based renderer. Our contributions enable training of our method on a large-scale real-world dataset of indoor and outdoor scenes. We demonstrate that our method learns powerful multi-view geometry priors while reducing the rendering time. We conduct extensive comparisons on held-out test scenes across two real-world datasets, significantly outperforming prior work on novel view synthesis from sparse image observations and achieving multi-view-consistent novel view synthesis.

* CVPR 2023, Project Webpage: https://yilundu.github.io/wide_baseline/, Last Two Authors Equal Advising
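
To make the "image-space epipolar line sampling" idea from the abstract concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a simple pinhole camera with intrinsics `K` and world-to-camera extrinsics `R`, `t`, and it assumes the ray segment between the hypothetical `near` and `far` depths lies in front of the context camera. The function names and parameters are illustrative only. The sketch projects the two ends of a target ray into a context view and spaces sample positions uniformly in pixel space along the resulting epipolar line segment; in a full pipeline, image features would then be looked up at those positions.

```python
import numpy as np


def project(K, R, t, points):
    """Project world-space points of shape (N, 3) to pixel coordinates (N, 2).

    Assumes a pinhole camera and that all points have positive depth
    in the camera frame (no handling of points behind the camera).
    """
    cam = points @ R.T + t           # world frame -> camera frame
    pix = cam @ K.T                  # camera frame -> homogeneous pixels
    return pix[:, :2] / pix[:, 2:3]  # perspective divide


def sample_epipolar_line(K, R, t, origin, direction, near, far, n_samples):
    """Return (n_samples, 2) pixel positions on the epipolar line segment.

    The segment is the projection of the target ray between depths
    `near` and `far`. Samples are uniform in image space, not in depth.
    """
    endpoints = np.stack([origin + near * direction,
                          origin + far * direction])
    uv_near, uv_far = project(K, R, t, endpoints)
    w = np.linspace(0.0, 1.0, n_samples)[:, None]
    # A 3D line projects to a 2D line under a pinhole model, so linear
    # interpolation between the projected endpoints stays on the epipolar line.
    return (1.0 - w) * uv_near + w * uv_far


# Illustrative usage with made-up camera parameters.
K = np.array([[100.0, 0.0, 50.0],
              [0.0, 100.0, 50.0],
              [0.0, 0.0, 1.0]])
R = np.eye(3)
t = np.zeros(3)
uv = sample_epipolar_line(K, R, t,
                          origin=np.array([1.0, 0.0, 0.0]),
                          direction=np.array([0.0, 0.0, 1.0]),
                          near=1.0, far=4.0, n_samples=4)
```

Sampling uniformly in pixel space rather than in depth means the number of samples tracks the length of the line segment in the image, which is one plausible reading of why the abstract describes the scheme as efficient.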
