Title: MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion

Jul 16, 2023

Shitao Tang, Fuyang Zhang, Jiacheng Chen, Peng Wang, Yasutaka Furukawa

Abstract: This paper introduces MVDiffusion, a simple yet effective multi-view image generation method for scenarios where pixel-to-pixel correspondences are available, such as perspective crops from a panorama or multi-view images given geometry (depth maps and poses). Unlike prior models that rely on iterative image warping and inpainting, MVDiffusion generates all images concurrently with global awareness, producing high-resolution, content-rich outputs and addressing the error accumulation prevalent in preceding models. MVDiffusion incorporates a correspondence-aware attention mechanism that enables effective cross-view interaction. This mechanism underpins three pivotal modules: 1) a generation module that produces low-resolution images while maintaining global correspondence, 2) an interpolation module that densifies spatial coverage between images, and 3) a super-resolution module that upscales into high-resolution outputs. For panoramic imagery, MVDiffusion can generate high-resolution photorealistic images up to 1024$\times$1024 pixels. For geometry-conditioned multi-view image generation, MVDiffusion is the first method capable of generating a textured map of a scene mesh. The project page is at https://mvdiffusion.github.io.

* Project page: https://mvdiffusion.github.io
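
The abstract assumes pixel-to-pixel correspondences between views, as with perspective crops from a panorama. The sketch below illustrates one way such a correspondence could be computed for two pinhole crops that share a camera center and differ only by a yaw rotation. It is not the authors' code: the function name `corresponding_pixel`, the 512-pixel image size, the 90-degree field of view, and the yaw-only rotation are all assumptions chosen for illustration.

```python
import math


def corresponding_pixel(u, v, yaw_deg, width=512, height=512, fov_deg=90.0):
    """Map pixel (u, v) in view A to its corresponding pixel in view B.

    Both views are pinhole crops sharing one camera center; view B is view A
    rotated by yaw_deg about the vertical axis (positive = looking right).
    Returns (u_b, v_b), or None if the point is not visible in view B.
    All defaults are illustrative assumptions, not values from the paper.
    """
    # Focal length in pixels from the horizontal field of view.
    f = (width / 2.0) / math.tan(math.radians(fov_deg) / 2.0)
    cx, cy = width / 2.0, height / 2.0

    # Back-project the pixel to a viewing ray in view A's camera frame.
    x, y, z = (u - cx) / f, (v - cy) / f, 1.0

    # Express the ray in view B's frame (inverse of B's yaw rotation).
    t = math.radians(yaw_deg)
    xb = math.cos(t) * x - math.sin(t) * z
    yb = y
    zb = math.sin(t) * x + math.cos(t) * z

    if zb <= 0:  # ray points behind view B's image plane
        return None

    # Project the ray onto view B's image plane.
    ub, vb = f * xb / zb + cx, f * yb / zb + cy
    if 0 <= ub < width and 0 <= vb < height:
        return ub, vb
    return None
```

Under these assumptions, `corresponding_pixel(384, 256, yaw_deg=45)` returns approximately `(170.67, 256.0)`: a point right of center in view A appears left of center in a view turned 45 degrees to the right. Pixels with no counterpart in the second view yield `None`, which is where a correspondence-aware mechanism would have nothing to attend to.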
