Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Aug 03, 2024

Junyan Ye, Jun He, Weijia Li, Zhutao Lv, Jinhua Yu, Haote Yang, Conghui He

Figure 1 for SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Figure 2 for SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Figure 3 for SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Figure 4 for SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Share this with someone who'll enjoy it:

Abstract:Street-to-satellite image synthesis focuses on generating realistic satellite images from corresponding ground street-view images while maintaining a consistent content layout, similar to looking down from the sky. The significant differences in perspectives create a substantial domain gap between the views, making this cross-view generation task particularly challenging. In this paper, we introduce SkyDiffusion, a novel cross-view generation method for synthesizing satellite images from street-view images, leveraging diffusion models and Bird's Eye View (BEV) paradigm. First, we design a Curved-BEV method to transform street-view images to the satellite view, reformulating the challenging cross-domain image synthesis task into a conditional generation problem. Curved-BEV also includes a "Multi-to-One" mapping strategy for combining multiple street-view images within the same satellite coverage area, effectively solving the occlusion issues in dense urban scenes. Next, we design a BEV-controlled diffusion model to generate satellite images consistent with the street-view content, which also incorporates a light manipulation module to optimize the lighting condition of the synthesized image using a reference satellite. Experimental results demonstrate that SkyDiffusion outperforms state-of-the-art methods on both suburban (CVUSA & CVACT) and urban (VIGOR-Chicago) cross-view datasets, with an average SSIM increase of 14.5% and a FID reduction of 29.6%, achieving realistic and content-consistent satellite image generation. The code and models of this work will be released at https://opendatalab.github.io/skydiffusion/.

* 12 pages, 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Paper and Code