Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Jul 04, 2023

Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas Müller, Joe Penna, Robin Rombach

Figure 1 for SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Figure 2 for SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Figure 3 for SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Figure 4 for SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Share this with someone who'll enjoy it:

Abstract:We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone: The increase of model parameters is mainly due to more attention blocks and a larger cross-attention context as SDXL uses a second text encoder. We design multiple novel conditioning schemes and train SDXL on multiple aspect ratios. We also introduce a refinement model which is used to improve the visual fidelity of samples generated by SDXL using a post-hoc image-to-image technique. We demonstrate that SDXL shows drastically improved performance compared the previous versions of Stable Diffusion and achieves results competitive with those of black-box state-of-the-art image generators. In the spirit of promoting open research and fostering transparency in large model training and evaluation, we provide access to code and model weights at https://github.com/Stability-AI/generative-models

View paper on

Share this with someone who'll enjoy it:

Title:SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper and Code