Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Jun 03, 2024

Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang

Figure 1 for Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Figure 2 for Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Figure 3 for Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Figure 4 for Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Share this with someone who'll enjoy it:

Abstract:Diffusion-based text-to-image generation models trained on extensive text-image pairs have shown the capacity to generate photorealistic images consistent with textual descriptions. However, a significant limitation of these models is their slow sample generation, which requires iterative refinement through the same network. In this paper, we enhance Score identity Distillation (SiD) by developing long and short classifier-free guidance (LSG) to efficiently distill pretrained Stable Diffusion models without using real training data. SiD aims to optimize a model-based explicit score matching loss, utilizing a score-identity-based approximation alongside the proposed LSG for practical computation. By training exclusively with fake images synthesized with its one-step generator, SiD equipped with LSG rapidly improves FID and CLIP scores, achieving state-of-the-art FID performance while maintaining a competitive CLIP score. Specifically, its data-free distillation of Stable Diffusion 1.5 achieves a record low FID of 8.15 on the COCO-2014 validation set, with a CLIP score of 0.304 at an LSG scale of 1.5, and a FID of 9.56 with a CLIP score of 0.313 at an LSG scale of 2. We will make our PyTorch implementation and distilled Stable Diffusion one-step generators available at https://github.com/mingyuanzhou/SiD-LSG

View paper on

Share this with someone who'll enjoy it:

Title:Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Paper and Code