Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Accelerating Video Diffusion Models via Distribution Matching

Dec 08, 2024

Yuanzhi Zhu, Hanshu Yan, Huan Yang, Kai Zhang, Junnan Li

Figure 1 for Accelerating Video Diffusion Models via Distribution Matching

Figure 2 for Accelerating Video Diffusion Models via Distribution Matching

Figure 3 for Accelerating Video Diffusion Models via Distribution Matching

Figure 4 for Accelerating Video Diffusion Models via Distribution Matching

Share this with someone who'll enjoy it:

Abstract:Generative models, particularly diffusion models, have made significant success in data synthesis across various modalities, including images, videos, and 3D assets. However, current diffusion models are computationally intensive, often requiring numerous sampling steps that limit their practical application, especially in video generation. This work introduces a novel framework for diffusion distillation and distribution matching that dramatically reduces the number of inference steps while maintaining-and potentially improving-generation quality. Our approach focuses on distilling pre-trained diffusion models into a more efficient few-step generator, specifically targeting video generation. By leveraging a combination of video GAN loss and a novel 2D score distribution matching loss, we demonstrate the potential to generate high-quality video frames with substantially fewer sampling steps. To be specific, the proposed method incorporates a denoising GAN discriminator to distil from the real data and a pre-trained image diffusion model to enhance the frame quality and the prompt-following capabilities. Experimental results using AnimateDiff as the teacher model showcase the method's effectiveness, achieving superior performance in just four sampling steps compared to existing techniques.

View paper on

Share this with someone who'll enjoy it:

Title:Accelerating Video Diffusion Models via Distribution Matching

Paper and Code