Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Nov 28, 2024

Feng Liu, Shiwei Zhang, Xiaofeng Wang, Yujie Wei, Haonan Qiu, Yuzhong Zhao, Yingya Zhang, Qixiang Ye, Fang Wan

Figure 1 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Figure 2 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Figure 3 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Figure 4 for Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Share this with someone who'll enjoy it:

Abstract:As a fundamental backbone for video generation, diffusion models are challenged by low inference speed due to the sequential nature of denoising. Previous methods speed up the models by caching and reusing model outputs at uniformly selected timesteps. However, such a strategy neglects the fact that differences among model outputs are not uniform across timesteps, which hinders selecting the appropriate model outputs to cache, leading to a poor balance between inference efficiency and visual quality. In this study, we introduce Timestep Embedding Aware Cache (TeaCache), a training-free caching approach that estimates and leverages the fluctuating differences among model outputs across timesteps. Rather than directly using the time-consuming model outputs, TeaCache focuses on model inputs, which have a strong correlation with the modeloutputs while incurring negligible computational cost. TeaCache first modulates the noisy inputs using the timestep embeddings to ensure their differences better approximating those of model outputs. TeaCache then introduces a rescaling strategy to refine the estimated differences and utilizes them to indicate output caching. Experiments show that TeaCache achieves up to 4.41x acceleration over Open-Sora-Plan with negligible (-0.07% Vbench score) degradation of visual quality.

* Project: https://liewfeng.github.io/TeaCache

View paper on

Share this with someone who'll enjoy it:

Title:Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper and Code