Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FIFO-Diffusion: Generating Infinite Videos from Text without Training

May 19, 2024

Jihwan Kim, Junoh Kang, Jinyoung Choi, Bohyung Han

Figure 1 for FIFO-Diffusion: Generating Infinite Videos from Text without Training

Figure 2 for FIFO-Diffusion: Generating Infinite Videos from Text without Training

Figure 3 for FIFO-Diffusion: Generating Infinite Videos from Text without Training

Figure 4 for FIFO-Diffusion: Generating Infinite Videos from Text without Training

Share this with someone who'll enjoy it:

Abstract:We propose a novel inference technique based on a pretrained diffusion model for text-conditional video generation. Our approach, called FIFO-Diffusion, is conceptually capable of generating infinitely long videos without training. This is achieved by iteratively performing diagonal denoising, which concurrently processes a series of consecutive frames with increasing noise levels in a queue; our method dequeues a fully denoised frame at the head while enqueuing a new random noise frame at the tail. However, diagonal denoising is a double-edged sword as the frames near the tail can take advantage of cleaner ones by forward reference but such a strategy induces the discrepancy between training and inference. Hence, we introduce latent partitioning to reduce the training-inference gap and lookahead denoising to leverage the benefit of forward referencing. We have demonstrated the promising results and effectiveness of the proposed methods on existing text-to-video generation baselines.

* Project Page: https://jjihwan.github.io/projects/FIFO-Diffusion

View paper on

Share this with someone who'll enjoy it:

Title:FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper and Code