Title: Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator

Sep 25, 2023

Hanzhuo Huang, Yufan Feng, Cheng Shi, Lan Xu, Jingyi Yu, Sibei Yang

Abstract: Text-to-video is a rapidly growing research area that aims to generate a semantically, identically, and temporally coherent sequence of frames that accurately aligns with the input text prompt. This study focuses on zero-shot text-to-video generation for its data and cost efficiency. To generate a semantically coherent video that exhibits a rich portrayal of temporal semantics, such as the whole process of a flower blooming rather than a set of "moving images", we propose a novel Free-Bloom pipeline that harnesses large language models (LLMs) as the director to generate a semantically coherent prompt sequence, and pre-trained latent diffusion models (LDMs) as the animator to generate high-fidelity frames. Furthermore, to ensure temporal and identical coherence while maintaining semantic coherence, we propose a series of annotative modifications for adapting LDMs in the reverse process, including joint noise sampling, step-aware attention shift, and dual-path interpolation. Without any video data or training requirements, Free-Bloom generates vivid, high-quality videos and excels at generating complex scenes with semantically meaningful frame sequences. In addition, Free-Bloom is naturally compatible with LDM-based extensions.

* NeurIPS 2023; Project available at: https://github.com/SooLab/Free-Bloom
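
The abstract names joint noise sampling as one of the modifications but does not define it. As a rough illustration only, the sketch below assumes it means giving every frame an initial noise vector that mixes one shared Gaussian sample with a per-frame Gaussian sample, weighted so each frame's noise keeps unit variance. The function name `joint_noise`, the `shared_weight` parameter, and the mixing rule are assumptions made for this example, not the paper's formulation or the repository's API.

```python
import math
import random


def joint_noise(num_frames, dim, shared_weight, seed=0):
    """Return one noise vector per frame, each sharing a common component.

    Hypothetical illustration: shared_weight in [0, 1] is the fraction of
    variance taken from the shared sample. It is not a parameter from the paper.
    """
    if not 0.0 <= shared_weight <= 1.0:
        raise ValueError("shared_weight must be in [0, 1]")
    rng = random.Random(seed)
    # One sample drawn once and reused by every frame.
    shared = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    a = math.sqrt(shared_weight)
    b = math.sqrt(1.0 - shared_weight)
    frames = []
    for _ in range(num_frames):
        # A fresh sample for this frame only.
        own = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        # a**2 + b**2 == 1, so each entry stays standard normal.
        frames.append([a * s + b * o for s, o in zip(shared, own)])
    return frames


if __name__ == "__main__":
    noise = joint_noise(num_frames=6, dim=4, shared_weight=0.8)
    print(len(noise), len(noise[0]))
```

Under this assumption, a `shared_weight` near 1 makes the frames start from nearly the same noise, which favors consistent content across frames, while a value near 0 makes them independent, which allows more variation between frames.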
