Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Edit Temporal-Consistent Videos with Image Diffusion Model

Aug 17, 2023

Yuanzhi Wang, Yong Li, Xin Liu, Anbo Dai, Antoni Chan, Zhen Cui

Figure 1 for Edit Temporal-Consistent Videos with Image Diffusion Model

Figure 2 for Edit Temporal-Consistent Videos with Image Diffusion Model

Figure 3 for Edit Temporal-Consistent Videos with Image Diffusion Model

Figure 4 for Edit Temporal-Consistent Videos with Image Diffusion Model

Share this with someone who'll enjoy it:

Abstract:Large-scale text-to-image (T2I) diffusion models have been extended for text-guided video editing, yielding impressive zero-shot video editing performance. Nonetheless, the generated videos usually show spatial irregularities and temporal inconsistencies as the temporal characteristics of videos have not been faithfully modeled. In this paper, we propose an elegant yet effective Temporal-Consistent Video Editing (TCVE) method, to mitigate the temporal inconsistency challenge for robust text-guided video editing. In addition to the utilization of a pretrained 2D Unet for spatial content manipulation, we establish a dedicated temporal Unet architecture to faithfully capture the temporal coherence of the input video sequences. Furthermore, to establish coherence and interrelation between the spatial-focused and temporal-focused components, a cohesive joint spatial-temporal modeling unit is formulated. This unit effectively interconnects the temporal Unet with the pretrained 2D Unet, thereby enhancing the temporal consistency of the generated video output while simultaneously preserving the capacity for video content manipulation. Quantitative experimental results and visualization results demonstrate that TCVE achieves state-of-the-art performance in both video temporal consistency and video editing capability, surpassing existing benchmarks in the field.

* 8 pages, 7 figures

View paper on

Share this with someone who'll enjoy it:

Title:Edit Temporal-Consistent Videos with Image Diffusion Model

Paper and Code