Title: ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

Apr 23, 2024

Xuanhua He, Quande Liu, Shengju Qian, Xin Wang, Tao Hu, Ke Cao, Keyu Yan, Man Zhou, Jie Zhang

Abstract: Generating high-fidelity human video with specified identities has attracted significant attention in the content generation community. However, existing techniques struggle to balance training efficiency and identity preservation, either requiring tedious case-by-case fine-tuning or often losing identity details during video generation. In this study, we present ID-Animator, a zero-shot human video generation approach that performs personalized video generation from a single reference facial image without further training. ID-Animator builds on existing diffusion-based video generation backbones and adds a face adapter that encodes ID-relevant embeddings from learnable facial latent queries. To facilitate the extraction of identity information in video generation, we introduce an ID-oriented dataset construction pipeline, which incorporates a decoupled human attribute and action captioning technique from a constructed facial image pool. Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation. Extensive experiments demonstrate the superiority of ID-Animator over previous models in generating personalized human videos. Moreover, our method is highly compatible with popular pre-trained T2V models such as AnimateDiff and with various community backbone models, showing high extensibility in real-world video generation applications where identity preservation is highly desired. Our code and checkpoints will be released at https://github.com/ID-Animator/ID-Animator.

* Project Page: https://id-animator.github.io/
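
The abstract names two mechanisms without giving details: a face adapter driven by learnable facial latent queries, and training with a randomly chosen face reference. The sketch below is only an illustration of those two ideas under stated assumptions, not the authors' implementation. It assumes the latent queries attend over face features with plain single-head dot-product attention, and that the reference face is drawn uniformly from a pool of faces. All names (`cross_attend`, `pick_reference_face`, `face_pool`) are hypothetical, and the random feature vectors stand in for the output of a real image encoder.

```python
import math
import random


def cross_attend(queries, features):
    """Single-head dot-product attention: every query row attends over all feature rows.

    Assumption: no learned projections, heads, or normalization layers;
    a real adapter would very likely include them.
    """
    dim = len(queries[0])
    outputs = []
    for q in queries:
        # Scaled similarity between this query and each feature row.
        scores = [sum(a * b for a, b in zip(q, f)) / math.sqrt(dim) for f in features]
        # Numerically stable softmax over the feature rows.
        peak = max(scores)
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        # Weighted average of the feature rows.
        outputs.append(
            [sum(w * f[j] for w, f in zip(weights, features)) for j in range(dim)]
        )
    return outputs


def pick_reference_face(face_pool, rng):
    """Assumed form of random face reference selection: a uniform draw from the pool."""
    return rng.choice(face_pool)


rng = random.Random(0)
DIM, NUM_QUERIES, NUM_PATCHES, POOL_SIZE = 4, 2, 5, 3

# Latent queries would be trained parameters; here they are only randomly initialized.
latent_queries = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_QUERIES)]

# Hypothetical pool: each entry is a NUM_PATCHES x DIM feature matrix for one face image.
face_pool = [
    [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_PATCHES)]
    for _ in range(POOL_SIZE)
]

reference = pick_reference_face(face_pool, rng)
id_embeddings = cross_attend(latent_queries, reference)
print(len(id_embeddings), "ID embedding vectors of size", len(id_embeddings[0]))
```

In this toy version each ID embedding is a weighted average of the reference face's feature rows, so the number of embeddings is fixed by the number of latent queries rather than by the size of the reference image.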
