Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Jan 30, 2024

Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu

Figure 1 for Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Figure 2 for Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Figure 3 for Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Figure 4 for Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Share this with someone who'll enjoy it:

Abstract:The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality labels, previous methods often suffer from limited realism and a lack of lexible conditioning. We address this challenge through a trilogy. We first introduce Generalized Neural Parametric Facial Asset (GNPFA), an efficient variational auto-encoder mapping facial geometry and images to a highly generalized expression latent space, decoupling expressions and identities. Then, we utilize GNPFA to extract high-quality expressions and accurate head poses from a large array of videos. This presents the M2F-D dataset, a large, diverse, and scan-level co-speech 3D facial animation dataset with well-annotated emotional and style labels. Finally, we propose Media2Face, a diffusion model in GNPFA latent space for co-speech facial animation generation, accepting rich multi-modality guidances from audio, text, and image. Extensive experiments demonstrate that our model not only achieves high fidelity in facial animation synthesis but also broadens the scope of expressiveness and style adaptability in 3D facial animation.

* Project Page: https://sites.google.com/view/media2face

View paper on

Share this with someone who'll enjoy it:

Title:Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Paper and Code