GMTalker: Gaussian Mixture based Emotional talking video Portraits

Dec 12, 2023

Yibo Xia, Lizhen Wang, Xiang Deng, Xiaoyan Luo, Yebin Liu

Abstract: Synthesizing high-fidelity, emotion-controllable talking video portraits with audio-lip sync, vivid expressions, realistic head poses, and eye blinks has been an important and challenging task in recent years. Most existing methods struggle to achieve personalized, precise emotion control, to interpolate continuously between different emotions, or to generate diverse motion. To address these problems, we present GMTalker, a Gaussian mixture based framework for generating emotional talking portraits. Specifically, we propose a Gaussian Mixture based Expression Generator (GMEG) that constructs a continuous and multi-modal latent space, enabling more flexible emotion manipulation. Furthermore, we introduce a normalizing flow based motion generator, pretrained on a dataset with a wide range of motion, to generate diverse motions. Finally, we propose a personalized emotion-guided head generator with an Emotion Mapping Network (EMN) that synthesizes high-fidelity and faithful emotional video portraits. Both quantitative and qualitative experiments demonstrate that our method outperforms previous methods in image quality, photo-realism, emotion accuracy, and motion diversity.
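
To make the idea of a continuous, multi-modal latent space more concrete, here is a minimal sketch of how a Gaussian mixture over emotions could support interpolation and diverse sampling. It is not the paper's GMEG: the 2-D latent space, the component means and standard deviations, and the names `COMPONENTS`, `blend_components`, and `sample_latent` are all assumptions made up for illustration.

```python
import random

# Hypothetical 2-D latent space with one diagonal Gaussian per emotion.
# All means and standard deviations are made-up illustration values,
# not parameters from the paper.
COMPONENTS = {
    "happy": {"mean": [2.0, 0.0], "std": [0.3, 0.3]},
    "sad": {"mean": [-2.0, 0.0], "std": [0.3, 0.3]},
    "angry": {"mean": [0.0, 2.0], "std": [0.3, 0.3]},
}


def blend_components(emotion_a, emotion_b, t):
    """Linearly interpolate the Gaussian parameters of two emotions, t in [0, 1]."""
    a, b = COMPONENTS[emotion_a], COMPONENTS[emotion_b]
    mean = [(1 - t) * ma + t * mb for ma, mb in zip(a["mean"], b["mean"])]
    std = [(1 - t) * sa + t * sb for sa, sb in zip(a["std"], b["std"])]
    return {"mean": mean, "std": std}


def sample_latent(component, rng):
    """Draw one latent code from a diagonal Gaussian component."""
    return [rng.gauss(m, s) for m, s in zip(component["mean"], component["std"])]


rng = random.Random(0)  # fixed seed so repeated runs draw the same codes
halfway = blend_components("happy", "sad", 0.5)
codes = [sample_latent(halfway, rng) for _ in range(3)]
```

Sliding `t` from 0 to 1 moves the sampling distribution smoothly from one emotion to the other, while repeated draws at a fixed `t` give different latent codes for the same emotion. In a real system a learned decoder would map each code to expression parameters; that step is omitted here.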

* Project page: https://bob35buaa.github.io/GMTalker
