Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Jun 20, 2023

Lianying Yin, Yijun Wang, Tianyu He, Jinming Liu, Wei Zhao, Bohan Li, Xin Jin, Jianxin Lin

Figure 1 for EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Figure 2 for EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Figure 3 for EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Figure 4 for EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Share this with someone who'll enjoy it:

Abstract:Although previous co-speech gesture generation methods are able to synthesize motions in line with speech content, it is still not enough to handle diverse and complicated motion distribution. The key challenges are: 1) the one-to-many nature between the speech content and gestures; 2) the correlation modeling between the body joints. In this paper, we present a novel framework (EMoG) to tackle the above challenges with denoising diffusion models: 1) To alleviate the one-to-many problem, we incorporate emotion clues to guide the generation process, making the generation much easier; 2) To model joint correlation, we propose to decompose the difficult gesture generation into two sub-problems: joint correlation modeling and temporal dynamics modeling. Then, the two sub-problems are explicitly tackled with our proposed Joint Correlation-aware transFormer (JCFormer). Through extensive evaluations, we demonstrate that our proposed method surpasses previous state-of-the-art approaches, offering substantial superiority in gesture synthesis.

* under review

View paper on

Share this with someone who'll enjoy it:

Title:EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Paper and Code