Talking Face Generation


Talking face generation is the task of synthesizing video of a person speaking, driven by an audio recording of their voice.

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Mar 24, 2025
Via arXiv

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation

Mar 20, 2025
Via arXiv

UniSync: A Unified Framework for Audio-Visual Synchronization

Mar 20, 2025
Via arXiv

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Mar 27, 2025
Via arXiv

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Mar 27, 2025
Via arXiv

EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

Mar 14, 2025
Via arXiv

Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait

Mar 17, 2025
Via arXiv

Removing Averaging: Personalized Lip-Sync Driven Characters Based on Identity Adapter

Mar 09, 2025
Via arXiv

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Mar 07, 2025
Via arXiv

Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion

Feb 11, 2025
Via arXiv