Picture for Kaisiyuan Wang

Kaisiyuan Wang

AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers

Add code
Mar 25, 2025
Viaarxiv icon

Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers

Add code
Mar 13, 2025
Viaarxiv icon

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

Add code
Oct 14, 2024
Figure 1 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 2 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 3 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 4 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Viaarxiv icon

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Add code
Aug 06, 2024
Figure 1 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 2 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 3 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 4 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Viaarxiv icon

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation

Add code
Feb 25, 2024
Viaarxiv icon

ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

Add code
Aug 17, 2023
Viaarxiv icon

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator

Add code
May 09, 2023
Viaarxiv icon

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation

Add code
Feb 14, 2023
Figure 1 for Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Figure 2 for Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Figure 3 for Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Figure 4 for Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Viaarxiv icon

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers

Add code
Dec 09, 2022
Figure 1 for Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Figure 2 for Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Figure 3 for Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Figure 4 for Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Viaarxiv icon

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Add code
Nov 22, 2022
Figure 1 for Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Figure 2 for Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Figure 3 for Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Figure 4 for Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Viaarxiv icon