Picture for Zilu Guo

Zilu Guo

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Add code
Nov 23, 2024
Figure 1 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 2 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 3 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 4 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Viaarxiv icon

DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation

Add code
Jun 06, 2024
Figure 1 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 2 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 3 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 4 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Viaarxiv icon

A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition

Add code
May 27, 2024
Viaarxiv icon

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Add code
May 24, 2024
Viaarxiv icon

Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning

Add code
Sep 17, 2023
Viaarxiv icon

Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement

Add code
Jun 14, 2023
Viaarxiv icon