Picture for Zilu Guo

Zilu Guo

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Add code
Mar 14, 2025
Viaarxiv icon

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Add code
Mar 13, 2025
Viaarxiv icon

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Add code
Nov 23, 2024
Figure 1 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 2 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 3 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 4 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Viaarxiv icon

DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation

Add code
Jun 06, 2024
Figure 1 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 2 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 3 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 4 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Viaarxiv icon

A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition

Add code
May 27, 2024
Viaarxiv icon

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Add code
May 24, 2024
Viaarxiv icon

Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning

Add code
Sep 17, 2023
Viaarxiv icon

Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement

Add code
Jun 14, 2023
Viaarxiv icon