Picture for Dan Bigioi

Dan Bigioi

LatentColorization: Latent Diffusion-Based Speaker Video Colorization

Add code
May 09, 2024
Viaarxiv icon

Synthetic Speaking Children -- Why We Need Them and How to Make Them

Add code
Nov 08, 2023
Viaarxiv icon

Speech Driven Video Editing via an Audio-Conditioned Diffusion Model

Add code
Jan 12, 2023
Figure 1 for Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Figure 2 for Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Figure 3 for Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Figure 4 for Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Viaarxiv icon

Can Self-Supervised Learning solve the problem of child speech recognition?

Add code
Apr 06, 2022
Figure 1 for Can Self-Supervised Learning solve the problem of child speech recognition?
Figure 2 for Can Self-Supervised Learning solve the problem of child speech recognition?
Figure 3 for Can Self-Supervised Learning solve the problem of child speech recognition?
Viaarxiv icon

A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis

Add code
Apr 04, 2022
Figure 1 for A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Figure 2 for A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Figure 3 for A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Figure 4 for A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Viaarxiv icon