Picture for Junseok Ahn

Junseok Ahn

VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis

Add code
Dec 26, 2024
Viaarxiv icon

VoxSim: A perceptual voice similarity dataset

Add code
Jul 26, 2024
Figure 1 for VoxSim: A perceptual voice similarity dataset
Figure 2 for VoxSim: A perceptual voice similarity dataset
Figure 3 for VoxSim: A perceptual voice similarity dataset
Figure 4 for VoxSim: A perceptual voice similarity dataset
Viaarxiv icon

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

Add code
May 16, 2024
Viaarxiv icon

SlowFast Network for Continuous Sign Language Recognition

Add code
Sep 21, 2023
Viaarxiv icon