Chng Eng Siong

Text-based Talking Video Editing with Cascaded Conditional Diffusion

Jul 20, 2024
Via arXiv

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

Sep 14, 2023
Via arXiv

Study of GANs for Noisy Speech Simulation from Clean Speech

May 21, 2023
Via arXiv

Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning

Jun 04, 2022
Via arXiv

Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model

Mar 22, 2022
Via arXiv

Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling

Oct 24, 2021
Via arXiv