Picture for Hieu-Thi Luong

Hieu-Thi Luong

NTU-NPU System for Voice Privacy 2024 Challenge

Add code
Oct 03, 2024
Figure 1 for NTU-NPU System for Voice Privacy 2024 Challenge
Figure 2 for NTU-NPU System for Voice Privacy 2024 Challenge
Figure 3 for NTU-NPU System for Voice Privacy 2024 Challenge
Figure 4 for NTU-NPU System for Voice Privacy 2024 Challenge
Viaarxiv icon

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection

Add code
Jun 25, 2024
Viaarxiv icon

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Add code
Oct 11, 2021
Figure 1 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 2 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 3 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 4 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Viaarxiv icon

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance

Add code
Jun 25, 2021
Figure 1 for Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Figure 2 for Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Figure 3 for Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Figure 4 for Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Viaarxiv icon

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion

Add code
Oct 08, 2020
Figure 1 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 2 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 3 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 4 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Viaarxiv icon

NAUTILUS: a Versatile Voice Cloning System

Add code
May 22, 2020
Figure 1 for NAUTILUS: a Versatile Voice Cloning System
Figure 2 for NAUTILUS: a Versatile Voice Cloning System
Figure 3 for NAUTILUS: a Versatile Voice Cloning System
Figure 4 for NAUTILUS: a Versatile Voice Cloning System
Viaarxiv icon

Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech

Add code
Sep 14, 2019
Figure 1 for Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Figure 2 for Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Figure 3 for Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Figure 4 for Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Viaarxiv icon

A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Add code
Jun 18, 2019
Figure 1 for A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Figure 2 for A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Figure 3 for A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Figure 4 for A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Viaarxiv icon

Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

Add code
Apr 07, 2019
Figure 1 for Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Figure 2 for Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Figure 3 for Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Figure 4 for Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Viaarxiv icon

Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems

Add code
Oct 01, 2018
Figure 1 for Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Figure 2 for Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Figure 3 for Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Figure 4 for Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Viaarxiv icon