Picture for Jae-Sung Bae

Jae-Sung Bae

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

Add code
Oct 05, 2023
Viaarxiv icon

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

Add code
Nov 06, 2022
Viaarxiv icon

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Add code
Jun 28, 2022
Figure 1 for Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Figure 2 for Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Figure 3 for Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Figure 4 for Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Viaarxiv icon

Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech

Add code
Apr 08, 2022
Figure 1 for Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech
Figure 2 for Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech
Figure 3 for Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech
Figure 4 for Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech
Viaarxiv icon

GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis

Add code
Jun 29, 2021
Figure 1 for GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
Figure 2 for GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
Figure 3 for GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
Figure 4 for GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
Viaarxiv icon

Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech

Add code
Jun 29, 2021
Figure 1 for Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech
Figure 2 for Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech
Figure 3 for Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech
Figure 4 for Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech
Viaarxiv icon

FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Add code
Jun 29, 2021
Figure 1 for FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Figure 2 for FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Figure 3 for FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Figure 4 for FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Viaarxiv icon

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music

Add code
Mar 04, 2021
Figure 1 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 2 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 3 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 4 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Viaarxiv icon