Picture for Thomas Merritt

Thomas Merritt

AE-Flow: AutoEncoder Normalizing Flow

Add code
Dec 27, 2023
Viaarxiv icon

Creating New Voices using Normalizing Flows

Add code
Dec 22, 2023
Figure 1 for Creating New Voices using Normalizing Flows
Figure 2 for Creating New Voices using Normalizing Flows
Figure 3 for Creating New Voices using Normalizing Flows
Figure 4 for Creating New Voices using Normalizing Flows
Viaarxiv icon

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Add code
Jul 31, 2023
Viaarxiv icon

Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows

Add code
Nov 10, 2022
Viaarxiv icon

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

Add code
Jul 04, 2022
Figure 1 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 2 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 3 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 4 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Viaarxiv icon

Expressive, Variable, and Controllable Duration Modelling in TTS

Add code
Jun 28, 2022
Figure 1 for Expressive, Variable, and Controllable Duration Modelling in TTS
Figure 2 for Expressive, Variable, and Controllable Duration Modelling in TTS
Figure 3 for Expressive, Variable, and Controllable Duration Modelling in TTS
Figure 4 for Expressive, Variable, and Controllable Duration Modelling in TTS
Viaarxiv icon

Text-free non-parallel many-to-many voice conversion using normalising flows

Add code
Mar 15, 2022
Figure 1 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 2 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 3 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 4 for Text-free non-parallel many-to-many voice conversion using normalising flows
Viaarxiv icon

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Add code
Jun 25, 2021
Figure 1 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 2 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 3 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 4 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Viaarxiv icon

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Add code
Apr 04, 2019
Figure 1 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 2 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 3 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 4 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Viaarxiv icon

Effect of data reduction on sequence-to-sequence neural TTS

Add code
Nov 23, 2018
Figure 1 for Effect of data reduction on sequence-to-sequence neural TTS
Figure 2 for Effect of data reduction on sequence-to-sequence neural TTS
Figure 3 for Effect of data reduction on sequence-to-sequence neural TTS
Figure 4 for Effect of data reduction on sequence-to-sequence neural TTS
Viaarxiv icon