Thomas Merritt

AE-Flow: AutoEncoder Normalizing Flow

Dec 27, 2023
Via arXiv

Creating New Voices using Normalizing Flows

Dec 22, 2023
Via arXiv

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Jul 31, 2023
Via arXiv

Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows

Nov 10, 2022
Via arXiv

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

Jul 04, 2022
Via arXiv

Expressive, Variable, and Controllable Duration Modelling in TTS

Jun 28, 2022
Via arXiv

Text-free non-parallel many-to-many voice conversion using normalising flows

Mar 15, 2022
Via arXiv

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Jun 25, 2021
Via arXiv

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Apr 04, 2019
Via arXiv

Effect of data reduction on sequence-to-sequence neural TTS

Nov 23, 2018
Via arXiv