Picture for Jan Melechovsky

Jan Melechovsky

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Add code
Oct 17, 2024
Figure 1 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 2 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 3 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 4 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Viaarxiv icon

MidiCaps -- A large-scale MIDI dataset with text captions

Add code
Jun 04, 2024
Viaarxiv icon

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training

Add code
Jun 03, 2024
Viaarxiv icon

Mustango: Toward Controllable Text-to-Music Generation

Add code
Nov 14, 2023
Viaarxiv icon

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder

Add code
Nov 07, 2022
Viaarxiv icon