Picture for Simon King

Simon King

Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?

Add code
Oct 25, 2024
Figure 1 for Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?
Figure 2 for Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?
Figure 3 for Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?
Figure 4 for Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?
Viaarxiv icon

Enabling Beam Search for Language Model-Based Text-to-Speech Synthesis

Add code
Aug 29, 2024
Viaarxiv icon

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Add code
Feb 02, 2024
Viaarxiv icon

Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing

Add code
Jun 02, 2023
Figure 1 for Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing
Figure 2 for Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing
Figure 3 for Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing
Figure 4 for Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing
Viaarxiv icon

Using a Large Language Model to Control Speaking Style for Expressive TTS

Add code
May 17, 2023
Viaarxiv icon

Ensemble prosody prediction for expressive speech synthesis

Add code
Apr 03, 2023
Viaarxiv icon

Do Prosody Transfer Models Transfer Prosody?

Add code
Mar 07, 2023
Viaarxiv icon

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Add code
Nov 13, 2022
Viaarxiv icon

Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

Add code
Jun 15, 2021
Figure 1 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 2 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 3 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 4 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Viaarxiv icon

ADEPT: A Dataset for Evaluating Prosody Transfer

Add code
Jun 15, 2021
Figure 1 for ADEPT: A Dataset for Evaluating Prosody Transfer
Figure 2 for ADEPT: A Dataset for Evaluating Prosody Transfer
Figure 3 for ADEPT: A Dataset for Evaluating Prosody Transfer
Viaarxiv icon