Picture for Brendan Shillingford

Brendan Shillingford

DeepMind

Imagen 3

Add code
Aug 13, 2024
Viaarxiv icon

The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning

Add code
Jun 06, 2024
Viaarxiv icon

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech

Add code
Nov 19, 2021
Figure 1 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 2 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 3 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 4 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Viaarxiv icon

Interactive decoding of words from visual speech recognition models

Add code
Jul 01, 2021
Figure 1 for Interactive decoding of words from visual speech recognition models
Figure 2 for Interactive decoding of words from visual speech recognition models
Figure 3 for Interactive decoding of words from visual speech recognition models
Figure 4 for Interactive decoding of words from visual speech recognition models
Viaarxiv icon

Large-scale multilingual audio visual dubbing

Add code
Nov 06, 2020
Figure 1 for Large-scale multilingual audio visual dubbing
Figure 2 for Large-scale multilingual audio visual dubbing
Figure 3 for Large-scale multilingual audio visual dubbing
Figure 4 for Large-scale multilingual audio visual dubbing
Viaarxiv icon

Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

Add code
Nov 08, 2019
Figure 1 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 2 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 3 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 4 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Viaarxiv icon

Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations

Add code
Oct 09, 2019
Figure 1 for Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations
Viaarxiv icon

Speech bandwidth extension with WaveNet

Add code
Jul 05, 2019
Figure 1 for Speech bandwidth extension with WaveNet
Figure 2 for Speech bandwidth extension with WaveNet
Figure 3 for Speech bandwidth extension with WaveNet
Viaarxiv icon

Large-Scale Visual Speech Recognition

Add code
Oct 01, 2018
Figure 1 for Large-Scale Visual Speech Recognition
Figure 2 for Large-Scale Visual Speech Recognition
Figure 3 for Large-Scale Visual Speech Recognition
Figure 4 for Large-Scale Visual Speech Recognition
Viaarxiv icon

Sample Efficient Adaptive Text-to-Speech

Add code
Sep 27, 2018
Figure 1 for Sample Efficient Adaptive Text-to-Speech
Figure 2 for Sample Efficient Adaptive Text-to-Speech
Figure 3 for Sample Efficient Adaptive Text-to-Speech
Figure 4 for Sample Efficient Adaptive Text-to-Speech
Viaarxiv icon