Picture for Antonio Bonafonte

Antonio Bonafonte

Controllable Emphasis with zero data for text-to-speech

Add code
Jul 13, 2023
Viaarxiv icon

Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue

Add code
Dec 07, 2022
Figure 1 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 2 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 3 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 4 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Viaarxiv icon

Distribution augmentation for low-resource expressive text-to-speech

Add code
Feb 19, 2022
Figure 1 for Distribution augmentation for low-resource expressive text-to-speech
Figure 2 for Distribution augmentation for low-resource expressive text-to-speech
Figure 3 for Distribution augmentation for low-resource expressive text-to-speech
Figure 4 for Distribution augmentation for low-resource expressive text-to-speech
Viaarxiv icon

Discrete acoustic space for an efficient sampling in neural text-to-speech

Add code
Oct 24, 2021
Figure 1 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 2 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 3 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 4 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Viaarxiv icon

Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Add code
Apr 15, 2021
Figure 1 for Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems
Figure 2 for Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems
Figure 3 for Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems
Figure 4 for Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems
Viaarxiv icon

Prosodic Phrase Alignment for Machine Dubbing

Add code
Aug 20, 2019
Figure 1 for Prosodic Phrase Alignment for Machine Dubbing
Figure 2 for Prosodic Phrase Alignment for Machine Dubbing
Figure 3 for Prosodic Phrase Alignment for Machine Dubbing
Figure 4 for Prosodic Phrase Alignment for Machine Dubbing
Viaarxiv icon

Towards Generalized Speech Enhancement with Generative Adversarial Networks

Add code
Apr 06, 2019
Figure 1 for Towards Generalized Speech Enhancement with Generative Adversarial Networks
Figure 2 for Towards Generalized Speech Enhancement with Generative Adversarial Networks
Figure 3 for Towards Generalized Speech Enhancement with Generative Adversarial Networks
Figure 4 for Towards Generalized Speech Enhancement with Generative Adversarial Networks
Viaarxiv icon

Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

Add code
Apr 06, 2019
Figure 1 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 2 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 3 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 4 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Viaarxiv icon

Self-Attention Linguistic-Acoustic Decoder

Add code
Nov 05, 2018
Figure 1 for Self-Attention Linguistic-Acoustic Decoder
Figure 2 for Self-Attention Linguistic-Acoustic Decoder
Figure 3 for Self-Attention Linguistic-Acoustic Decoder
Figure 4 for Self-Attention Linguistic-Acoustic Decoder
Viaarxiv icon

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks

Add code
Nov 05, 2018
Figure 1 for Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks
Figure 2 for Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks
Figure 3 for Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks
Figure 4 for Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks
Viaarxiv icon