Mateusz Lajszczak

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Feb 05, 2024
Via arXiv

Controllable Emphasis with zero data for text-to-speech

Jul 13, 2023
Via arXiv

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer

Jun 27, 2022
Via arXiv

Distribution augmentation for low-resource expressive text-to-speech

Feb 19, 2022
Via arXiv

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Jul 10, 2019
Via arXiv