Picture for Grzegorz Beringer

Grzegorz Beringer

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Add code
Feb 05, 2024
Viaarxiv icon

SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces

Add code
Jul 23, 2023
Viaarxiv icon

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

Add code
Jul 04, 2022
Figure 1 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 2 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 3 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 4 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Viaarxiv icon

Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows

Add code
Jun 10, 2021
Figure 1 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 2 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 3 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 4 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Viaarxiv icon

Detection of Lexical Stress Errors in Non-native English with Data Augmentation and Attention

Add code
Dec 29, 2020
Figure 1 for Detection of Lexical Stress Errors in Non-native  English with Data Augmentation and Attention
Figure 2 for Detection of Lexical Stress Errors in Non-native  English with Data Augmentation and Attention
Figure 3 for Detection of Lexical Stress Errors in Non-native  English with Data Augmentation and Attention
Figure 4 for Detection of Lexical Stress Errors in Non-native  English with Data Augmentation and Attention
Viaarxiv icon