Ismail Rasim Ulgen

Discrete Unit based Masking for Improving Disentanglement in Voice Conversion

Sep 17, 2024
Via arXiv

SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection

Aug 30, 2024
Via arXiv

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings

Jul 05, 2024
Via arXiv

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline

Jun 06, 2024
Via arXiv

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

Jan 19, 2024
Via arXiv