Matthew Baas

Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices

Oct 12, 2023

Disentanglement in a GAN for Unconditional Speech Synthesis

Jul 04, 2023

Voice Conversion With Just Nearest Neighbors

May 30, 2023

TransFusion: Transcribing Speech with Multinomial Diffusion

Oct 14, 2022

GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models

Oct 11, 2022

Voice Conversion Can Improve ASR in Very Low-Resource Settings

Nov 04, 2021

Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing

Aug 02, 2021

StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts

May 31, 2021