Picture for Kou Tanaka

Kou Tanaka

LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models

Add code
Sep 10, 2025
Viaarxiv icon

Vocoder-Projected Feature Discriminator

Add code
Aug 25, 2025
Viaarxiv icon

FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation

Add code
Aug 25, 2025
Viaarxiv icon

FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation

Add code
Sep 03, 2024
Figure 1 for FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Figure 2 for FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Figure 3 for FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Figure 4 for FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Viaarxiv icon

Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator

Add code
Mar 25, 2024
Viaarxiv icon

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

Add code
Aug 14, 2023
Viaarxiv icon

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis

Add code
Mar 24, 2023
Figure 1 for Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Figure 2 for Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Figure 3 for Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Figure 4 for Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Viaarxiv icon

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

Add code
Mar 04, 2022
Figure 1 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 2 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 3 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 4 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Viaarxiv icon

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion

Add code
Apr 14, 2021
Figure 1 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 2 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 3 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 4 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Viaarxiv icon

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames

Add code
Feb 25, 2021
Figure 1 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 2 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 3 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 4 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Viaarxiv icon