Picture for Kexin Zhao

Kexin Zhao

DiffWave: A Versatile Diffusion Model for Audio Synthesis

Add code
Sep 21, 2020
Figure 1 for DiffWave: A Versatile Diffusion Model for Audio Synthesis
Figure 2 for DiffWave: A Versatile Diffusion Model for Audio Synthesis
Figure 3 for DiffWave: A Versatile Diffusion Model for Audio Synthesis
Figure 4 for DiffWave: A Versatile Diffusion Model for Audio Synthesis
Viaarxiv icon

WaveFlow: A Compact Flow-based Model for Raw Audio

Add code
Jan 10, 2020
Figure 1 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 2 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 3 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 4 for WaveFlow: A Compact Flow-based Model for Raw Audio
Viaarxiv icon

Multi-Speaker End-to-End Speech Synthesis

Add code
Jul 09, 2019
Figure 1 for Multi-Speaker End-to-End Speech Synthesis
Figure 2 for Multi-Speaker End-to-End Speech Synthesis
Figure 3 for Multi-Speaker End-to-End Speech Synthesis
Figure 4 for Multi-Speaker End-to-End Speech Synthesis
Viaarxiv icon

Parallel Neural Text-to-Speech

Add code
Jun 05, 2019
Figure 1 for Parallel Neural Text-to-Speech
Figure 2 for Parallel Neural Text-to-Speech
Figure 3 for Parallel Neural Text-to-Speech
Figure 4 for Parallel Neural Text-to-Speech
Viaarxiv icon

Trace norm regularization and faster inference for embedded speech recognition RNNs

Add code
Feb 06, 2018
Figure 1 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 2 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 3 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 4 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Viaarxiv icon