Picture for Kainan Peng

Kainan Peng

VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing

Add code
Apr 11, 2024
Viaarxiv icon

Non-parallel Accent Conversion using Pseudo Siamese Disentanglement Network

Add code
Dec 12, 2022
Viaarxiv icon

WaveFlow: A Compact Flow-based Model for Raw Audio

Add code
Jan 10, 2020
Figure 1 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 2 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 3 for WaveFlow: A Compact Flow-based Model for Raw Audio
Figure 4 for WaveFlow: A Compact Flow-based Model for Raw Audio
Viaarxiv icon

Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework

Add code
Nov 07, 2019
Figure 1 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 2 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 3 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 4 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Viaarxiv icon

Multi-Speaker End-to-End Speech Synthesis

Add code
Jul 09, 2019
Figure 1 for Multi-Speaker End-to-End Speech Synthesis
Figure 2 for Multi-Speaker End-to-End Speech Synthesis
Figure 3 for Multi-Speaker End-to-End Speech Synthesis
Figure 4 for Multi-Speaker End-to-End Speech Synthesis
Viaarxiv icon

Parallel Neural Text-to-Speech

Add code
Jun 05, 2019
Figure 1 for Parallel Neural Text-to-Speech
Figure 2 for Parallel Neural Text-to-Speech
Figure 3 for Parallel Neural Text-to-Speech
Figure 4 for Parallel Neural Text-to-Speech
Viaarxiv icon

Neural Voice Cloning with a Few Samples

Add code
Oct 12, 2018
Figure 1 for Neural Voice Cloning with a Few Samples
Figure 2 for Neural Voice Cloning with a Few Samples
Figure 3 for Neural Voice Cloning with a Few Samples
Figure 4 for Neural Voice Cloning with a Few Samples
Viaarxiv icon

ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech

Add code
Jul 30, 2018
Figure 1 for ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Figure 2 for ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Figure 3 for ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Figure 4 for ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Viaarxiv icon

Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning

Add code
Feb 22, 2018
Figure 1 for Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Figure 2 for Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Figure 3 for Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Figure 4 for Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Viaarxiv icon

Deep Voice 2: Multi-Speaker Neural Text-to-Speech

Add code
Sep 20, 2017
Figure 1 for Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Figure 2 for Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Figure 3 for Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Figure 4 for Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Viaarxiv icon