Picture for Yanyao Bian

Yanyao Bian

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

Add code
Sep 25, 2023
Viaarxiv icon

SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias

Add code
Sep 14, 2023
Viaarxiv icon

Automatic Prosody Annotation with Pre-Trained Text-Speech Model

Add code
Jun 16, 2022
Figure 1 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 2 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 3 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 4 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Viaarxiv icon

Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis

Add code
Apr 03, 2022
Figure 1 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 2 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 3 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 4 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Viaarxiv icon

Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis

Add code
Apr 04, 2019
Figure 1 for Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis
Figure 2 for Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis
Figure 3 for Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis
Figure 4 for Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis
Viaarxiv icon