Picture for Zengqiang Shang

Zengqiang Shang

SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset

Add code
Oct 16, 2024
Figure 1 for SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset
Figure 2 for SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset
Figure 3 for SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset
Figure 4 for SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset
Viaarxiv icon

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Add code
Jul 07, 2024
Figure 1 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 2 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 3 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 4 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Viaarxiv icon

Enhancing Spoofing Speech Detection Using Rhythm Information

Add code
Oct 18, 2023
Viaarxiv icon

One-Class Knowledge Distillation for Spoofing Speech Detection

Add code
Sep 15, 2023
Viaarxiv icon

Improving Short Utterance Anti-Spoofing with AASIST2

Add code
Sep 15, 2023
Viaarxiv icon

Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder

Add code
Sep 02, 2023
Viaarxiv icon