Picture for Qibing Bai

Qibing Bai

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Add code
Jul 22, 2024
Viaarxiv icon

Autoregressive Diffusion Transformer for Text-to-Speech Synthesis

Add code
Jun 08, 2024
Viaarxiv icon

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

Add code
Sep 27, 2023
Viaarxiv icon

A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis

Add code
Aug 03, 2022
Figure 1 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 2 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 3 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 4 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Viaarxiv icon

Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation

Add code
May 18, 2022
Figure 1 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 2 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 3 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 4 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Viaarxiv icon

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Add code
Mar 29, 2022
Figure 1 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 2 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 3 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 4 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Viaarxiv icon