Binbin Zhang

TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch

Dec 12, 2024

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

Sep 09, 2024

HydraFormer: One Encoder For All Subsampling Rates

Aug 08, 2024

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

Jun 11, 2024

AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection

Jun 11, 2024

U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF

Apr 25, 2024

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Jan 07, 2024

The GUA-Speech System Description for CNVSRC Challenge 2023

Dec 12, 2023

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Oct 07, 2023

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech

Aug 31, 2023