Picture for Binbin Zhang

Binbin Zhang

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

Add code
Sep 09, 2024
Viaarxiv icon

HydraFormer: One Encoder For All Subsampling Rates

Add code
Aug 08, 2024
Viaarxiv icon

AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection

Add code
Jun 11, 2024
Viaarxiv icon

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

Add code
Jun 11, 2024
Figure 1 for WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
Figure 2 for WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
Figure 3 for WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
Figure 4 for WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
Viaarxiv icon

U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF

Add code
Apr 25, 2024
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Viaarxiv icon

The GUA-Speech System Description for CNVSRC Challenge 2023

Add code
Dec 12, 2023
Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Add code
Oct 07, 2023
Viaarxiv icon

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech

Add code
Aug 31, 2023
Viaarxiv icon

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs

Add code
May 18, 2023
Viaarxiv icon