Zhuoyuan Yao

TESSP: Text-Enhanced Self-Supervised Speech Pre-training

Nov 24, 2022
Via arXiv

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

Sep 30, 2022
Via arXiv

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

Mar 29, 2022
Via arXiv

Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR

Apr 10, 2021
Via arXiv

WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit

Feb 02, 2021
Via arXiv

Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition

Dec 10, 2020
Via arXiv

Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter

Nov 17, 2020
Via arXiv