Picture for Shaoshi Ling

Shaoshi Ling

Efficient Long-Form Speech Recognition for General Speech In-Context Learning

Add code
Sep 29, 2024
Figure 1 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 2 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 3 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 4 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Viaarxiv icon

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

Add code
Sep 14, 2023
Figure 1 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 2 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 3 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 4 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Viaarxiv icon

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Add code
Aug 03, 2023
Figure 1 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 2 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 3 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 4 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Viaarxiv icon

Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask

Add code
Oct 08, 2021
Figure 1 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 2 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 3 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 4 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Viaarxiv icon

DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization

Add code
Dec 11, 2020
Figure 1 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 2 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 3 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 4 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Viaarxiv icon

Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition

Add code
Dec 03, 2019
Figure 1 for Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Figure 2 for Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Figure 3 for Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Figure 4 for Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Viaarxiv icon

Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition

Add code
Jun 30, 2019
Figure 1 for Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition
Figure 2 for Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition
Figure 3 for Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition
Figure 4 for Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition
Viaarxiv icon