David Qiu

Apple Intelligence Foundation Language Models

Jul 29, 2024
Via arXiv

USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

Jan 03, 2024
Via arXiv

Partial Rewriting for Multi-Stage ASR

Dec 08, 2023
Via arXiv

2-bit Conformer Quantization for Automatic Speech Recognition

May 26, 2023
Via arXiv

RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

May 24, 2023
Via arXiv

Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning

Oct 13, 2021
Via arXiv

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

Oct 07, 2021
Via arXiv

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

Apr 26, 2021
Via arXiv

Learning Word-Level Confidence For Subword End-to-End ASR

Mar 11, 2021
Via arXiv

Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

Oct 23, 2020
Via arXiv