Picture for Erik McDermott

Erik McDermott

Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models

Add code
Aug 23, 2024
Viaarxiv icon

Optimizing Byte-level Representation for End-to-end ASR

Add code
Jun 14, 2024
Viaarxiv icon

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Add code
May 24, 2024
Figure 1 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 2 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 3 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 4 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Viaarxiv icon

Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation

Add code
Nov 29, 2022
Viaarxiv icon

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

Add code
Nov 02, 2022
Viaarxiv icon

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

Add code
Feb 28, 2020
Figure 1 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 2 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 3 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 4 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Viaarxiv icon

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

Add code
Feb 14, 2020
Figure 1 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 2 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 3 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 4 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Viaarxiv icon