Picture for Ehsan Variani

Ehsan Variani

LAST: Scalable Lattice-Based Speech Modelling in JAX

Add code
Apr 25, 2023
Viaarxiv icon

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Add code
Feb 16, 2023
Viaarxiv icon

Alignment Entropy Regularization

Add code
Dec 22, 2022
Figure 1 for Alignment Entropy Regularization
Figure 2 for Alignment Entropy Regularization
Figure 3 for Alignment Entropy Regularization
Figure 4 for Alignment Entropy Regularization
Viaarxiv icon

Modular Hybrid Autoregressive Transducer

Add code
Oct 31, 2022
Viaarxiv icon

UserLibri: A Dataset for ASR Personalization Using Only Text

Add code
Jul 02, 2022
Figure 1 for UserLibri: A Dataset for ASR Personalization Using Only Text
Figure 2 for UserLibri: A Dataset for ASR Personalization Using Only Text
Figure 3 for UserLibri: A Dataset for ASR Personalization Using Only Text
Figure 4 for UserLibri: A Dataset for ASR Personalization Using Only Text
Viaarxiv icon

Global Normalization for Streaming Speech Recognition in a Modular Framework

Add code
May 26, 2022
Figure 1 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 2 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 3 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 4 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Viaarxiv icon

Improving Rare Word Recognition with LM-aware MWER Training

Add code
Apr 15, 2022
Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon

Cascaded encoders for unifying streaming and non-streaming ASR

Add code
Oct 27, 2020
Figure 1 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 2 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 3 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 4 for Cascaded encoders for unifying streaming and non-streaming ASR
Viaarxiv icon

Hybrid Autoregressive Transducer (hat)

Add code
Mar 12, 2020
Figure 1 for Hybrid Autoregressive Transducer (hat)
Figure 2 for Hybrid Autoregressive Transducer (hat)
Figure 3 for Hybrid Autoregressive Transducer (hat)
Figure 4 for Hybrid Autoregressive Transducer (hat)
Viaarxiv icon

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

Add code
Feb 28, 2020
Figure 1 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 2 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 3 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 4 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Viaarxiv icon