Ehsan Variani

LAST: Scalable Lattice-Based Speech Modelling in JAX

Apr 25, 2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Feb 16, 2023

Alignment Entropy Regularization

Dec 22, 2022

Modular Hybrid Autoregressive Transducer

Oct 31, 2022

UserLibri: A Dataset for ASR Personalization Using Only Text

Jul 02, 2022

Global Normalization for Streaming Speech Recognition in a Modular Framework

May 26, 2022

Improving Rare Word Recognition with LM-aware MWER Training

Apr 15, 2022

Cascaded encoders for unifying streaming and non-streaming ASR

Oct 27, 2020

Hybrid Autoregressive Transducer (hat)

Mar 12, 2020

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

Feb 28, 2020