Ruchir Travadi

Optimizing Byte-level Representation for End-to-end ASR

Jun 14, 2024
Via arXiv

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

Oct 16, 2023
Via arXiv

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

Nov 02, 2022
Via arXiv

Online Automatic Speech Recognition with Listen, Attend and Spell Model

Aug 12, 2020
Via arXiv

Multimodal Representation Learning using Deep Multiset Canonical Correlation

Apr 03, 2019
Via arXiv