Picture for Tara N. Sainath

Tara N. Sainath

Google Inc. USA

Text Injection for Neural Contextual Biasing

Add code
Jun 05, 2024
Figure 1 for Text Injection for Neural Contextual Biasing
Figure 2 for Text Injection for Neural Contextual Biasing
Figure 3 for Text Injection for Neural Contextual Biasing
Figure 4 for Text Injection for Neural Contextual Biasing
Viaarxiv icon

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Feb 27, 2024
Viaarxiv icon

Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation

Add code
Feb 20, 2024
Figure 1 for Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation
Figure 2 for Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation
Figure 3 for Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation
Figure 4 for Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation
Viaarxiv icon

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Add code
Jan 23, 2024
Figure 1 for Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
Figure 2 for Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
Figure 3 for Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
Figure 4 for Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
Viaarxiv icon

Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR

Add code
Jan 17, 2024
Viaarxiv icon

USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

Add code
Jan 03, 2024
Viaarxiv icon

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

Add code
Dec 18, 2023
Viaarxiv icon

Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

Add code
Aug 14, 2023
Figure 1 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 2 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 3 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 4 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Viaarxiv icon

Improving Joint Speech-Text Representations Without Alignment

Add code
Aug 11, 2023
Figure 1 for Improving Joint Speech-Text Representations Without Alignment
Figure 2 for Improving Joint Speech-Text Representations Without Alignment
Figure 3 for Improving Joint Speech-Text Representations Without Alignment
Figure 4 for Improving Joint Speech-Text Representations Without Alignment
Viaarxiv icon

How to Estimate Model Transferability of Pre-Trained Speech Models?

Add code
Jun 01, 2023
Viaarxiv icon