Picture for Tara Sainath

Tara Sainath

Google

Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models

Add code
Mar 25, 2024
Figure 1 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 2 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 3 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 4 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Add code
Sep 29, 2023
Figure 1 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 2 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 3 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Viaarxiv icon

Massive End-to-end Models for Short Search Queries

Add code
Sep 22, 2023
Figure 1 for Massive End-to-end Models for Short Search Queries
Figure 2 for Massive End-to-end Models for Short Search Queries
Figure 3 for Massive End-to-end Models for Short Search Queries
Figure 4 for Massive End-to-end Models for Short Search Queries
Viaarxiv icon

Improving Speech Recognition for African American English With Audio Classification

Add code
Sep 16, 2023
Viaarxiv icon

Augmenting conformers with structured state space models for online speech recognition

Add code
Sep 15, 2023
Figure 1 for Augmenting conformers with structured state space models for online speech recognition
Figure 2 for Augmenting conformers with structured state space models for online speech recognition
Figure 3 for Augmenting conformers with structured state space models for online speech recognition
Figure 4 for Augmenting conformers with structured state space models for online speech recognition
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale

Add code
Apr 19, 2023
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Viaarxiv icon