Picture for Zhong Meng

Zhong Meng

Fred

Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions

Add code
Jun 20, 2024
Figure 1 for Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Figure 2 for Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Figure 3 for Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Figure 4 for Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Viaarxiv icon

Text Injection for Neural Contextual Biasing

Add code
Jun 05, 2024
Figure 1 for Text Injection for Neural Contextual Biasing
Figure 2 for Text Injection for Neural Contextual Biasing
Figure 3 for Text Injection for Neural Contextual Biasing
Figure 4 for Text Injection for Neural Contextual Biasing
Viaarxiv icon

Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping

Add code
Jun 04, 2024
Viaarxiv icon

Deferred NAM: Low-latency Top-K Context Injection via DeferredContext Encoding for Non-Streaming ASR

Add code
Apr 15, 2024
Viaarxiv icon

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Feb 27, 2024
Viaarxiv icon

SLM: Bridge the thin gap between speech and text foundation models

Add code
Sep 30, 2023
Figure 1 for SLM: Bridge the thin gap between speech and text foundation models
Figure 2 for SLM: Bridge the thin gap between speech and text foundation models
Figure 3 for SLM: Bridge the thin gap between speech and text foundation models
Figure 4 for SLM: Bridge the thin gap between speech and text foundation models
Viaarxiv icon

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Add code
Sep 29, 2023
Figure 1 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 2 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 3 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Viaarxiv icon

Massive End-to-end Models for Short Search Queries

Add code
Sep 22, 2023
Figure 1 for Massive End-to-end Models for Short Search Queries
Figure 2 for Massive End-to-end Models for Short Search Queries
Figure 3 for Massive End-to-end Models for Short Search Queries
Figure 4 for Massive End-to-end Models for Short Search Queries
Viaarxiv icon

Augmenting conformers with structured state space models for online speech recognition

Add code
Sep 15, 2023
Figure 1 for Augmenting conformers with structured state space models for online speech recognition
Figure 2 for Augmenting conformers with structured state space models for online speech recognition
Figure 3 for Augmenting conformers with structured state space models for online speech recognition
Figure 4 for Augmenting conformers with structured state space models for online speech recognition
Viaarxiv icon

Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

Add code
Aug 14, 2023
Figure 1 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 2 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 3 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Figure 4 for Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Viaarxiv icon