Picture for Golan Pundak

Golan Pundak

Deferred NAM: Low-latency Top-K Context Injection via DeferredContext Encoding for Non-Streaming ASR

Add code
Apr 15, 2024
Viaarxiv icon

SLM: Bridge the thin gap between speech and text foundation models

Add code
Sep 30, 2023
Figure 1 for SLM: Bridge the thin gap between speech and text foundation models
Figure 2 for SLM: Bridge the thin gap between speech and text foundation models
Figure 3 for SLM: Bridge the thin gap between speech and text foundation models
Figure 4 for SLM: Bridge the thin gap between speech and text foundation models
Viaarxiv icon

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Add code
Sep 29, 2023
Figure 1 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 2 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Figure 3 for Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Viaarxiv icon

Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion

Add code
May 19, 2020
Figure 1 for Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
Figure 2 for Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
Figure 3 for Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
Figure 4 for Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Mar 28, 2020
Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models

Add code
Jul 22, 2019
Figure 1 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 2 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 3 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 4 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Viaarxiv icon

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Add code
Feb 21, 2019
Figure 1 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 2 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 3 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Viaarxiv icon

Streaming End-to-end Speech Recognition For Mobile Devices

Add code
Nov 15, 2018
Figure 1 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 2 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 3 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 4 for Streaming End-to-end Speech Recognition For Mobile Devices
Viaarxiv icon

Contextual Speech Recognition with Difficult Negative Training Examples

Add code
Oct 29, 2018
Figure 1 for Contextual Speech Recognition with Difficult Negative Training Examples
Figure 2 for Contextual Speech Recognition with Difficult Negative Training Examples
Figure 3 for Contextual Speech Recognition with Difficult Negative Training Examples
Figure 4 for Contextual Speech Recognition with Difficult Negative Training Examples
Viaarxiv icon

Toward domain-invariant speech recognition via large scale training

Add code
Aug 16, 2018
Figure 1 for Toward domain-invariant speech recognition via large scale training
Figure 2 for Toward domain-invariant speech recognition via large scale training
Figure 3 for Toward domain-invariant speech recognition via large scale training
Figure 4 for Toward domain-invariant speech recognition via large scale training
Viaarxiv icon