W. Ronny Huang

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Jan 23, 2024
Via arXiv

Large-scale Language Model Rescoring on Long-form Data

Jun 13, 2023
Via arXiv

Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR

May 28, 2023
Via arXiv

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

Nov 28, 2022
Via arXiv

Modular Hybrid Autoregressive Transducer

Oct 31, 2022
Via arXiv

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Apr 22, 2022
Via arXiv

Detecting Unintended Memorization in Language-Model-Fused ASR

Apr 20, 2022
Via arXiv

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Mar 09, 2022
Via arXiv

Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model

Feb 16, 2022
Via arXiv

Scaling End-to-End Models for Large-Scale Multilingual ASR

Apr 30, 2021
Via arXiv