Picture for Vineel Pratap

Vineel Pratap

Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking

Add code
Sep 27, 2024
Figure 1 for Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking
Figure 2 for Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking
Figure 3 for Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking
Figure 4 for Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking
Viaarxiv icon

Scaling A Simple Approach to Zero-Shot Speech Recognition

Add code
Jul 25, 2024
Figure 1 for Scaling A Simple Approach to Zero-Shot Speech Recognition
Figure 2 for Scaling A Simple Approach to Zero-Shot Speech Recognition
Figure 3 for Scaling A Simple Approach to Zero-Shot Speech Recognition
Figure 4 for Scaling A Simple Approach to Zero-Shot Speech Recognition
Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Viaarxiv icon

Scaling Speech Technology to 1,000+ Languages

Add code
May 22, 2023
Viaarxiv icon

Flashlight: Enabling Innovation in Tools for Machine Learning

Add code
Jan 29, 2022
Figure 1 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 2 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 3 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 4 for Flashlight: Enabling Innovation in Tools for Machine Learning
Viaarxiv icon

Star Temporal Classification: Sequence Classification with Partially Labeled Data

Add code
Jan 28, 2022
Figure 1 for Star Temporal Classification: Sequence Classification with Partially Labeled Data
Figure 2 for Star Temporal Classification: Sequence Classification with Partially Labeled Data
Figure 3 for Star Temporal Classification: Sequence Classification with Partially Labeled Data
Figure 4 for Star Temporal Classification: Sequence Classification with Partially Labeled Data
Viaarxiv icon

Word Order Does Not Matter For Speech Recognition

Add code
Oct 18, 2021
Figure 1 for Word Order Does Not Matter For Speech Recognition
Figure 2 for Word Order Does Not Matter For Speech Recognition
Figure 3 for Word Order Does Not Matter For Speech Recognition
Figure 4 for Word Order Does Not Matter For Speech Recognition
Viaarxiv icon

Parallel Composition of Weighted Finite-State Transducers

Add code
Oct 06, 2021
Figure 1 for Parallel Composition of Weighted Finite-State Transducers
Figure 2 for Parallel Composition of Weighted Finite-State Transducers
Figure 3 for Parallel Composition of Weighted Finite-State Transducers
Figure 4 for Parallel Composition of Weighted Finite-State Transducers
Viaarxiv icon

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training

Add code
Apr 02, 2021
Figure 1 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 2 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 3 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 4 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Viaarxiv icon

MLS: A Large-Scale Multilingual Dataset for Speech Research

Add code
Dec 19, 2020
Figure 1 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 2 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 3 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 4 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Viaarxiv icon