Picture for Desh Raj

Desh Raj

M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

Add code
Sep 17, 2024
Figure 1 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 2 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 3 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 4 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Viaarxiv icon

Faster Speech-LLaMA Inference with Multi-token Prediction

Add code
Sep 12, 2024
Viaarxiv icon

Listening to Multi-talker Conversations: Modular and End-to-end Perspectives

Add code
Feb 14, 2024
Viaarxiv icon

On Speaker Attribution with SURT

Add code
Jan 28, 2024
Figure 1 for On Speaker Attribution with SURT
Figure 2 for On Speaker Attribution with SURT
Figure 3 for On Speaker Attribution with SURT
Figure 4 for On Speaker Attribution with SURT
Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Sep 26, 2023
Viaarxiv icon

Updated Corpora and Benchmarks for Long-Form Speech Recognition

Add code
Sep 26, 2023
Figure 1 for Updated Corpora and Benchmarks for Long-Form Speech Recognition
Figure 2 for Updated Corpora and Benchmarks for Long-Form Speech Recognition
Figure 3 for Updated Corpora and Benchmarks for Long-Form Speech Recognition
Figure 4 for Updated Corpora and Benchmarks for Long-Form Speech Recognition
Viaarxiv icon

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices

Add code
Sep 18, 2023
Figure 1 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 2 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 3 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 4 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Viaarxiv icon

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Add code
Jul 14, 2023
Viaarxiv icon

SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition

Add code
Jun 18, 2023
Viaarxiv icon

GPU-accelerated Guided Source Separation for Meeting Transcription

Add code
Dec 10, 2022
Viaarxiv icon