Pranav Dheram

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Mar 28, 2024

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Jan 26, 2024

Mining Duplicate Questions of Stack Overflow

Oct 04, 2022

Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

Jul 22, 2022

End-to-End Spoken Language Understanding using RNN-Transducer ASR

Jul 08, 2021

Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding

Feb 12, 2021

Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces

Aug 14, 2020