Prashanth Gurunath Shivakumar

Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Nov 04, 2024
Via arXiv

Speech Recognition Rescoring with Large Speech-Text Foundation Models

Sep 25, 2024
Via arXiv

Multi-Modal Retrieval For Large Language Model Based Speech Recognition

Jun 13, 2024
Via arXiv

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Jan 17, 2024
Via arXiv

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

Jan 05, 2024
Via arXiv

Discriminative Speech Recognition Rescoring with Pre-trained Language Models

Oct 10, 2023
Via arXiv

Personalization for BERT-based Discriminative Speech Recognition Rescoring

Jul 13, 2023
Via arXiv

Scaling Laws for Discriminative Speech Recognition Rescoring Models

Jun 27, 2023
Via arXiv

Distillation Strategies for Discriminative Speech Recognition Rescoring

Jun 15, 2023
Via arXiv

Phone Duration Modeling for Speaker Age Estimation in Children

Sep 03, 2021
Via arXiv