Picture for Dhanush Bekal

Dhanush Bekal

SpeechVerse: A Large-scale Generalizable Audio Language Model

Add code
May 14, 2024
Figure 1 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 2 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 3 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 4 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Viaarxiv icon

Device Directedness with Contextual Cues for Spoken Dialog Systems

Add code
Nov 23, 2022
Viaarxiv icon

Remember the context! ASR slot error correction through memorization

Add code
Sep 18, 2021
Figure 1 for Remember the context! ASR slot error correction through memorization
Figure 2 for Remember the context! ASR slot error correction through memorization
Figure 3 for Remember the context! ASR slot error correction through memorization
Figure 4 for Remember the context! ASR slot error correction through memorization
Viaarxiv icon

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

Add code
Aug 03, 2020
Figure 1 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 2 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 3 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 4 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Viaarxiv icon

Text Generation from Knowledge Graphs with Graph Transformers

Add code
May 18, 2019
Figure 1 for Text Generation from Knowledge Graphs with Graph Transformers
Figure 2 for Text Generation from Knowledge Graphs with Graph Transformers
Figure 3 for Text Generation from Knowledge Graphs with Graph Transformers
Figure 4 for Text Generation from Knowledge Graphs with Graph Transformers
Viaarxiv icon