Picture for Sundararajan Srinivasan

Sundararajan Srinivasan

Meta-Learning Adaptable Foundation Models

Add code
Oct 29, 2024
Viaarxiv icon

CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation

Add code
Oct 03, 2024
Figure 1 for CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Figure 2 for CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Figure 3 for CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Figure 4 for CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Viaarxiv icon

Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

Add code
Jun 26, 2024
Viaarxiv icon

AG-LSEC: Audio Grounded Lexical Speaker Error Correction

Add code
Jun 25, 2024
Viaarxiv icon

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Add code
May 14, 2024
Viaarxiv icon

SpeechVerse: A Large-scale Generalizable Audio Language Model

Add code
May 14, 2024
Figure 1 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 2 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 3 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 4 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Viaarxiv icon

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Add code
Nov 01, 2023
Figure 1 for End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Figure 2 for End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Figure 3 for End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Figure 4 for End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Viaarxiv icon

Speaker Diarization of Scripted Audiovisual Content

Add code
Aug 04, 2023
Viaarxiv icon

Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction

Add code
Jun 15, 2023
Viaarxiv icon

Device Directedness with Contextual Cues for Spoken Dialog Systems

Add code
Nov 23, 2022
Viaarxiv icon