Sundararajan Srinivasan

SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models

Jan 14, 2025, via arXiv

Meta-Learning Adaptable Foundation Models

Oct 29, 2024, via arXiv

CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation

Oct 03, 2024, via arXiv

Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

Jun 26, 2024, via arXiv

AG-LSEC: Audio Grounded Lexical Speaker Error Correction

Jun 25, 2024, via arXiv

SpeechVerse: A Large-scale Generalizable Audio Language Model

May 14, 2024, via arXiv

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

May 14, 2024, via arXiv

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Nov 01, 2023, via arXiv

Speaker Diarization of Scripted Audiovisual Content

Aug 04, 2023, via arXiv

Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction

Jun 15, 2023, via arXiv