Picture for Rita Singh

Rita Singh

What Do Speech Foundation Models Not Learn About Speech?

Add code
Oct 16, 2024
Viaarxiv icon

Objective Measurements of Voice Quality

Add code
Oct 12, 2024
Figure 1 for Objective Measurements of Voice Quality
Figure 2 for Objective Measurements of Voice Quality
Figure 3 for Objective Measurements of Voice Quality
Viaarxiv icon

Improving Speaker Representations Using Contrastive Losses on Multi-scale Features

Add code
Oct 07, 2024
Viaarxiv icon

Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection

Add code
Oct 04, 2024
Viaarxiv icon

PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification

Add code
Sep 09, 2024
Viaarxiv icon

Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?

Add code
Aug 12, 2024
Figure 1 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 2 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 3 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 4 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Viaarxiv icon

Audio Entailment: Assessing Deductive Reasoning for Audio Understanding

Add code
Jul 25, 2024
Viaarxiv icon

SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios

Add code
Jul 22, 2024
Viaarxiv icon

Krait: A Backdoor Attack Against Graph Prompt Tuning

Add code
Jul 18, 2024
Viaarxiv icon

ControlVAR: Exploring Controllable Visual Autoregressive Modeling

Add code
Jun 14, 2024
Figure 1 for ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Figure 2 for ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Figure 3 for ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Figure 4 for ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Viaarxiv icon