Picture for Nancy F. Chen

Nancy F. Chen

Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models

Add code
Nov 01, 2024
Viaarxiv icon

Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models

Add code
Sep 27, 2024
Figure 1 for Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Figure 2 for Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Figure 3 for Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Figure 4 for Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Viaarxiv icon

Semi-supervised Learning For Robust Speech Evaluation

Add code
Sep 23, 2024
Viaarxiv icon

Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization

Add code
Sep 16, 2024
Figure 1 for Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
Figure 2 for Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
Figure 3 for Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
Figure 4 for Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
Viaarxiv icon

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders

Add code
Sep 10, 2024
Figure 1 for MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Figure 2 for MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Figure 3 for MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Figure 4 for MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Viaarxiv icon

MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues

Add code
Aug 26, 2024
Viaarxiv icon

LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs

Add code
Aug 16, 2024
Viaarxiv icon

PRESENT: Zero-Shot Text-to-Prosody Control

Add code
Aug 13, 2024
Viaarxiv icon

TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations

Add code
Jul 02, 2024
Viaarxiv icon

AudioBench: A Universal Benchmark for Audio Large Language Models

Add code
Jun 25, 2024
Viaarxiv icon