Picture for Chun-Yi Kuan

Chun-Yi Kuan

Gender Bias in Instruction-Guided Speech Synthesis Models

Add code
Feb 08, 2025
Viaarxiv icon

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Add code
Nov 11, 2024
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning

Add code
Oct 21, 2024
Viaarxiv icon

Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation

Add code
Jul 13, 2024
Figure 1 for Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Figure 2 for Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Figure 3 for Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Figure 4 for Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Viaarxiv icon

Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

Add code
Jul 09, 2024
Viaarxiv icon

Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course

Add code
Jul 07, 2024
Figure 1 for Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Figure 2 for Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Figure 3 for Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Figure 4 for Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Viaarxiv icon

Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models

Add code
Jun 12, 2024
Figure 1 for Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Figure 2 for Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Figure 3 for Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Figure 4 for Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Viaarxiv icon

Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

Add code
Dec 30, 2023
Viaarxiv icon

Towards General-Purpose Text-Instruction-Guided Voice Conversion

Add code
Sep 25, 2023
Figure 1 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 2 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 3 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 4 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Viaarxiv icon