Picture for Hung-yi Lee

Hung-yi Lee

CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset

Add code
Jan 14, 2025
Viaarxiv icon

Spectral-Aware Low-Rank Adaptation for Speaker Verification

Add code
Jan 07, 2025
Figure 1 for Spectral-Aware Low-Rank Adaptation for Speaker Verification
Figure 2 for Spectral-Aware Low-Rank Adaptation for Speaker Verification
Figure 3 for Spectral-Aware Low-Rank Adaptation for Speaker Verification
Figure 4 for Spectral-Aware Low-Rank Adaptation for Speaker Verification
Viaarxiv icon

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

Add code
Jan 07, 2025
Figure 1 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 2 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 3 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 4 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Viaarxiv icon

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Add code
Dec 27, 2024
Viaarxiv icon

Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling

Add code
Dec 21, 2024
Viaarxiv icon

How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario

Add code
Nov 27, 2024
Figure 1 for How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario
Figure 2 for How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario
Figure 3 for How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario
Figure 4 for How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario
Viaarxiv icon

Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition

Add code
Nov 27, 2024
Figure 1 for Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
Figure 2 for Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
Figure 3 for Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
Figure 4 for Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
Viaarxiv icon

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Add code
Nov 11, 2024
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Add code
Nov 04, 2024
Figure 1 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Figure 2 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Figure 3 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Figure 4 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Viaarxiv icon