Picture for Yuxun Tang

Yuxun Tang

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Viaarxiv icon

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Add code
Sep 24, 2024
Viaarxiv icon

Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm

Add code
Sep 11, 2024
Viaarxiv icon

SingOMD: Singing Oriented Multi-resolution Discrete Representation Construction from Speech Models

Add code
Jun 20, 2024
Viaarxiv icon

SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction

Add code
Jun 16, 2024
Viaarxiv icon

TokSing: Singing Voice Synthesis based on Discrete Tokens

Add code
Jun 12, 2024
Figure 1 for TokSing: Singing Voice Synthesis based on Discrete Tokens
Figure 2 for TokSing: Singing Voice Synthesis based on Discrete Tokens
Figure 3 for TokSing: Singing Voice Synthesis based on Discrete Tokens
Figure 4 for TokSing: Singing Voice Synthesis based on Discrete Tokens
Viaarxiv icon

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

Add code
Jun 11, 2024
Figure 1 for The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
Figure 2 for The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
Figure 3 for The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
Figure 4 for The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
Viaarxiv icon

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection

Add code
Jun 04, 2024
Figure 1 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 2 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 3 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 4 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Viaarxiv icon

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan

Add code
May 08, 2024
Viaarxiv icon

Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2

Add code
Jan 31, 2024
Figure 1 for Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2
Figure 2 for Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2
Figure 3 for Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2
Figure 4 for Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2
Viaarxiv icon