Xuanjun Chen

CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset

Jan 14, 2025
Via arXiv

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Nov 11, 2024
Via arXiv

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Nov 08, 2024
Via arXiv

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models

Sep 21, 2024
Via arXiv

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement

Sep 16, 2024
Via arXiv

DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset

Sep 13, 2024
Via arXiv

Neural Codec-based Adversarial Sample Detection for Speaker Verification

Jun 07, 2024
Via arXiv

Singing Voice Graph Modeling for SingFake Detection

Jun 05, 2024
Via arXiv

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Feb 20, 2024
Via arXiv

Towards audio language modeling -- an overview

Feb 20, 2024
Via arXiv