Picture for Yongyi Zang

Yongyi Zang

ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech

Add code
Feb 13, 2025
Viaarxiv icon

Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders

Add code
Jan 07, 2025
Figure 1 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 2 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 3 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 4 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Viaarxiv icon

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

Add code
Aug 28, 2024
Viaarxiv icon

The Interpretation Gap in Text-to-Music Generation Models

Add code
Jul 14, 2024
Figure 1 for The Interpretation Gap in Text-to-Music Generation Models
Figure 2 for The Interpretation Gap in Text-to-Music Generation Models
Figure 3 for The Interpretation Gap in Text-to-Music Generation Models
Figure 4 for The Interpretation Gap in Text-to-Music Generation Models
Viaarxiv icon

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection

Add code
Jun 04, 2024
Figure 1 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 2 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 3 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 4 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Viaarxiv icon

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation

Add code
May 22, 2024
Viaarxiv icon

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan

Add code
May 08, 2024
Viaarxiv icon

SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription

Add code
Sep 22, 2023
Viaarxiv icon

SingFake: Singing Voice Deepfake Detection

Add code
Sep 14, 2023
Viaarxiv icon

Phase perturbation improves channel robustness for speech spoofing countermeasures

Add code
Jun 06, 2023
Viaarxiv icon