Picture for Yongyi Zang

Yongyi Zang

Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders

Add code
Jan 07, 2025
Figure 1 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 2 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 3 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Figure 4 for Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
Viaarxiv icon

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

Add code
Aug 28, 2024
Viaarxiv icon

The Interpretation Gap in Text-to-Music Generation Models

Add code
Jul 14, 2024
Viaarxiv icon

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection

Add code
Jun 04, 2024
Figure 1 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 2 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 3 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 4 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Viaarxiv icon

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation

Add code
May 22, 2024
Viaarxiv icon

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan

Add code
May 08, 2024
Viaarxiv icon

SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription

Add code
Sep 22, 2023
Viaarxiv icon

SingFake: Singing Voice Deepfake Detection

Add code
Sep 14, 2023
Viaarxiv icon

Phase perturbation improves channel robustness for speech spoofing countermeasures

Add code
Jun 06, 2023
Viaarxiv icon