Picture for You Zhang

You Zhang

Medical Artificial Intelligence and Automation

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions

Add code
Sep 25, 2024
Viaarxiv icon

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

Add code
Aug 28, 2024
Viaarxiv icon

A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection

Add code
Jun 20, 2024
Viaarxiv icon

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection

Add code
Jun 04, 2024
Figure 1 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 2 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 3 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 4 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Viaarxiv icon

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan

Add code
May 08, 2024
Viaarxiv icon

Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction

Add code
Apr 09, 2024
Viaarxiv icon

Personalized LoRA for Human-Centered Text Understanding

Add code
Mar 10, 2024
Viaarxiv icon

Parameterized Decision-making with Multi-modal Perception for Autonomous Driving

Add code
Dec 19, 2023
Viaarxiv icon

Learning Arousal-Valence Representation from Categorical Emotion Labels of Speech

Add code
Nov 24, 2023
Viaarxiv icon

FDDM: Unsupervised Medical Image Translation with a Frequency-Decoupled Diffusion Model

Add code
Nov 19, 2023
Viaarxiv icon