Picture for Xuesong Yang

Xuesong Yang

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

Add code
Jan 07, 2025
Figure 1 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 2 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 3 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 4 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Viaarxiv icon

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Add code
Dec 18, 2024
Viaarxiv icon

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Add code
Nov 08, 2024
Viaarxiv icon

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix

Add code
Nov 26, 2022
Viaarxiv icon

NDGGNET-A Node Independent Gate based Graph Neural Networks

Add code
May 11, 2022
Figure 1 for NDGGNET-A Node Independent Gate based Graph Neural Networks
Figure 2 for NDGGNET-A Node Independent Gate based Graph Neural Networks
Figure 3 for NDGGNET-A Node Independent Gate based Graph Neural Networks
Figure 4 for NDGGNET-A Node Independent Gate based Graph Neural Networks
Viaarxiv icon

LibFewShot: A Comprehensive Library for Few-shot Learning

Add code
Sep 10, 2021
Figure 1 for LibFewShot: A Comprehensive Library for Few-shot Learning
Figure 2 for LibFewShot: A Comprehensive Library for Few-shot Learning
Figure 3 for LibFewShot: A Comprehensive Library for Few-shot Learning
Figure 4 for LibFewShot: A Comprehensive Library for Few-shot Learning
Viaarxiv icon

Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning

Add code
Jul 22, 2021
Figure 1 for Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning
Figure 2 for Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning
Figure 3 for Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning
Figure 4 for Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning
Viaarxiv icon

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Add code
Dec 14, 2020
Figure 1 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Figure 2 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Viaarxiv icon

AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Add code
Jun 06, 2019
Figure 1 for AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Figure 2 for AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Figure 3 for AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Figure 4 for AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Viaarxiv icon

When CTC Training Meets Acoustic Landmarks

Add code
Nov 05, 2018
Figure 1 for When CTC Training Meets Acoustic Landmarks
Figure 2 for When CTC Training Meets Acoustic Landmarks
Figure 3 for When CTC Training Meets Acoustic Landmarks
Figure 4 for When CTC Training Meets Acoustic Landmarks
Viaarxiv icon