Picture for Xulong Zhang

Xulong Zhang

ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations

Add code
Nov 20, 2024
Viaarxiv icon

Semi-Supervised Self-Learning Enhanced Music Emotion Recognition

Add code
Oct 29, 2024
Viaarxiv icon

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Add code
Sep 29, 2024
Figure 1 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 2 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 3 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 4 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Viaarxiv icon

Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

Add code
May 28, 2024
Figure 1 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 2 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 3 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 4 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Viaarxiv icon

RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

Add code
May 28, 2024
Figure 1 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 2 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 3 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 4 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Viaarxiv icon

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

Add code
May 27, 2024
Viaarxiv icon

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

Add code
May 02, 2024
Viaarxiv icon

Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

Add code
May 01, 2024
Viaarxiv icon

QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

Add code
Apr 30, 2024
Viaarxiv icon

EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

Add code
Apr 30, 2024
Viaarxiv icon