Picture for Jianzong Wang

Jianzong Wang

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Add code
Sep 29, 2024
Viaarxiv icon

PFID: Privacy First Inference Delegation Framework for LLMs

Add code
Jun 18, 2024
Viaarxiv icon

RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

Add code
May 28, 2024
Figure 1 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 2 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 3 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 4 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Viaarxiv icon

Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

Add code
May 28, 2024
Figure 1 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 2 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 3 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 4 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Viaarxiv icon

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

Add code
May 27, 2024
Viaarxiv icon

Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Add code
May 22, 2024
Figure 1 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 2 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 3 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 4 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Viaarxiv icon

PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition

Add code
May 11, 2024
Viaarxiv icon

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

Add code
May 02, 2024
Viaarxiv icon

Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

Add code
May 01, 2024
Viaarxiv icon

EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

Add code
Apr 30, 2024
Viaarxiv icon