Picture for Jianzong Wang

Jianzong Wang

ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations

Add code
Nov 20, 2024
Viaarxiv icon

Incremental Label Distribution Learning with Scalable Graph Convolutional Networks

Add code
Nov 20, 2024
Viaarxiv icon

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Add code
Sep 29, 2024
Figure 1 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 2 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 3 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Figure 4 for IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
Viaarxiv icon

PFID: Privacy First Inference Delegation Framework for LLMs

Add code
Jun 18, 2024
Figure 1 for PFID: Privacy First Inference Delegation Framework for LLMs
Figure 2 for PFID: Privacy First Inference Delegation Framework for LLMs
Figure 3 for PFID: Privacy First Inference Delegation Framework for LLMs
Figure 4 for PFID: Privacy First Inference Delegation Framework for LLMs
Viaarxiv icon

Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

Add code
May 28, 2024
Figure 1 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 2 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 3 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Figure 4 for Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Viaarxiv icon

RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

Add code
May 28, 2024
Figure 1 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 2 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 3 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Figure 4 for RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Viaarxiv icon

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

Add code
May 27, 2024
Viaarxiv icon

Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Add code
May 22, 2024
Figure 1 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 2 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 3 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Figure 4 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Viaarxiv icon

PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition

Add code
May 11, 2024
Viaarxiv icon

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

Add code
May 02, 2024
Viaarxiv icon