Jing Xiao

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Sep 29, 2024
Via arXiv

ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue

Sep 26, 2024
Via arXiv

Personalized Knowledge Tracing through Student Representation Reconstruction and Class Imbalance Mitigation

Sep 10, 2024
Via arXiv

Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions

Sep 02, 2024
Via arXiv

Multi-periodicity dependency Transformer based on spectrum offset for radio frequency fingerprint identification

Aug 14, 2024
Via arXiv

Planning with Large Language Models for Conversational Agents

Jul 04, 2024
Via arXiv

Deep Learning Segmentation of Ascites on Abdominal CT Scans for Automatic Volume Quantification

Jun 23, 2024
Via arXiv

PFID: Privacy First Inference Delegation Framework for LLMs

Jun 18, 2024
Via arXiv

A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed

Jun 13, 2024
Via arXiv

Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

May 28, 2024
Via arXiv