Picture for Zhaoyu Chen

Zhaoyu Chen

MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration

Add code
Oct 17, 2024
Figure 1 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 2 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 3 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 4 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Viaarxiv icon

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation

Add code
Sep 28, 2024
Figure 1 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 2 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 3 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 4 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Viaarxiv icon

General Compression Framework for Efficient Transformer Object Tracking

Add code
Sep 26, 2024
Viaarxiv icon

KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition

Add code
Sep 14, 2024
Figure 1 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 2 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 3 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 4 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Viaarxiv icon

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning

Add code
Aug 28, 2024
Figure 1 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 2 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 3 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 4 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Viaarxiv icon

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators

Add code
Aug 22, 2024
Figure 1 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 2 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 3 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 4 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Viaarxiv icon

PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving

Add code
Jul 18, 2024
Viaarxiv icon

Large Vision-Language Models as Emotion Recognizers in Context Awareness

Add code
Jul 16, 2024
Viaarxiv icon

Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training

Add code
Jul 06, 2024
Figure 1 for Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training
Figure 2 for Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training
Figure 3 for Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training
Figure 4 for Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training
Viaarxiv icon

Self-Cooperation Knowledge Distillation for Novel Class Discovery

Add code
Jul 02, 2024
Viaarxiv icon