Picture for Wenqiang Zhang

Wenqiang Zhang

Tsinghua University

A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection

Add code
Oct 29, 2024
Viaarxiv icon

VideoSAM: Open-World Video Segmentation

Add code
Oct 11, 2024
Figure 1 for VideoSAM: Open-World Video Segmentation
Figure 2 for VideoSAM: Open-World Video Segmentation
Figure 3 for VideoSAM: Open-World Video Segmentation
Figure 4 for VideoSAM: Open-World Video Segmentation
Viaarxiv icon

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation

Add code
Sep 28, 2024
Figure 1 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 2 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 3 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 4 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Viaarxiv icon

General Compression Framework for Efficient Transformer Object Tracking

Add code
Sep 26, 2024
Viaarxiv icon

Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection

Add code
Aug 28, 2024
Viaarxiv icon

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning

Add code
Aug 28, 2024
Figure 1 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 2 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 3 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 4 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Viaarxiv icon

A Survey on Facial Expression Recognition of Static and Dynamic Emotions

Add code
Aug 28, 2024
Viaarxiv icon

Hi-EF: Benchmarking Emotion Forecasting in Human-interaction

Add code
Jul 23, 2024
Viaarxiv icon

All rivers run into the sea: Unified Modality Brain-like Emotional Central Mechanism

Add code
Jul 22, 2024
Viaarxiv icon

PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving

Add code
Jul 18, 2024
Viaarxiv icon