Zichang Tan

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Sep 01, 2024
Via arXiv

SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition

Jul 30, 2024
Via arXiv

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

Jun 13, 2024
Via arXiv

Training-Free Unsupervised Prompt for Vision-Language Models

Apr 25, 2024
Via arXiv

PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition

Jan 31, 2024
Via arXiv

Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection

Dec 27, 2023
Via arXiv

ProtoHPE: Prototype-guided High-frequency Patch Enhancement for Visible-Infrared Person Re-identification

Oct 11, 2023
Via arXiv

Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation

Sep 18, 2023
Via arXiv

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation

Aug 14, 2023
Via arXiv

General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds with One Stone

Jul 19, 2023
Via arXiv