Zhenjia Bai

HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification

Jul 23, 2024
Via arXiv

Memory-Inspired Temporal Prompt Interaction for Text-Image Classification

Jan 26, 2024
Via arXiv