Dongqi Tang

Scalable Autoregressive Monocular Depth Estimation

Nov 18, 2024
Via arXiv

TokenPacker: Efficient Visual Projector for Multimodal LLM

Jul 02, 2024
Via arXiv

Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification

Jan 02, 2024
Via arXiv

Osprey: Pixel Understanding with Visual Instruction Tuning

Dec 25, 2023
Via arXiv

Text as Image: Learning Transferable Adapter for Multi-Label Classification

Dec 07, 2023
Via arXiv

Label-efficient Segmentation via Affinity Propagation

Oct 17, 2023
Via arXiv