Picture for Jusang Oh

Jusang Oh

Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks

Add code
Oct 12, 2024
Figure 1 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 2 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 3 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 4 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Viaarxiv icon