Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks

Add code
Oct 12, 2024
Figure 1 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 2 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 3 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 4 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: