Picture for Sungkyung Kim

Sungkyung Kim

Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks

Add code
Oct 12, 2024
Figure 1 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 2 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 3 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Figure 4 for Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Viaarxiv icon

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Add code
Mar 26, 2024
Viaarxiv icon