Winoground


Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data

Add code
Apr 07, 2025
Viaarxiv icon

Natural Language Inference Improves Compositionality in Vision-Language Models

Add code
Oct 29, 2024
Figure 1 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 2 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 3 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 4 for Natural Language Inference Improves Compositionality in Vision-Language Models
Viaarxiv icon

Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning

Add code
May 26, 2024
Figure 1 for Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning
Figure 2 for Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning
Figure 3 for Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning
Figure 4 for Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning
Viaarxiv icon

ImageInWords: Unlocking Hyper-Detailed Image Descriptions

Add code
May 05, 2024
Viaarxiv icon

ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation

Add code
Feb 07, 2024
Viaarxiv icon

Prompting Large Vision-Language Models for Compositional Reasoning

Add code
Jan 20, 2024
Viaarxiv icon

A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics

Add code
Dec 04, 2023
Viaarxiv icon

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Add code
Nov 17, 2023
Figure 1 for SelfEval: Leveraging the discriminative nature of generative models for evaluation
Figure 2 for SelfEval: Leveraging the discriminative nature of generative models for evaluation
Figure 3 for SelfEval: Leveraging the discriminative nature of generative models for evaluation
Figure 4 for SelfEval: Leveraging the discriminative nature of generative models for evaluation
Viaarxiv icon

Exploring Question Decomposition for Zero-Shot VQA

Add code
Oct 25, 2023
Figure 1 for Exploring Question Decomposition for Zero-Shot VQA
Figure 2 for Exploring Question Decomposition for Zero-Shot VQA
Figure 3 for Exploring Question Decomposition for Zero-Shot VQA
Figure 4 for Exploring Question Decomposition for Zero-Shot VQA
Viaarxiv icon

Augmenting CLIP with Improved Visio-Linguistic Reasoning

Add code
Jul 27, 2023
Viaarxiv icon