Morris Alper

WAFFLE: Multimodal Floorplan Understanding in the Wild

Dec 01, 2024
Via arXiv

Emergent Visual-Semantic Hierarchies in Image-Text Representations

Jul 11, 2024
Via arXiv

ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation

Mar 02, 2024
Via arXiv

MOCHa: Multi-Objective Reinforcement Mitigating Caption Hallucinations

Dec 06, 2023
Via arXiv

Kiki or Bouba? Sound Symbolism in Vision-and-Language Models

Oct 25, 2023
Via arXiv

Learning Human-Human Interactions in Images from Weak Textual Supervision

Apr 27, 2023
Via arXiv

Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding

Mar 21, 2023
Via arXiv