Picture for Mark Ibrahim

Mark Ibrahim

Transformers Can Navigate Mazes With Multi-Step Prediction

Add code
Dec 06, 2024
Viaarxiv icon

UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling

Add code
Aug 09, 2024
Viaarxiv icon

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

Add code
Jul 25, 2024
Viaarxiv icon

Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations?

Add code
Jun 15, 2024
Viaarxiv icon

The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

Add code
Jun 07, 2024
Viaarxiv icon

An Introduction to Vision-Language Modeling

Add code
May 27, 2024
Figure 1 for An Introduction to Vision-Language Modeling
Figure 2 for An Introduction to Vision-Language Modeling
Figure 3 for An Introduction to Vision-Language Modeling
Viaarxiv icon

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Add code
Apr 30, 2024
Figure 1 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 2 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 3 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 4 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Viaarxiv icon

Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

Add code
Apr 25, 2024
Viaarxiv icon

Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

Add code
Apr 16, 2024
Viaarxiv icon

The Bias of Harmful Label Associations in Vision-Language Models

Add code
Feb 11, 2024
Viaarxiv icon