Zeynep Akata

How to Merge Your Multimodal Models Over Time?

Dec 09, 2024
Via arXiv

Post-hoc Probabilistic Vision-Language Models

Dec 08, 2024
Via arXiv

FLAIR: VLM with Fine-grained Language-informed Image Representations

Dec 04, 2024
Via arXiv

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training

Dec 02, 2024
Via arXiv

Context-Aware Multimodal Pretraining

Nov 22, 2024
Via arXiv

Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences

Oct 27, 2024
Via arXiv

Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)

Oct 25, 2024
Via arXiv

Scalable Ranked Preference Optimization for Text-to-Image Generation

Oct 23, 2024
Via arXiv

A Practitioner's Guide to Continual Multimodal Pretraining

Aug 26, 2024
Via arXiv

Geometry Fidelity for Spherical Images

Jul 25, 2024
Via arXiv