Picture for Kate Saenko

Kate Saenko

OP-LoRA: The Blessing of Dimensionality

Add code
Dec 13, 2024
Viaarxiv icon

SAT: Spatial Aptitude Training for Multimodal Language Models

Add code
Dec 10, 2024
Viaarxiv icon

Is Large-Scale Pretraining the Secret to Good Domain Generalization?

Add code
Dec 03, 2024
Viaarxiv icon

KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models

Add code
Jul 25, 2024
Viaarxiv icon

Tell Me What's Next: Textual Foresight for Generic UI Representations

Add code
Jun 12, 2024
Viaarxiv icon

SLANT: Spurious Logo ANalysis Toolkit

Add code
Jun 03, 2024
Figure 1 for SLANT: Spurious Logo ANalysis Toolkit
Figure 2 for SLANT: Spurious Logo ANalysis Toolkit
Figure 3 for SLANT: Spurious Logo ANalysis Toolkit
Figure 4 for SLANT: Spurious Logo ANalysis Toolkit
Viaarxiv icon

An Introduction to Vision-Language Modeling

Add code
May 27, 2024
Figure 1 for An Introduction to Vision-Language Modeling
Figure 2 for An Introduction to Vision-Language Modeling
Figure 3 for An Introduction to Vision-Language Modeling
Viaarxiv icon

Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models

Add code
Apr 21, 2024
Viaarxiv icon

Koala: Key frame-conditioned long video-LLM

Add code
Apr 05, 2024
Viaarxiv icon

Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks

Add code
Feb 01, 2024
Viaarxiv icon