Matthieu Cord

Skipping Computations in Multimodal LLMs

Oct 12, 2024
Via arXiv

Annealed Winner-Takes-All for Motion Forecasting

Sep 18, 2024
Via arXiv

LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Foundation Models

Sep 18, 2024
Via arXiv

ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable

Sep 12, 2024
Via arXiv

A Concept-Based Explainability Framework for Large Multimodal Models

Jun 12, 2024
Via arXiv

Valeo4Cast: A Modular Approach to End-to-End Forecasting

Jun 12, 2024
Via arXiv

Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features

Jun 05, 2024
Via arXiv

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs

May 26, 2024
Via arXiv

What matters when building vision-language models?

May 03, 2024
Via arXiv

What Makes Multimodal In-Context Learning Work?

Apr 25, 2024
Via arXiv