Picture for Mustafa Shukor

Mustafa Shukor

Multimodal Autoregressive Pre-training of Large Vision Encoders

Add code
Nov 21, 2024
Figure 1 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 2 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 3 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 4 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Viaarxiv icon

Skipping Computations in Multimodal LLMs

Add code
Oct 12, 2024
Viaarxiv icon

A Concept-Based Explainability Framework for Large Multimodal Models

Add code
Jun 12, 2024
Viaarxiv icon

Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features

Add code
Jun 05, 2024
Figure 1 for Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
Figure 2 for Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
Figure 3 for Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
Figure 4 for Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
Viaarxiv icon

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs

Add code
May 26, 2024
Figure 1 for Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Figure 2 for Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Figure 3 for Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Figure 4 for Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Viaarxiv icon

What Makes Multimodal In-Context Learning Work?

Add code
Apr 25, 2024
Figure 1 for What Makes Multimodal In-Context Learning Work?
Figure 2 for What Makes Multimodal In-Context Learning Work?
Figure 3 for What Makes Multimodal In-Context Learning Work?
Figure 4 for What Makes Multimodal In-Context Learning Work?
Viaarxiv icon

FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models

Add code
Mar 29, 2024
Viaarxiv icon

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs

Add code
Mar 20, 2024
Figure 1 for Improved Baselines for Data-efficient Perceptual Augmentation of LLMs
Figure 2 for Improved Baselines for Data-efficient Perceptual Augmentation of LLMs
Figure 3 for Improved Baselines for Data-efficient Perceptual Augmentation of LLMs
Figure 4 for Improved Baselines for Data-efficient Perceptual Augmentation of LLMs
Viaarxiv icon

Empirical Study of PEFT techniques for Winter Wheat Segmentation

Add code
Oct 03, 2023
Viaarxiv icon

Zero-Shot Refinement of Buildings' Segmentation Models using SAM

Add code
Oct 03, 2023
Viaarxiv icon