Picture for Neil Houlsby

Neil Houlsby

PaliGemma: A versatile 3B VLM for transfer

Add code
Jul 10, 2024
Figure 1 for PaliGemma: A versatile 3B VLM for transfer
Figure 2 for PaliGemma: A versatile 3B VLM for transfer
Figure 3 for PaliGemma: A versatile 3B VLM for transfer
Figure 4 for PaliGemma: A versatile 3B VLM for transfer
Viaarxiv icon

Semantica: An Adaptable Image-Conditioned Diffusion Model

Add code
May 23, 2024
Viaarxiv icon

SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation

Add code
May 14, 2024
Figure 1 for SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Figure 2 for SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Figure 3 for SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Figure 4 for SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Viaarxiv icon

Capabilities of Gemini Models in Medicine

Add code
May 01, 2024
Figure 1 for Capabilities of Gemini Models in Medicine
Figure 2 for Capabilities of Gemini Models in Medicine
Figure 3 for Capabilities of Gemini Models in Medicine
Figure 4 for Capabilities of Gemini Models in Medicine
Viaarxiv icon

Frozen Feature Augmentation for Few-Shot Image Classification

Add code
Mar 15, 2024
Figure 1 for Frozen Feature Augmentation for Few-Shot Image Classification
Figure 2 for Frozen Feature Augmentation for Few-Shot Image Classification
Figure 3 for Frozen Feature Augmentation for Few-Shot Image Classification
Figure 4 for Frozen Feature Augmentation for Few-Shot Image Classification
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Scaling Laws for Sparsely-Connected Foundation Models

Add code
Sep 15, 2023
Viaarxiv icon

From Sparse to Soft Mixtures of Experts

Add code
Aug 02, 2023
Viaarxiv icon

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Add code
Jul 12, 2023
Viaarxiv icon