Picture for Julian Eisenschlos

Julian Eisenschlos

PaliGemma: A versatile 3B VLM for transfer

Add code
Jul 10, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Add code
Oct 07, 2022
Figure 1 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 2 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 3 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 4 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Viaarxiv icon

MultiFiT: Efficient Multi-lingual Language Model Fine-tuning

Add code
Sep 10, 2019
Figure 1 for MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Figure 2 for MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Figure 3 for MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Figure 4 for MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Viaarxiv icon