Picture for Arthur Mensch

Arthur Mensch

DMA, PARIETAL

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

Mixtral of Experts

Add code
Jan 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Mistral 7B

Add code
Oct 10, 2023
Figure 1 for Mistral 7B
Figure 2 for Mistral 7B
Figure 3 for Mistral 7B
Figure 4 for Mistral 7B
Viaarxiv icon

Three ways to improve feature alignment for open vocabulary detection

Add code
Mar 23, 2023
Viaarxiv icon

Self-conditioned Embedding Diffusion for Text Generation

Add code
Nov 08, 2022
Viaarxiv icon

Dissecting adaptive methods in GANs

Add code
Oct 09, 2022
Figure 1 for Dissecting adaptive methods in GANs
Figure 2 for Dissecting adaptive methods in GANs
Figure 3 for Dissecting adaptive methods in GANs
Figure 4 for Dissecting adaptive methods in GANs
Viaarxiv icon

Flamingo: a Visual Language Model for Few-Shot Learning

Add code
Apr 29, 2022
Figure 1 for Flamingo: a Visual Language Model for Few-Shot Learning
Figure 2 for Flamingo: a Visual Language Model for Few-Shot Learning
Figure 3 for Flamingo: a Visual Language Model for Few-Shot Learning
Figure 4 for Flamingo: a Visual Language Model for Few-Shot Learning
Viaarxiv icon

Training Compute-Optimal Large Language Models

Add code
Mar 29, 2022
Figure 1 for Training Compute-Optimal Large Language Models
Figure 2 for Training Compute-Optimal Large Language Models
Figure 3 for Training Compute-Optimal Large Language Models
Figure 4 for Training Compute-Optimal Large Language Models
Viaarxiv icon

Unified Scaling Laws for Routed Language Models

Add code
Feb 09, 2022
Figure 1 for Unified Scaling Laws for Routed Language Models
Figure 2 for Unified Scaling Laws for Routed Language Models
Figure 3 for Unified Scaling Laws for Routed Language Models
Figure 4 for Unified Scaling Laws for Routed Language Models
Viaarxiv icon