Picture for Lucile Saulnier

Lucile Saulnier

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

Mixtral of Experts

Add code
Jan 08, 2024
Viaarxiv icon

Mistral 7B

Add code
Oct 10, 2023
Figure 1 for Mistral 7B
Figure 2 for Mistral 7B
Figure 3 for Mistral 7B
Figure 4 for Mistral 7B
Viaarxiv icon

OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Add code
Jun 21, 2023
Figure 1 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 2 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 3 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 4 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Viaarxiv icon

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Add code
Mar 07, 2023
Figure 1 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 2 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 3 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 4 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

What Language Model to Train if You Have One Million GPU Hours?

Add code
Nov 08, 2022
Viaarxiv icon

Training Transformers Together

Add code
Jul 07, 2022
Figure 1 for Training Transformers Together
Figure 2 for Training Transformers Together
Viaarxiv icon

Distributed Deep Learning in Open Collaborations

Add code
Jun 18, 2021
Figure 1 for Distributed Deep Learning in Open Collaborations
Figure 2 for Distributed Deep Learning in Open Collaborations
Figure 3 for Distributed Deep Learning in Open Collaborations
Figure 4 for Distributed Deep Learning in Open Collaborations
Viaarxiv icon