Picture for Sébastien Bubeck

Sébastien Bubeck

MSR - INRIA

Small Language Models for Application Interactions: A Case Study

Add code
May 23, 2024
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Positional Description Matters for Transformers Arithmetic

Add code
Nov 22, 2023
Viaarxiv icon

Textbooks Are All You Need II: phi-1.5 technical report

Add code
Sep 11, 2023
Figure 1 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 2 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 3 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 4 for Textbooks Are All You Need II: phi-1.5 technical report
Viaarxiv icon

Textbooks Are All You Need

Add code
Jun 20, 2023
Figure 1 for Textbooks Are All You Need
Figure 2 for Textbooks Are All You Need
Figure 3 for Textbooks Are All You Need
Figure 4 for Textbooks Are All You Need
Viaarxiv icon

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Add code
Mar 27, 2023
Figure 1 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 2 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 3 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 4 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Viaarxiv icon

Learning threshold neurons via the "edge of stability"

Add code
Dec 14, 2022
Figure 1 for Learning threshold neurons via the "edge of stability"
Figure 2 for Learning threshold neurons via the "edge of stability"
Figure 3 for Learning threshold neurons via the "edge of stability"
Figure 4 for Learning threshold neurons via the "edge of stability"
Viaarxiv icon

How to Fine-Tune Vision Models with SGD

Add code
Nov 17, 2022
Viaarxiv icon

Unveiling Transformers with LEGO: a synthetic reasoning task

Add code
Jun 09, 2022
Figure 1 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 2 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 3 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 4 for Unveiling Transformers with LEGO: a synthetic reasoning task
Viaarxiv icon

Data Augmentation as Feature Manipulation: a story of desert cows and grass cows

Add code
Mar 03, 2022
Figure 1 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 2 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 3 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 4 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Viaarxiv icon