Picture for Sebastien Bubeck

Sebastien Bubeck

TinyGSM: achieving >80% on GSM8k with small language models

Add code
Dec 14, 2023
Figure 1 for TinyGSM: achieving >80% on GSM8k with small language models
Figure 2 for TinyGSM: achieving >80% on GSM8k with small language models
Figure 3 for TinyGSM: achieving >80% on GSM8k with small language models
Figure 4 for TinyGSM: achieving >80% on GSM8k with small language models
Viaarxiv icon

AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers

Add code
Oct 14, 2022
Figure 1 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 2 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 3 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 4 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Viaarxiv icon

LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models

Add code
Mar 04, 2022
Figure 1 for LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Figure 2 for LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Figure 3 for LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Figure 4 for LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Viaarxiv icon

FEAR: A Simple Lightweight Method to Rank Architectures

Add code
Jun 07, 2021
Figure 1 for FEAR: A Simple Lightweight Method to Rank Architectures
Figure 2 for FEAR: A Simple Lightweight Method to Rank Architectures
Figure 3 for FEAR: A Simple Lightweight Method to Rank Architectures
Figure 4 for FEAR: A Simple Lightweight Method to Rank Architectures
Viaarxiv icon

Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers

Add code
Jun 12, 2019
Figure 1 for Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers
Figure 2 for Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers
Figure 3 for Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers
Figure 4 for Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers
Viaarxiv icon

Is Q-learning Provably Efficient?

Add code
Jul 10, 2018
Figure 1 for Is Q-learning Provably Efficient?
Viaarxiv icon

On Finding the Largest Mean Among Many

Add code
Jun 17, 2013
Figure 1 for On Finding the Largest Mean Among Many
Figure 2 for On Finding the Largest Mean Among Many
Viaarxiv icon

Optimal discovery with probabilistic expert advice: finite time analysis and macroscopic optimality

Add code
Mar 29, 2013
Figure 1 for Optimal discovery with probabilistic expert advice: finite time analysis and macroscopic optimality
Figure 2 for Optimal discovery with probabilistic expert advice: finite time analysis and macroscopic optimality
Viaarxiv icon

The best of both worlds: stochastic and adversarial bandits

Add code
Feb 20, 2012
Figure 1 for The best of both worlds: stochastic and adversarial bandits
Viaarxiv icon

Minimax Policies for Combinatorial Prediction Games

Add code
May 24, 2011
Figure 1 for Minimax Policies for Combinatorial Prediction Games
Figure 2 for Minimax Policies for Combinatorial Prediction Games
Figure 3 for Minimax Policies for Combinatorial Prediction Games
Figure 4 for Minimax Policies for Combinatorial Prediction Games
Viaarxiv icon