Picture for Jean-Bastien Grill

Jean-Bastien Grill

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

Stepping on the Edge: Curvature Aware Learning Rate Tuners

Add code
Jul 08, 2024
Figure 1 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 2 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 3 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 4 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Viaarxiv icon

Emergent Communication: Generalization and Overfitting in Lewis Games

Add code
Sep 30, 2022
Figure 1 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 2 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 3 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 4 for Emergent Communication: Generalization and Overfitting in Lewis Games
Viaarxiv icon

BYOL-Explore: Exploration by Bootstrapped Prediction

Add code
Jun 16, 2022
Figure 1 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 2 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 3 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 4 for BYOL-Explore: Exploration by Bootstrapped Prediction
Viaarxiv icon

Broaden Your Views for Self-Supervised Video Learning

Add code
Mar 30, 2021
Figure 1 for Broaden Your Views for Self-Supervised Video Learning
Figure 2 for Broaden Your Views for Self-Supervised Video Learning
Figure 3 for Broaden Your Views for Self-Supervised Video Learning
Figure 4 for Broaden Your Views for Self-Supervised Video Learning
Viaarxiv icon

BYOL works even without batch statistics

Add code
Oct 20, 2020
Figure 1 for BYOL works even without batch statistics
Figure 2 for BYOL works even without batch statistics
Viaarxiv icon

Monte-Carlo Tree Search as Regularized Policy Optimization

Add code
Jul 24, 2020
Figure 1 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 2 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 3 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 4 for Monte-Carlo Tree Search as Regularized Policy Optimization
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Add code
Jun 13, 2020
Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

World Discovery Models

Add code
Mar 01, 2019
Figure 1 for World Discovery Models
Figure 2 for World Discovery Models
Figure 3 for World Discovery Models
Figure 4 for World Discovery Models
Viaarxiv icon

Optimistic optimization of a Brownian

Add code
Jan 15, 2019
Viaarxiv icon