Leon Bottou

MagicPIG: LSH Sampling for Efficient LLM Generation

Oct 21, 2024
Via arXiv

Birth of a Transformer: A Memory Viewpoint

Jun 01, 2023
Via arXiv

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Mar 27, 2023
Via arXiv

The Effects of Regularization and Data Augmentation are Class Dependent

Apr 08, 2022
Via arXiv

An Attract-Repel Decomposition of Undirected Networks

Jun 17, 2021
Via arXiv

Linear unit-tests for invariance discovery

Feb 22, 2021
Via arXiv

AdaGrad stepsizes: Sharp convergence over nonconvex landscapes, from any initialization

Jun 21, 2018
Via arXiv

Empirical Analysis of the Hessian of Over-Parametrized Neural Networks

May 07, 2018
Via arXiv

Geometrical Insights for Implicit Generative Modeling

Mar 12, 2018
Via arXiv

Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

Oct 05, 2017
Via arXiv