Tomer Galanti

The Fair Language Model Paradox

Oct 15, 2024

Formation of Representations in Neural Networks

Oct 03, 2024

On the Power of Decision Trees in Auto-Regressive Language Modeling

Sep 27, 2024

Distributed Speculative Inference of Large Language Models

May 23, 2024

Centered Self-Attention Layers

Jun 02, 2023

Reverse Engineering Self-Supervised Learning

May 24, 2023

The Probabilistic Stability of Stochastic Gradient Descent

Mar 23, 2023

Norm-based Generalization Bounds for Compositionally Sparse Neural Networks

Jan 28, 2023

Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions

Jan 11, 2023

Generalization Bounds for Transfer Learning with Pretrained Classifiers

Dec 23, 2022