Picture for Tomer Galanti

Tomer Galanti

The Fair Language Model Paradox

Add code
Oct 15, 2024
Figure 1 for The Fair Language Model Paradox
Figure 2 for The Fair Language Model Paradox
Figure 3 for The Fair Language Model Paradox
Figure 4 for The Fair Language Model Paradox
Viaarxiv icon

Formation of Representations in Neural Networks

Add code
Oct 03, 2024
Viaarxiv icon

On the Power of Decision Trees in Auto-Regressive Language Modeling

Add code
Sep 27, 2024
Figure 1 for On the Power of Decision Trees in Auto-Regressive Language Modeling
Figure 2 for On the Power of Decision Trees in Auto-Regressive Language Modeling
Figure 3 for On the Power of Decision Trees in Auto-Regressive Language Modeling
Figure 4 for On the Power of Decision Trees in Auto-Regressive Language Modeling
Viaarxiv icon

Distributed Speculative Inference of Large Language Models

Add code
May 23, 2024
Viaarxiv icon

Centered Self-Attention Layers

Add code
Jun 02, 2023
Figure 1 for Centered Self-Attention Layers
Figure 2 for Centered Self-Attention Layers
Figure 3 for Centered Self-Attention Layers
Figure 4 for Centered Self-Attention Layers
Viaarxiv icon

Reverse Engineering Self-Supervised Learning

Add code
May 24, 2023
Figure 1 for Reverse Engineering Self-Supervised Learning
Figure 2 for Reverse Engineering Self-Supervised Learning
Figure 3 for Reverse Engineering Self-Supervised Learning
Figure 4 for Reverse Engineering Self-Supervised Learning
Viaarxiv icon

The Probabilistic Stability of Stochastic Gradient Descent

Add code
Mar 23, 2023
Figure 1 for The Probabilistic Stability of Stochastic Gradient Descent
Figure 2 for The Probabilistic Stability of Stochastic Gradient Descent
Figure 3 for The Probabilistic Stability of Stochastic Gradient Descent
Figure 4 for The Probabilistic Stability of Stochastic Gradient Descent
Viaarxiv icon

Norm-based Generalization Bounds for Compositionally Sparse Neural Networks

Add code
Jan 28, 2023
Figure 1 for Norm-based Generalization Bounds for Compositionally Sparse Neural Networks
Figure 2 for Norm-based Generalization Bounds for Compositionally Sparse Neural Networks
Figure 3 for Norm-based Generalization Bounds for Compositionally Sparse Neural Networks
Figure 4 for Norm-based Generalization Bounds for Compositionally Sparse Neural Networks
Viaarxiv icon

Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions

Add code
Jan 11, 2023
Figure 1 for Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions
Figure 2 for Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions
Figure 3 for Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions
Viaarxiv icon

Generalization Bounds for Transfer Learning with Pretrained Classifiers

Add code
Dec 23, 2022
Figure 1 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 2 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 3 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 4 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Viaarxiv icon