Picture for Emmanuel Abbe

Emmanuel Abbe

Learning High-Degree Parities: The Crucial Role of the Initialization

Add code
Dec 06, 2024
Viaarxiv icon

Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization

Add code
Oct 30, 2024
Figure 1 for Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization
Viaarxiv icon

Visual Scratchpads: Enabling Global Reasoning in Vision

Add code
Oct 10, 2024
Figure 1 for Visual Scratchpads: Enabling Global Reasoning in Vision
Figure 2 for Visual Scratchpads: Enabling Global Reasoning in Vision
Figure 3 for Visual Scratchpads: Enabling Global Reasoning in Vision
Figure 4 for Visual Scratchpads: Enabling Global Reasoning in Vision
Viaarxiv icon

How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad

Add code
Jun 10, 2024
Viaarxiv icon

On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions

Add code
Jun 10, 2024
Figure 1 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 2 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 3 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 4 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Viaarxiv icon

When can transformers reason with abstract symbols?

Add code
Oct 15, 2023
Figure 1 for When can transformers reason with abstract symbols?
Figure 2 for When can transformers reason with abstract symbols?
Figure 3 for When can transformers reason with abstract symbols?
Figure 4 for When can transformers reason with abstract symbols?
Viaarxiv icon

Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs

Add code
Jun 29, 2023
Viaarxiv icon

Transformers learn through gradual rank increase

Add code
Jun 12, 2023
Figure 1 for Transformers learn through gradual rank increase
Figure 2 for Transformers learn through gradual rank increase
Figure 3 for Transformers learn through gradual rank increase
Figure 4 for Transformers learn through gradual rank increase
Viaarxiv icon

SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics

Add code
Feb 21, 2023
Figure 1 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 2 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 3 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 4 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Viaarxiv icon

Generalization on the Unseen, Logic Reasoning and Degree Curriculum

Add code
Jan 30, 2023
Viaarxiv icon