Picture for Hattie Zhou

Hattie Zhou

A Formal Framework for Understanding Length Generalization in Transformers

Add code
Oct 03, 2024
Figure 1 for A Formal Framework for Understanding Length Generalization in Transformers
Figure 2 for A Formal Framework for Understanding Length Generalization in Transformers
Figure 3 for A Formal Framework for Understanding Length Generalization in Transformers
Figure 4 for A Formal Framework for Understanding Length Generalization in Transformers
Viaarxiv icon

Step-by-Step Diffusion: An Elementary Tutorial

Add code
Jun 13, 2024
Figure 1 for Step-by-Step Diffusion: An Elementary Tutorial
Figure 2 for Step-by-Step Diffusion: An Elementary Tutorial
Figure 3 for Step-by-Step Diffusion: An Elementary Tutorial
Figure 4 for Step-by-Step Diffusion: An Elementary Tutorial
Viaarxiv icon

Vanishing Gradients in Reinforcement Finetuning of Language Models

Add code
Oct 31, 2023
Figure 1 for Vanishing Gradients in Reinforcement Finetuning of Language Models
Figure 2 for Vanishing Gradients in Reinforcement Finetuning of Language Models
Figure 3 for Vanishing Gradients in Reinforcement Finetuning of Language Models
Figure 4 for Vanishing Gradients in Reinforcement Finetuning of Language Models
Viaarxiv icon

What Algorithms can Transformers Learn? A Study in Length Generalization

Add code
Oct 24, 2023
Viaarxiv icon

Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok

Add code
Jun 23, 2023
Viaarxiv icon

Teaching Algorithmic Reasoning via In-context Learning

Add code
Nov 15, 2022
Viaarxiv icon

Fortuitous Forgetting in Connectionist Networks

Add code
Feb 01, 2022
Figure 1 for Fortuitous Forgetting in Connectionist Networks
Figure 2 for Fortuitous Forgetting in Connectionist Networks
Figure 3 for Fortuitous Forgetting in Connectionist Networks
Figure 4 for Fortuitous Forgetting in Connectionist Networks
Viaarxiv icon

LCA: Loss Change Allocation for Neural Network Training

Add code
Sep 03, 2019
Figure 1 for LCA: Loss Change Allocation for Neural Network Training
Figure 2 for LCA: Loss Change Allocation for Neural Network Training
Figure 3 for LCA: Loss Change Allocation for Neural Network Training
Figure 4 for LCA: Loss Change Allocation for Neural Network Training
Viaarxiv icon

Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask

Add code
May 03, 2019
Figure 1 for Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask
Figure 2 for Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask
Figure 3 for Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask
Figure 4 for Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask
Viaarxiv icon