Thomas McGrath

Copy Suppression: Comprehensively Understanding an Attention Head

Oct 06, 2023
Via arXiv

The Hydra Effect: Emergent Self-repair in Language Model Computations

Jul 28, 2023
Via arXiv

Tracr: Compiled Transformers as a Laboratory for Interpretability

Jan 12, 2023
Via arXiv

Acquisition of Chess Knowledge in AlphaZero

Nov 27, 2021
Via arXiv