Tim Lawson

Sparse Autoencoders Can Interpret Randomly Initialized Transformers

Jan 29, 2025
Via arXiv

Residual Stream Analysis with Multi-Layer SAEs

Sep 06, 2024
Via arXiv