Picture for Michael Lan

Michael Lan

Activation Space Interventions Can Be Transferred Between Large Language Models

Add code
Mar 06, 2025
Viaarxiv icon

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

Add code
Oct 09, 2024
Figure 1 for Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Figure 2 for Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Figure 3 for Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Figure 4 for Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Viaarxiv icon

Locating Cross-Task Sequence Continuation Circuits in Transformers

Add code
Nov 07, 2023
Viaarxiv icon