Jacob Dunefsky

Transcoders Find Interpretable LLM Feature Circuits

Jun 17, 2024
Via arXiv

Observable Propagation: A Data-Efficient Approach to Uncover Feature Vectors in Transformers

Dec 26, 2023
Via arXiv