Picture for Jacob Dunefsky

Jacob Dunefsky

Transcoders Find Interpretable LLM Feature Circuits

Add code
Jun 17, 2024
Viaarxiv icon

Observable Propagation: A Data-Efficient Approach to Uncover Feature Vectors in Transformers

Add code
Dec 26, 2023
Viaarxiv icon