Picture for Magdalena Wache

Magdalena Wache

Factored space models: Towards causality between levels of abstraction

Add code
Dec 03, 2024
Viaarxiv icon

The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks

Add code
May 17, 2024
Viaarxiv icon