Kola Ayonrinde

Adaptive Sparse Allocation with Mutual Choice & Feature Choice Sparse Autoencoders

Nov 04, 2024
Via arXiv

Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations with MDL-SAEs

Oct 15, 2024
Via arXiv