Picture for Logan Riggs

Logan Riggs

Decomposing The Dark Matter of Sparse Autoencoders

Add code
Oct 18, 2024
Viaarxiv icon

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Add code
Sep 19, 2023
Viaarxiv icon