Callum McDougall

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability

Mar 13, 2025
Via arXiv

Copy Suppression: Comprehensively Understanding an Attention Head

Oct 06, 2023
Via arXiv