Joseph Bloom

Sparse Autoencoders Do Not Find Canonical Units of Analysis

Feb 07, 2025
Via arXiv

Open Problems in Mechanistic Interpretability

Jan 27, 2025
Via arXiv

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders

Sep 25, 2024
Via arXiv