Picture for Jose M. Oramas

Jose M. Oramas

Bilinear MLPs enable weight-based mechanistic interpretability

Add code
Oct 10, 2024
Viaarxiv icon