Picture for Alice Rigg

Alice Rigg

Bilinear MLPs enable weight-based mechanistic interpretability

Add code
Oct 10, 2024
Viaarxiv icon

Weight-based Decomposition: A Case for Bilinear MLPs

Add code
Jun 06, 2024
Viaarxiv icon