Picture for Alice Rigg

Alice Rigg

Bilinear Convolution Decomposition for Causal RL Interpretability

Add code
Dec 01, 2024
Viaarxiv icon

Bilinear MLPs enable weight-based mechanistic interpretability

Add code
Oct 10, 2024
Viaarxiv icon

Weight-based Decomposition: A Case for Bilinear MLPs

Add code
Jun 06, 2024
Viaarxiv icon