Alice Rigg

Converting MLPs into Polynomials in Closed Form

Feb 03, 2025
Via arXiv

Bilinear Convolution Decomposition for Causal RL Interpretability

Dec 01, 2024
Via arXiv

Bilinear MLPs enable weight-based mechanistic interpretability

Oct 10, 2024
Via arXiv

Weight-based Decomposition: A Case for Bilinear MLPs

Jun 06, 2024
Via arXiv