Narmeen Oozeer

Activation Space Interventions Can Be Transferred Between Large Language Models

Mar 06, 2025
Via arXiv

Bilinear Convolution Decomposition for Causal RL Interpretability

Dec 01, 2024
Via arXiv