Picture for Erblina Purelku

Erblina Purelku

PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits

Add code
Apr 09, 2024
Viaarxiv icon