Picture for Sviatoslav Chalnev

Sviatoslav Chalnev

Improving Steering Vectors by Targeting Sparse Autoencoder Features

Add code
Nov 04, 2024
Viaarxiv icon