Picture for Wentao Shu

Wentao Shu

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Add code
Oct 27, 2024
Viaarxiv icon

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Add code
Oct 10, 2024
Viaarxiv icon

Automatically Identifying Local and Global Circuits with Linear Computation Graphs

Add code
May 22, 2024
Viaarxiv icon