Picture for Junxuan Wang

Junxuan Wang

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Add code
Oct 27, 2024
Viaarxiv icon

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Add code
Oct 10, 2024
Figure 1 for Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
Figure 2 for Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
Figure 3 for Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
Figure 4 for Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
Viaarxiv icon

Automatically Identifying Local and Global Circuits with Linear Computation Graphs

Add code
May 22, 2024
Viaarxiv icon