Picture for Max Tegmark

Max Tegmark

MIT

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

Add code
Feb 23, 2025
Viaarxiv icon

Harmonic Loss Trains Interpretable AI Models

Add code
Feb 03, 2025
Viaarxiv icon

Language Models Use Trigonometry to Do Addition

Add code
Feb 02, 2025
Viaarxiv icon

Low-Rank Adapting Models for Sparse Autoencoders

Add code
Jan 31, 2025
Figure 1 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 2 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 3 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 4 for Low-Rank Adapting Models for Sparse Autoencoders
Viaarxiv icon

Open Problems in Mechanistic Interpretability

Add code
Jan 27, 2025
Figure 1 for Open Problems in Mechanistic Interpretability
Figure 2 for Open Problems in Mechanistic Interpretability
Figure 3 for Open Problems in Mechanistic Interpretability
Figure 4 for Open Problems in Mechanistic Interpretability
Viaarxiv icon

Physics of Skill Learning

Add code
Jan 21, 2025
Viaarxiv icon

Decomposing The Dark Matter of Sparse Autoencoders

Add code
Oct 18, 2024
Figure 1 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 2 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 3 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 4 for Decomposing The Dark Matter of Sparse Autoencoders
Viaarxiv icon

Efficient Dictionary Learning with Switch Sparse Autoencoders

Add code
Oct 10, 2024
Figure 1 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 2 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 3 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 4 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Viaarxiv icon

Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning

Add code
Oct 10, 2024
Figure 1 for Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning
Figure 2 for Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning
Figure 3 for Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning
Figure 4 for Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning
Viaarxiv icon

KAN 2.0: Kolmogorov-Arnold Networks Meet Science

Add code
Aug 19, 2024
Viaarxiv icon