Picture for Ionnis Konstas

Ionnis Konstas

N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

Add code
Apr 22, 2023
Viaarxiv icon