Title: Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models

Sep 09, 2024

Emily Cheng, Richard J. Antonello

Abstract: Research has repeatedly demonstrated that intermediate hidden states extracted from large language models (LLMs) can predict measured brain responses to natural language stimuli. Yet very little is known about the representational properties that enable this high prediction performance. Why are the intermediate layers, and not the output layers, best suited to this unique and highly general transfer task? In this work, we show that evidence from language encoding models in fMRI supports the existence of a two-phase abstraction process within LLMs. We use manifold learning methods to show that this abstraction process arises naturally over the course of training a language model, and that the first "composition" phase of this process is compressed into fewer layers as training continues. Finally, we demonstrate a strong correspondence between layerwise encoding performance and the intrinsic dimensionality of LLM representations. We give initial evidence that this correspondence derives primarily from the inherent compositionality of LLMs and not from their next-word prediction properties.
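
To make "intrinsic dimensionality" concrete, here is a minimal sketch of one common estimator, the maximum-likelihood form of TwoNN, applied to synthetic points. The abstract does not say which manifold learning method the authors use, so the choice of TwoNN is an assumption, and the function name `twonn_intrinsic_dimension` and the toy data are hypothetical. In practice the rows of `points` would be one layer's hidden-state vectors.

```python
import math
import random


def twonn_intrinsic_dimension(points):
    """Maximum-likelihood TwoNN estimate of intrinsic dimension.

    Assumption: TwoNN is used here for illustration only; the source
    abstract does not name a specific estimator.
    """
    log_ratios = []
    for i, p in enumerate(points):
        # Distances from point i to every other point, smallest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        r1, r2 = dists[0], dists[1]
        if r1 > 0:  # skip exact duplicates, whose ratio is undefined
            log_ratios.append(math.log(r2 / r1))
    # The ratios r2/r1 follow a Pareto law whose shape parameter is the
    # dimension; its maximum-likelihood estimate is N / sum(log(r2/r1)).
    return len(log_ratios) / sum(log_ratios)


# Toy data: a 2-D Gaussian cloud linearly embedded in 10 ambient dimensions.
random.seed(0)
embedding = [[random.gauss(0, 1) for _ in range(2)] for _ in range(10)]
points = []
for _ in range(500):
    u, v = random.gauss(0, 1), random.gauss(0, 1)
    points.append([row[0] * u + row[1] * v for row in embedding])

estimate = twonn_intrinsic_dimension(points)
print(f"ambient dimension: 10, estimated intrinsic dimension: {estimate:.2f}")
```

The estimate should land close to 2 rather than 10, because the estimator responds to the dimension of the surface the points occupy and not to the size of the vectors. Running such an estimator once per layer would give a layerwise dimensionality profile of the kind the abstract compares against encoding performance.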

* Equal contribution from both authors. Submitted to the NeurIPS NeuroAI workshop 2024.
