Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Language Models Represent Space and Time

Oct 03, 2023

Wes Gurnee, Max Tegmark

Figure 1 for Language Models Represent Space and Time

Figure 2 for Language Models Represent Space and Time

Figure 3 for Language Models Represent Space and Time

Figure 4 for Language Models Represent Space and Time

Share this with someone who'll enjoy it:

Abstract:The capabilities of large language models (LLMs) have sparked debate over whether such systems just learn an enormous collection of superficial statistics or a coherent model of the data generating process -- a world model. We find evidence for the latter by analyzing the learned representations of three spatial datasets (world, US, NYC places) and three temporal datasets (historical figures, artworks, news headlines) in the Llama-2 family of models. We discover that LLMs learn linear representations of space and time across multiple scales. These representations are robust to prompting variations and unified across different entity types (e.g. cities and landmarks). In addition, we identify individual ``space neurons'' and ``time neurons'' that reliably encode spatial and temporal coordinates. Our analysis demonstrates that modern LLMs acquire structured knowledge about fundamental dimensions such as space and time, supporting the view that they learn not merely superficial statistics, but literal world models.

View paper on

Share this with someone who'll enjoy it:

Title:Language Models Represent Space and Time

Paper and Code