Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Monotonic Representation of Numeric Properties in Language Models

Mar 15, 2024

Benjamin Heinzerling, Kentaro Inui

Figure 1 for Monotonic Representation of Numeric Properties in Language Models

Figure 2 for Monotonic Representation of Numeric Properties in Language Models

Figure 3 for Monotonic Representation of Numeric Properties in Language Models

Figure 4 for Monotonic Representation of Numeric Properties in Language Models

Share this with someone who'll enjoy it:

Abstract:Language models (LMs) can express factual knowledge involving numeric properties such as Karl Popper was born in 1902. However, how this information is encoded in the model's internal representations is not understood well. Here, we introduce a simple method for finding and editing representations of numeric properties such as an entity's birth year. Empirically, we find low-dimensional subspaces that encode numeric properties monotonically, in an interpretable and editable fashion. When editing representations along directions in these subspaces, LM output changes accordingly. For example, by patching activations along a "birthyear" direction we can make the LM express an increasingly late birthyear: Karl Popper was born in 1929, Karl Popper was born in 1957, Karl Popper was born in 1968. Property-encoding directions exist across several numeric properties in all models under consideration, suggesting the possibility that monotonic representation of numeric properties consistently emerges during LM pretraining. Code: https://github.com/bheinzerling/numeric-property-repr

View paper on

Share this with someone who'll enjoy it:

Title:Monotonic Representation of Numeric Properties in Language Models

Paper and Code