Time is Encoded in the Weights of Finetuned Language Models

Dec 30, 2023

Kai Nylund, Suchin Gururangan, Noah A. Smith

Abstract: We present time vectors, a simple tool to customize language models to new time periods. Time vectors are created by finetuning a language model on data from a single time (e.g., a year or month), and then subtracting the weights of the original pretrained model. This vector specifies a direction in weight space that, as our experiments show, improves performance on text from that time period. Time vectors specialized to adjacent time periods appear to be positioned closer together in a manifold. Using this structure, we interpolate between time vectors to induce new models that perform better on intervening and future time periods, without any additional training. We demonstrate the consistency of our findings across different tasks, domains, model sizes, and time scales. Our results suggest that time is encoded in the weight space of finetuned models.
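
The weight arithmetic described in the abstract can be illustrated with a minimal sketch. It assumes model weights are stored as a plain mapping from parameter name to a flat list of floats, and it assumes a simple linear interpolation with a single coefficient `alpha`; the abstract does not specify the weighting scheme, so that choice and the names `time_vector`, `interpolate`, and `apply_vector` are hypothetical, as are the toy values.

```python
from typing import Dict, List

# Hypothetical flat representation: parameter name -> list of values.
Weights = Dict[str, List[float]]


def time_vector(finetuned: Weights, pretrained: Weights) -> Weights:
    """Subtract pretrained weights from finetuned weights, parameter by parameter."""
    return {
        name: [f - p for f, p in zip(finetuned[name], pretrained[name])]
        for name in pretrained
    }


def interpolate(tau_a: Weights, tau_b: Weights, alpha: float) -> Weights:
    """Assumed scheme: alpha * tau_a + (1 - alpha) * tau_b."""
    return {
        name: [alpha * a + (1 - alpha) * b for a, b in zip(tau_a[name], tau_b[name])]
        for name in tau_a
    }


def apply_vector(pretrained: Weights, tau: Weights) -> Weights:
    """Add a time vector back onto the pretrained weights to get a new model."""
    return {
        name: [p + t for p, t in zip(pretrained[name], tau[name])]
        for name in pretrained
    }


# Toy weights standing in for a pretrained model and two finetuned copies.
pretrained = {"w": [1.0, 2.0]}
model_2012 = {"w": [1.5, 2.0]}
model_2016 = {"w": [1.0, 3.0]}

tau_2012 = time_vector(model_2012, pretrained)
tau_2016 = time_vector(model_2016, pretrained)

# Induce a model for an intervening period without further training.
model_2014 = apply_vector(pretrained, interpolate(tau_2012, tau_2016, 0.5))
```

With real models the same three operations would be applied to every tensor in the checkpoint rather than to short lists.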

* Added references to Jaidka et al. (2018) and Loureiro et al. (2022)
