Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Dec 02, 2024

Yuma Toji, Jun Takahashi, Vwani Roychowdhury, Hideyuki Miyahara

Figure 1 for First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Figure 2 for First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Figure 3 for First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Figure 4 for First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Share this with someone who'll enjoy it:

Abstract:Several power-law critical properties involving different statistics in natural languages -- reminiscent of scaling properties of physical systems at or near phase transitions -- have been documented for decades. The recent rise of large language models (LLMs) has added further evidence and excitement by providing intriguing similarities with notions in physics such as scaling laws and emergent abilities. However, specific instances of classes of generative language models that exhibit phase transitions, as understood by the statistical physics community, are lacking. In this work, inspired by the one-dimensional Potts model in statistical physics we construct a simple probabilistic language model that falls under the class of context sensitive grammars (CSG), and numerically demonstrate an unambiguous phase transition in the framework of a natural language model. We explicitly show that a precisely defined order parameter -- that captures symbol frequency biases in the sentences generated by the language model -- changes from strictly 0 to a strictly nonzero value (in the infinite-length limit of sentences), implying a mathematical singularity arising when tuning the parameter of the stochastic language model we consider. Furthermore, we identify the phase transition as a variant of the Berezinskii-Kosterlitz-Thouless (BKT) transition, which is known to exhibit critical properties not only at the transition point but also in the entire phase. This finding leads to the possibility that critical properties in natural languages may not require careful fine-tuning nor self-organized criticality, but is generically explained by the underlying connection between language structures and the BKT phases.

View paper on

Share this with someone who'll enjoy it:

Title:First numerical observation of the Berezinskii-Kosterlitz-Thouless transition in language models

Paper and Code