Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

May 08, 2024

Sander Land, Max Bartolo

Figure 1 for Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Figure 2 for Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Figure 3 for Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Figure 4 for Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Share this with someone who'll enjoy it:

Abstract:The disconnect between tokenizer creation and model training in language models has been known to allow for certain inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted behaviour. Although such `glitch tokens' that are present in the tokenizer vocabulary, but are nearly or fully absent in training, have been observed across a variety of different models, a consistent way of identifying them has been missing. We present a comprehensive analysis of Large Language Model (LLM) tokenizers, specifically targeting this issue of detecting untrained and under-trained tokens. Through a combination of tokenizer analysis, model weight-based indicators, and prompting techniques, we develop effective methods for automatically detecting these problematic tokens. Our findings demonstrate the prevalence of such tokens across various models and provide insights into improving the efficiency and safety of language models.

* 16 pages, 4 figures. For associated code, see https://github.com/cohere-ai/magikarp/

View paper on

Share this with someone who'll enjoy it:

Title:Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Paper and Code