Title: Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence

Oct 22, 2024

İlker Işık, Ramazan Gokberk Cinbis, Ebru Aydin Gol

Abstract: We propose a novel approach for learning interchangeable tokens in language models to obtain an extendable vocabulary that can generalize to new tokens. Our method is designed to address alpha-equivalence, the principle that renaming bound variables in a syntactic expression preserves semantics. This property arises in many formal languages such as temporal logics, in which all proposition symbols represent the same concept but are distinguishable from each other. To handle such tokens, we develop a dual-part embedding approach. The first part is shared across all interchangeable tokens, thereby enforcing that they represent the same core concept. The second part is randomly generated for each token, which enables distinguishability. We evaluate our method in a Transformer encoder-decoder model on two tasks: solving linear temporal logic formulae and copying with extendable vocabulary. Our method demonstrates promising generalization capabilities in addition to introducing a favorable inductive bias for alpha-equivalence.

* 14 pages, 5 figures
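
The dual-part embedding described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dimensions, the Gaussian sampling, the fixed values of the shared part, and the names `make_embedding`, `shared_part`, and `p_new` are all assumptions chosen for illustration. In a trained model the shared part would presumably be a learned parameter, and the paper may generate or normalize the random part differently.

```python
import random


def make_embedding(shared, random_dim, rng):
    """Concatenate the shared part with a freshly sampled random part.

    Hypothetical helper: the abstract only says the second part is
    "randomly generated for each token"; Gaussian sampling is an assumption.
    """
    random_part = [rng.gauss(0.0, 1.0) for _ in range(random_dim)]
    return shared + random_part


rng = random.Random(0)
shared_dim, random_dim = 4, 4

# Stand-in for the shared part. It is fixed here for illustration only;
# in a real model it would be learned during training (assumption).
shared_part = [0.1, -0.3, 0.7, 0.2]

# Interchangeable tokens, e.g. proposition symbols of a temporal logic formula.
# "p_new" stands for a symbol that was never seen during training.
tokens = ["a", "b", "c", "p_new"]
embeddings = {tok: make_embedding(shared_part, random_dim, rng) for tok in tokens}

for tok, emb in embeddings.items():
    print(tok, [round(x, 2) for x in emb])
```

Every token receives the same first half, which carries the common concept, and a different second half, which keeps the tokens distinguishable. Because a new token only needs a new random part, the vocabulary can be extended without adding trained parameters, at least in this simplified picture.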
