İlker Işık

Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence

Oct 22, 2024

İlker Işık, Ramazan Gokberk Cinbis, Ebru Aydin Gol

Abstract: We propose a novel approach for learning interchangeable tokens in language models to obtain an extendable vocabulary that can generalize to new tokens. Our method is designed to address alpha-equivalence, the principle that renaming bound variables in a syntactic expression preserves semantics. This property arises in many formal languages such as temporal logics, in which all proposition symbols represent the same concept but are distinguishable from each other. To handle such tokens, we develop a dual-part embedding approach. The first part is shared across all interchangeable tokens, thereby enforcing that they represent the same core concept. The second part is randomly generated for each token, which enables distinguishability. We evaluate our method in a Transformer encoder-decoder model on two tasks: solving linear temporal logic formulae and copying with extendable vocabulary. Our method demonstrates promising generalization capabilities in addition to introducing a favorable inductive bias for alpha-equivalence.

* 14 pages, 5 figures
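
The dual-part embedding described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `make_interchangeable_embeddings`, the dimensions, the Gaussian sampling, and the plain concatenation of the two parts are all assumptions made for illustration. In the paper's setting the shared part would be a learned parameter inside a Transformer; here it is a fixed list.

```python
import random

def make_interchangeable_embeddings(num_tokens, shared_part, random_dim, seed=0):
    """Build one embedding per interchangeable token (hypothetical helper).

    Each embedding is the concatenation of:
      * shared_part: identical for every token (the common "concept"),
      * a random part: sampled independently per token (distinguishability).
    The concatenation layout and Gaussian sampling are assumptions.
    """
    rng = random.Random(seed)
    embeddings = []
    for _ in range(num_tokens):
        random_part = [rng.gauss(0.0, 1.0) for _ in range(random_dim)]
        embeddings.append(list(shared_part) + random_part)
    return embeddings

# Stand-in for a learned shared vector (values are arbitrary).
shared = [0.5, -0.25, 0.1]
embeddings = make_interchangeable_embeddings(num_tokens=4, shared_part=shared, random_dim=5)
```

Because the random part is sampled rather than learned per token, the vocabulary can in principle be extended by sampling vectors for additional tokens, which is the extendability the abstract refers to.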


Learning to Estimate System Specifications in Linear Temporal Logic using Transformers and Mamba

May 31, 2024

İlker Işık, Ebru Aydin Gol, Ramazan Gokberk Cinbis

Abstract: Temporal logic is a framework for representing and reasoning about propositions that evolve over time. It is commonly used for specifying requirements in various domains, including hardware and software systems, as well as robotics. Specification mining or formula generation involves extracting temporal logic formulae from system traces and has numerous applications, such as detecting bugs and improving interpretability. Although there has been a surge of deep learning-based methods for temporal logic satisfiability checking in recent years, the specification mining literature has been lagging behind in adopting deep learning methods despite their many advantages, such as scalability. In this paper, we introduce autoregressive models that can generate linear temporal logic formulae from traces, towards addressing the specification mining problem. We propose multiple architectures for this task: transformer encoder-decoder, decoder-only transformer, and Mamba, which is an emerging alternative to transformer models. Additionally, we devise a metric for quantifying the distinctiveness of the generated formulae and a straightforward algorithm to enforce the syntax constraints. Our experiments show that the proposed architectures yield promising results, generating correct and distinct formulae at a fraction of the compute cost needed for the combinatorial baseline.

* 20 pages, 15 figures
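
To make the relationship between traces and formulae concrete, the sketch below checks whether a linear temporal logic formula holds on a trace. It is an illustration only and not taken from the paper: it assumes finite-trace semantics (the paper may use a different trace representation), and the tuple encoding of formulae and the function name `holds` are hypothetical.

```python
def holds(formula, trace, i=0):
    """Evaluate an LTL formula at position i of a finite trace.

    trace   : list of sets; trace[k] holds the propositions true at step k.
    formula : nested tuples, e.g. ("U", ("ap", "a"), ("ap", "b")).
    Finite-trace semantics are an assumption of this sketch.
    """
    op = formula[0]
    if op == "ap":                      # atomic proposition
        return formula[1] in trace[i]
    if op == "not":
        return not holds(formula[1], trace, i)
    if op == "and":
        return holds(formula[1], trace, i) and holds(formula[2], trace, i)
    if op == "or":
        return holds(formula[1], trace, i) or holds(formula[2], trace, i)
    if op == "X":                       # next: requires a successor step
        return i + 1 < len(trace) and holds(formula[1], trace, i + 1)
    if op == "F":                       # eventually
        return any(holds(formula[1], trace, j) for j in range(i, len(trace)))
    if op == "G":                       # always (until the trace ends)
        return all(holds(formula[1], trace, j) for j in range(i, len(trace)))
    if op == "U":                       # until: left holds until right does
        for j in range(i, len(trace)):
            if holds(formula[2], trace, j):
                return True
            if not holds(formula[1], trace, j):
                return False
        return False
    raise ValueError(f"unknown operator: {op}")

# Example trace: "a" holds for two steps, then "b" holds.
trace = [{"a"}, {"a"}, {"b"}]
a_until_b = ("U", ("ap", "a"), ("ap", "b"))
always_a = ("G", ("ap", "a"))
result_until = holds(a_until_b, trace)
result_always = holds(always_a, trace)
```

A checker of this kind is one way to judge whether a generated formula is consistent with a given trace; specification mining runs in the opposite direction, proposing formulae that such a check would accept.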
