Picture for Anej Svete

Anej Svete

Context-Free Recognition with Transformers

Add code
Jan 05, 2026
Viaarxiv icon

Probability Distributions Computed by Hard-Attention Transformers

Add code
Oct 31, 2025
Figure 1 for Probability Distributions Computed by Hard-Attention Transformers
Figure 2 for Probability Distributions Computed by Hard-Attention Transformers
Viaarxiv icon

Information Locality as an Inductive Bias for Neural Language Models

Add code
Jun 05, 2025
Figure 1 for Information Locality as an Inductive Bias for Neural Language Models
Figure 2 for Information Locality as an Inductive Bias for Neural Language Models
Figure 3 for Information Locality as an Inductive Bias for Neural Language Models
Figure 4 for Information Locality as an Inductive Bias for Neural Language Models
Viaarxiv icon

Unique Hard Attention: A Tale of Two Sides

Add code
Mar 18, 2025
Figure 1 for Unique Hard Attention: A Tale of Two Sides
Figure 2 for Unique Hard Attention: A Tale of Two Sides
Viaarxiv icon

Counterfactual Generation from Language Models

Add code
Nov 11, 2024
Figure 1 for Counterfactual Generation from Language Models
Figure 2 for Counterfactual Generation from Language Models
Figure 3 for Counterfactual Generation from Language Models
Figure 4 for Counterfactual Generation from Language Models
Viaarxiv icon

Training Neural Networks as Recognizers of Formal Languages

Add code
Nov 11, 2024
Viaarxiv icon

An $\mathbf{L^*}$ Algorithm for Deterministic Weighted Regular Languages

Add code
Nov 09, 2024
Viaarxiv icon

Can Transformers Learn $n$-gram Language Models?

Add code
Oct 03, 2024
Figure 1 for Can Transformers Learn $n$-gram Language Models?
Figure 2 for Can Transformers Learn $n$-gram Language Models?
Figure 3 for Can Transformers Learn $n$-gram Language Models?
Figure 4 for Can Transformers Learn $n$-gram Language Models?
Viaarxiv icon

On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning

Add code
Jun 20, 2024
Viaarxiv icon

A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors

Add code
Jun 14, 2024
Figure 1 for A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors
Figure 2 for A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors
Figure 3 for A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors
Figure 4 for A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors
Viaarxiv icon