Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bingzhi Li

SLOG: A Structural Generalization Benchmark for Semantic Parsing

Oct 23, 2023

Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao, Najoung Kim

Abstract:The goal of compositional generalization benchmarks is to evaluate how well models generalize to new complex linguistic expressions. Existing benchmarks often focus on lexical generalization, the interpretation of novel lexical items in syntactic structures familiar from training; structural generalization tasks, where a model needs to interpret syntactic structures that are themselves unfamiliar from training, are often underrepresented, resulting in overly optimistic perceptions of how well models can generalize. We introduce SLOG, a semantic parsing dataset that extends COGS (Kim and Linzen, 2020) with 17 structural generalization cases. In our experiments, the generalization accuracy of Transformer models, including pretrained ones, only reaches 40.6%, while a structure-aware parser only achieves 70.8%. These results are far from the near-perfect accuracy existing models achieve on COGS, demonstrating the role of SLOG in foregrounding the large discrepancy between models' lexical and structural generalization capacities.

* Accepted to EMNLP 2023

Via

Access Paper or Ask Questions

Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement

Dec 08, 2022

Bingzhi Li, Guillaume Wisniewski, Benoît Crabbé

Abstract:The long-distance agreement, evidence for syntactic structure, is increasingly used to assess the syntactic generalization of Neural Language Models. Much work has shown that transformers are capable of high accuracy in varied agreement tasks, but the mechanisms by which the models accomplish this behavior are still not well understood. To better understand transformers' internal working, this work contrasts how they handle two superficially similar but theoretically distinct agreement phenomena: subject-verb and object-past participle agreement in French. Using probing and counterfactual analysis methods, our experiments show that i) the agreement task suffers from several confounders which partially question the conclusions drawn so far and ii) transformers handle subject-verb and object-past participle agreements in a way that is consistent with their modeling in theoretical linguistics.

* Accepted to TACL 2023

Via

Access Paper or Ask Questions

Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement

Sep 21, 2021

Bingzhi Li, Guillaume Wisniewski, Benoit Crabbé

Figure 1 for Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement

Figure 2 for Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement

Figure 3 for Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement

Figure 4 for Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement

Abstract:Many recent works have demonstrated that unsupervised sentence representations of neural networks encode syntactic information by observing that neural language models are able to predict the agreement between a verb and its subject. We take a critical look at this line of research by showing that it is possible to achieve high accuracy on this agreement task with simple surface heuristics, indicating a possible flaw in our assessment of neural networks' syntactic ability. Our fine-grained analyses of results on the long-range French object-verb agreement show that contrary to LSTMs, Transformers are able to capture a non-trivial amount of grammatical structure.

* Camera-ready for EMNLP'21

Via

Access Paper or Ask Questions