Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Arabella Sinclair

Do Language Models Exhibit Human-like Structural Priming Effects?

Jun 07, 2024

Jaap Jumelet, Willem Zuidema, Arabella Sinclair

Figure 1 for Do Language Models Exhibit Human-like Structural Priming Effects?

Figure 2 for Do Language Models Exhibit Human-like Structural Priming Effects?

Figure 3 for Do Language Models Exhibit Human-like Structural Priming Effects?

Figure 4 for Do Language Models Exhibit Human-like Structural Priming Effects?

Abstract:We explore which linguistic factors -- at the sentence and token level -- play an important role in influencing language model predictions, and investigate whether these are reflective of results found in humans and human corpora (Gries and Kootstra, 2017). We make use of the structural priming paradigm, where recent exposure to a structure facilitates processing of the same structure. We don't only investigate whether, but also where priming effects occur, and what factors predict them. We show that these effects can be explained via the inverse frequency effect, known in human priming, where rarer elements within a prime increase priming effects, as well as lexical dependence between prime and target. Our results provide an important piece in the puzzle of understanding how properties within their context affect structural prediction in language models.

* ACL Findings 2024

Via

Access Paper or Ask Questions

Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Nov 21, 2023

Aron Molnar, Jaap Jumelet, Mario Giulianelli, Arabella Sinclair

Figure 1 for Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Figure 2 for Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Figure 3 for Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Figure 4 for Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Abstract:Language models are often used as the backbone of modern dialogue systems. These models are pre-trained on large amounts of written fluent language. Repetition is typically penalised when evaluating language model generations. However, it is a key component of dialogue. Humans use local and partner specific repetitions; these are preferred by human users and lead to more successful communication in dialogue. In this study, we evaluate (a) whether language models produce human-like levels of repetition in dialogue, and (b) what are the processing mechanisms related to lexical re-use they use during comprehension. We believe that such joint analysis of model production and comprehension behaviour can inform the development of cognitively inspired dialogue generation systems.

* CoNLL 2023

Via

Access Paper or Ask Questions

Construction Repetition Reduces Information Rate in Dialogue

Oct 15, 2022

Mario Giulianelli, Arabella Sinclair, Raquel Fernández

Figure 1 for Construction Repetition Reduces Information Rate in Dialogue

Figure 2 for Construction Repetition Reduces Information Rate in Dialogue

Figure 3 for Construction Repetition Reduces Information Rate in Dialogue

Figure 4 for Construction Repetition Reduces Information Rate in Dialogue

Abstract:Speakers repeat constructions frequently in dialogue. Due to their peculiar information-theoretic properties, repetitions can be thought of as a strategy for cost-effective communication. In this study, we focus on the repetition of lexicalised constructions -- i.e., recurring multi-word units -- in English open-domain spoken dialogues. We hypothesise that speakers use construction repetition to mitigate information rate, leading to an overall decrease in utterance information content over the course of a dialogue. We conduct a quantitative analysis, measuring the information content of constructions and that of their containing utterances, estimating information content with an adaptive neural language model. We observe that construction usage lowers the information content of utterances. This facilitating effect (i) increases throughout dialogues, (ii) is boosted by repetition, (iii) grows as a function of repetition frequency and density, and (iv) is stronger for repetitions of referential constructions.

* In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022)

Via

Access Paper or Ask Questions

State-of-the-art generalisation research in NLP: a taxonomy and review

Oct 10, 2022

Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair(+10 more)

Figure 1 for State-of-the-art generalisation research in NLP: a taxonomy and review

Figure 2 for State-of-the-art generalisation research in NLP: a taxonomy and review

Figure 3 for State-of-the-art generalisation research in NLP: a taxonomy and review

Figure 4 for State-of-the-art generalisation research in NLP: a taxonomy and review

Abstract:The ability to generalise well is one of the primary desiderata of natural language processing (NLP). Yet, what `good generalisation' entails and how it should be evaluated is not well understood, nor are there any common standards to evaluate it. In this paper, we aim to lay the ground-work to improve both of these issues. We present a taxonomy for characterising and understanding generalisation research in NLP, we use that taxonomy to present a comprehensive map of published generalisation studies, and we make recommendations for which areas might deserve attention in the future. Our taxonomy is based on an extensive literature review of generalisation research, and contains five axes along which studies can differ: their main motivation, the type of generalisation they aim to solve, the type of data shift they consider, the source by which this data shift is obtained, and the locus of the shift within the modelling pipeline. We use our taxonomy to classify over 400 previous papers that test generalisation, for a total of more than 600 individual experiments. Considering the results of this review, we present an in-depth analysis of the current state of generalisation research in NLP, and make recommendations for the future. Along with this paper, we release a webpage where the results of our review can be dynamically explored, and which we intend to up-date as new NLP generalisation studies are published. With this work, we aim to make steps towards making state-of-the-art generalisation testing the new status quo in NLP.

* 35 pages of content + 53 pages of references

Via

Access Paper or Ask Questions

Syntactic Persistence in Language Models: Priming as a Window into Abstract Language Representations

Sep 30, 2021

Arabella Sinclair, Jaap Jumelet, Willem Zuidema, Raquel Fernández

Figure 1 for Syntactic Persistence in Language Models: Priming as a Window into Abstract Language Representations

Figure 2 for Syntactic Persistence in Language Models: Priming as a Window into Abstract Language Representations

Figure 3 for Syntactic Persistence in Language Models: Priming as a Window into Abstract Language Representations

Figure 4 for Syntactic Persistence in Language Models: Priming as a Window into Abstract Language Representations

Abstract:We investigate the extent to which modern, neural language models are susceptible to syntactic priming, the phenomenon where the syntactic structure of a sentence makes the same structure more probable in a follow-up sentence. We explore how priming can be used to study the nature of the syntactic knowledge acquired by these models. We introduce a novel metric and release Prime-LM, a large corpus where we control for various linguistic factors which interact with priming strength. We find that recent large Transformer models indeed show evidence of syntactic priming, but also that the syntactic generalisations learned by these models are to some extent modulated by semantic information. We report surprisingly strong priming effects when priming with multiple sentences, each with different words and meaning but with identical syntactic structure. We conclude that the syntactic priming paradigm is a highly useful, additional tool for gaining insights into the capacities of language models.

* Preprint, work in progress

Via

Access Paper or Ask Questions

Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Nov 09, 2020

Ece Takmaz, Mario Giulianelli, Sandro Pezzelle, Arabella Sinclair, Raquel Fernández

Figure 1 for Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Figure 2 for Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Figure 3 for Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Figure 4 for Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Abstract:Dialogue participants often refer to entities or situations repeatedly within a conversation, which contributes to its cohesiveness. Subsequent references exploit the common ground accumulated by the interlocutors and hence have several interesting properties, namely, they tend to be shorter and reuse expressions that were effective in previous mentions. In this paper, we tackle the generation of first and subsequent references in visually grounded dialogue. We propose a generation model that produces referring utterances grounded in both the visual and the conversational context. To assess the referring effectiveness of its output, we also implement a reference resolution system. Our experiments and analyses show that the model produces better, more effective referring utterances than a model not grounded in the dialogue context, and generates subsequent references that exhibit linguistic patterns akin to humans.

* In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

Via

Access Paper or Ask Questions