Abstract: The success of neural language models (LMs) on many technological tasks has raised their potential relevance as scientific theories of language, despite some clear differences between LM training and child language acquisition. In this paper, we argue that some of the most prominent benchmarks for evaluating the syntactic capacities of LMs may not be sufficiently rigorous. In particular, we show that template-based benchmarks lack the structural diversity commonly found in theoretical and psychological studies of language. When trained on small-scale data modeling child language acquisition, LMs can be readily matched by simple baseline models. We advocate for the use of readily available, carefully curated datasets that have been rated for gradient acceptability by large pools of native speakers and are designed specifically to probe the structural basis of grammar. On one such dataset, the LI-Adger dataset, LMs evaluate sentences in ways inconsistent with human language users. We conclude with suggestions for better connecting LMs with the empirical study of child language acquisition.
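As an illustration of the kind of evaluation this abstract describes, here is a minimal sketch (not the paper's code) of scoring sentences with an off-the-shelf causal LM and correlating the scores with human gradient acceptability ratings. The model choice (GPT-2 via Hugging Face transformers), the example sentences, the ratings, and the use of mean per-token log-probability as the LM's acceptability score are all illustrative assumptions.

```python
import torch
from scipy.stats import spearmanr
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def mean_logprob(sentence: str) -> float:
    """Average per-token log-probability of the sentence under the LM."""
    ids = tokenizer(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        # With labels=input_ids, the model returns mean cross-entropy loss.
        loss = model(ids, labels=ids).loss
    return -loss.item()

# Hypothetical island-effect contrasts with made-up 7-point rating means.
sentences = [
    "Who do you think that John saw?",
    "Who do you wonder whether John saw?",
    "What did the teacher say the student wrote?",
    "What did the teacher mutter whether the student wrote?",
]
human_ratings = [6.1, 3.0, 6.4, 2.2]

lm_scores = [mean_logprob(s) for s in sentences]
rho, _ = spearmanr(lm_scores, human_ratings)
print(f"Spearman correlation with human ratings: {rho:.2f}")
```

A rank correlation of this sort is one common way to compare LM scores against gradient human judgments; a mismatch on minimally contrasting sentence pairs is the kind of inconsistency the abstract reports.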
Abstract: As children acquire knowledge of their language's morphology, they invariably discover the productive processes that can generalize to new words. Morphological learning is made challenging by the fact that even fully productive rules have exceptions, as in the well-known case of English past-tense verbs, which pits the productive -ed rule against the irregular verbs. The Tolerance Principle is a recent proposal that provides a precise threshold for the number of exceptions a productive rule can withstand. Its empirical application so far, however, requires the researcher to fully specify rules defined over a set of words. We propose a greedy search model that automatically hypothesizes rules and evaluates their productivity over a vocabulary. When the search for broader productivity fails, the model recursively subdivides the vocabulary and continues the search for productivity over narrower rules. Trained on psychologically realistic data from child-directed input, our model displays developmental patterns observed in child morphology acquisition, including the notoriously complex case of German noun pluralization. Despite receiving only a fraction of the training data, it also produces responses to nonce words that are more similar to those of human subjects than the responses of current neural network models.
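Below is a minimal sketch of the recursive productivity search this abstract describes, built around the Tolerance Principle threshold N / ln N (Yang 2016): a rule over N items is productive only if its exceptions e satisfy e <= N / ln N. The rule representation (a suffix conditioned on the stem's final segment), the subdivision strategy, and the toy vocabulary are simplified illustrative assumptions, not the paper's implementation.

```python
import math
from collections import Counter

def tolerates(n: int, e: int) -> bool:
    """Tolerance Principle: a rule over N items survives e <= N / ln N exceptions."""
    return e == 0 or (n > 1 and e <= n / math.log(n))

def find_rules(vocab: dict[str, str], context: str = "any stem") -> list[tuple[str, str]]:
    """Greedy recursive search: try the most common suffix over the whole
    vocabulary; if it fails the Tolerance Principle, partition the
    vocabulary by final segment and search for narrower rules."""
    suffix, count = Counter(vocab.values()).most_common(1)[0]
    if tolerates(len(vocab), len(vocab) - count):
        return [(context, suffix)]
    parts: dict[str, dict[str, str]] = {}
    for word, sfx in vocab.items():
        parts.setdefault(word[-1], {})[word] = sfx
    if len(parts) == 1:  # cannot subdivide further: lexicalize each item
        return [(f"word '{w}'", s) for w, s in vocab.items()]
    return [rule
            for seg, part in sorted(parts.items())
            for rule in find_rules(part, context=f"stem ends in '{seg}'")]

# Toy English past-tense vocabulary: stem -> suffix class.
verbs = {"walk": "-ed", "jump": "-ed", "play": "-ed", "kiss": "-ed",
         "hunt": "-ed", "sing": "irreg", "ring": "irreg", "swim": "irreg",
         "go": "irreg", "eat": "irreg"}
for context, suffix in find_rules(verbs):
    print(f"{context}: apply {suffix}")
```

On this toy vocabulary the global -ed rule fails the threshold (5 exceptions against N/ln N of about 4.3 for N = 10), so the search recurses to narrower rules over final segments; the narrow "ends in 't'" rule then tolerates "eat" as an exception, illustrating how productivity can hold locally when it fails globally.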
Abstract: We introduce and implement a cognitively plausible model for learning from generic language: statements that express generalizations about members of a category and that are an important aspect of concept development in language acquisition (Carlson & Pelletier, 1995; Gelman, 2009). We extend ADAM, a computational framework designed to model grounded language acquisition, by introducing the concept network. This new layer of abstraction enables the system to encode knowledge learned from generic statements and to represent the associations between the concepts the system has learned. Through three tasks that utilize the concept network, we demonstrate that our extensions enable ADAM to acquire generic information, and we provide an example of how ADAM can be used to model language acquisition.
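To make the concept-network idea concrete, here is a schematic sketch of such a structure: nodes are concepts, and weighted edges record associations asserted by generic statements such as "birds fly." All names and the additive update rule are hypothetical illustrations, not ADAM's actual API.

```python
from collections import defaultdict

class ConceptNetwork:
    """Concepts as nodes; weighted edges as learned associations."""
    def __init__(self):
        # association strength from a category concept to an attribute concept
        self.edges: dict[str, dict[str, float]] = defaultdict(lambda: defaultdict(float))

    def learn_generic(self, category: str, attribute: str, strength: float = 1.0):
        """Encode a generic statement, e.g. learn_generic('bird', 'fly')."""
        self.edges[category][attribute] += strength

    def associations(self, category: str) -> list[tuple[str, float]]:
        """Attributes linked to a category, strongest first."""
        return sorted(self.edges[category].items(), key=lambda kv: -kv[1])

net = ConceptNetwork()
net.learn_generic("bird", "fly")
net.learn_generic("bird", "lay-eggs")
net.learn_generic("bird", "fly")  # repeated evidence strengthens the link
print(net.associations("bird"))   # [('fly', 2.0), ('lay-eggs', 1.0)]
```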
Abstract: We present ADAM, a software system for designing and running child language learning experiments in Python. The system uses a virtual world to simulate a grounded language acquisition process in which the language learner applies cognitively plausible learning algorithms to form perceptual and linguistic representations of the observed world. ADAM's modular design makes it easy to design and test different language learning curricula as well as learning algorithms. In this report, we describe the architecture of the ADAM system in detail and illustrate its components with examples. We provide our code.
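As a sketch of the modular experiment loop this report describes, the code below pairs a curriculum of (scene, utterance) observations with a pluggable learner. Every class and method name here is hypothetical, chosen for illustration rather than taken from ADAM's actual API; the toy learner intersects scene features across situations, one simple stand-in for a cognitively plausible learning algorithm.

```python
from typing import Iterable, Protocol

class Learner(Protocol):
    def observe(self, scene: dict, utterance: str) -> None: ...
    def describe(self, scene: dict) -> str: ...

class SubsetLearner:
    """Toy cross-situational learner: intersects scene features per word."""
    def __init__(self):
        self.meanings: dict[str, set] = {}

    def observe(self, scene: dict, utterance: str) -> None:
        features = set(scene.items())
        for word in utterance.split():
            self.meanings[word] = self.meanings.get(word, features) & features

    def describe(self, scene: dict) -> str:
        features = set(scene.items())
        return " ".join(w for w, m in self.meanings.items() if m <= features)

def run_experiment(curriculum: Iterable[tuple[dict, str]], learner: Learner) -> None:
    """Feed each paired perceptual scene and utterance to the learner."""
    for scene, utterance in curriculum:
        learner.observe(scene, utterance)

curriculum = [({"shape": "ball", "color": "red"}, "red ball"),
              ({"shape": "ball", "color": "blue"}, "blue ball")]
learner = SubsetLearner()
run_experiment(curriculum, learner)
print(learner.describe({"shape": "ball", "color": "red"}))  # -> "red ball"
```

Because the curriculum and the learner meet only at this narrow interface, either can be swapped independently, which is the kind of modularity the abstract attributes to ADAM.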