Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Language Models as Models of Language

Aug 13, 2024

Raphaël Millière

Figure 1 for Language Models as Models of Language

Figure 2 for Language Models as Models of Language

Share this with someone who'll enjoy it:

Abstract:This chapter critically examines the potential contributions of modern language models to theoretical linguistics. Despite their focus on engineering goals, these models' ability to acquire sophisticated linguistic knowledge from mere exposure to data warrants a careful reassessment of their relevance to linguistic theory. I review a growing body of empirical evidence suggesting that language models can learn hierarchical syntactic structure and exhibit sensitivity to various linguistic phenomena, even when trained on developmentally plausible amounts of data. While the competence/performance distinction has been invoked to dismiss the relevance of such models to linguistic theory, I argue that this assessment may be premature. By carefully controlling learning conditions and making use of causal intervention methods, experiments with language models can potentially constrain hypotheses about language acquisition and competence. I conclude that closer collaboration between theoretical linguists and computational researchers could yield valuable insights, particularly in advancing debates about linguistic nativism.

* Forthcoming in Nefdt, R., Dupre, G., \& Stanton, K. (eds.), The Oxford Handbook of the Philosophy of Linguistics. Oxford University Press

View paper on

Share this with someone who'll enjoy it:

Title:Language Models as Models of Language

Paper and Code