Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Systematic Assessment of Syntactic Generalization in Neural Language Models

May 23, 2020

Jennifer Hu, Jon Gauthier, Peng Qian, Ethan Wilcox, Roger P. Levy

Figure 1 for A Systematic Assessment of Syntactic Generalization in Neural Language Models

Figure 2 for A Systematic Assessment of Syntactic Generalization in Neural Language Models

Figure 3 for A Systematic Assessment of Syntactic Generalization in Neural Language Models

Figure 4 for A Systematic Assessment of Syntactic Generalization in Neural Language Models

Share this with someone who'll enjoy it:

Abstract:While state-of-the-art neural network models continue to achieve lower perplexity scores on language modeling benchmarks, it remains unknown whether optimizing for broad-coverage predictive performance leads to human-like syntactic knowledge. Furthermore, existing work has not provided a clear picture about the model properties required to produce proper syntactic generalizations. We present a systematic evaluation of the syntactic knowledge of neural language models, testing 20 combinations of model types and data sizes on a set of 34 English-language syntactic test suites. We find substantial differences in syntactic generalization performance by model architecture, with sequential models underperforming other architectures. Factorially manipulating model architecture and training dataset size (1M--40M words), we find that variability in syntactic generalization performance is substantially greater by architecture than by dataset size for the corpora tested in our experiments. Our results also reveal a dissociation between perplexity and syntactic generalization performance.

* To appear in the Proceedings of the Association for Computational Linguistics (ACL 2020)

View paper on

Share this with someone who'll enjoy it:

Title:A Systematic Assessment of Syntactic Generalization in Neural Language Models

Paper and Code