Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul van Wamelen

The effects of data size on Automated Essay Scoring engines

Aug 30, 2021

Christopher Ormerod, Amir Jafari, Susan Lottridge, Milan Patel, Amy Harris, Paul van Wamelen

Figure 1 for The effects of data size on Automated Essay Scoring engines

Figure 2 for The effects of data size on Automated Essay Scoring engines

Figure 3 for The effects of data size on Automated Essay Scoring engines

Figure 4 for The effects of data size on Automated Essay Scoring engines

Abstract:We study the effects of data size and quality on the performance on Automated Essay Scoring (AES) engines that are designed in accordance with three different paradigms; A frequency and hand-crafted feature-based model, a recurrent neural network model, and a pretrained transformer-based language model that is fine-tuned for classification. We expect that each type of model benefits from the size and the quality of the training data in very different ways. Standard practices for developing training data for AES engines were established with feature-based methods in mind, however, since neural networks are increasingly being considered in a production setting, this work seeks to inform us as to how to establish better training data for neural networks that will be used in production.

* 14 pages, 3 figures, 5 tables

Via

Access Paper or Ask Questions