Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Luca Scharr

Cup Curriculum: Curriculum Learning on Model Capacity

Nov 07, 2023

Luca Scharr, Vanessa Toborek

Figure 1 for Cup Curriculum: Curriculum Learning on Model Capacity

Figure 2 for Cup Curriculum: Curriculum Learning on Model Capacity

Figure 3 for Cup Curriculum: Curriculum Learning on Model Capacity

Figure 4 for Cup Curriculum: Curriculum Learning on Model Capacity

Abstract:Curriculum learning (CL) aims to increase the performance of a learner on a given task by applying a specialized learning strategy. This strategy focuses on either the dataset, the task, or the model. There is little to no work analysing the possibilities to apply CL on the model capacity in natural language processing. To close this gap, we propose the cup curriculum. In a first phase of training we use a variation of iterative magnitude pruning to reduce model capacity. These weights are reintroduced in a second phase, resulting in the model capacity to show a cup-shaped curve over the training iterations. We empirically evaluate different strategies of the cup curriculum and show that it outperforms early stopping reliably while exhibiting a high resilience to overfitting.

* 14 pages, 5 figures, both including appendix, OPT 2023 workshop of NeurIPS 2023 conference

Via

Access Paper or Ask Questions