
Title: Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Sep 23, 2021

Kamal Raj Kanakarajan, Bhuvana Kundumani, Malaikannan Sankarasubbu



Abstract: Recent progress in the Natural Language Processing domain has given us several State-of-the-Art (SOTA) pretrained models which can be finetuned for specific tasks. These large models, with billions of parameters and trained on numerous GPUs/TPUs over weeks, lead the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost- and time-effective smaller models trained on a single GPU. This will enable researchers with resource constraints to experiment with novel and innovative ideas on tokenization, pretraining tasks, architecture, fine-tuning methods, etc. We set up Small-Bench NLP, a benchmark for small, efficient neural language models trained on a single GPU. The Small-Bench NLP benchmark comprises eight NLP tasks on the publicly available GLUE datasets and a leaderboard to track the progress of the community. Our ELECTRA-DeBERTa (15M parameters) small model architecture achieves an average score of 81.53, which is comparable to BERT-Base's 82.20 (110M parameters). Our models, code, and leaderboard are available at https://github.com/smallbenchnlp
