Title: GLGE: A New General Language Generation Evaluation Benchmark

Nov 24, 2020

Dayiheng Liu, Yu Yan, Yeyun Gong, Weizhen Qi, Hang Zhang, Jian Jiao, Weizhu Chen, Jie Fu, Linjun Shou, Ming Gong (+8 more)

Abstract: Multi-task benchmarks such as GLUE and SuperGLUE have driven great progress in pretraining and transfer learning in Natural Language Processing (NLP). These benchmarks mostly focus on a range of Natural Language Understanding (NLU) tasks and do not consider Natural Language Generation (NLG) models. In this paper, we present the General Language Generation Evaluation (GLGE), a new multi-task benchmark for evaluating the generalization capabilities of NLG models across eight language generation tasks. For each task, we further design three subtasks in terms of task difficulty (GLGE-Easy, GLGE-Medium, and GLGE-Hard). This introduces 24 subtasks to comprehensively compare model performance. To encourage research on pretraining and transfer learning for NLG models, we make GLGE publicly available and build a leaderboard with strong baselines including MASS, BART, and ProphetNet. The source code and dataset will be publicly available at https://github.com/microsoft/glge.

* 11 pages
