Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fredrik Aas Andreassen

NorQuAD: Norwegian Question Answering Dataset

May 03, 2023

Sardana Ivanova, Fredrik Aas Andreassen, Matias Jentoft, Sondre Wold, Lilja Øvrelid

Figure 1 for NorQuAD: Norwegian Question Answering Dataset

Figure 2 for NorQuAD: Norwegian Question Answering Dataset

Figure 3 for NorQuAD: Norwegian Question Answering Dataset

Figure 4 for NorQuAD: Norwegian Question Answering Dataset

Abstract:In this paper we present NorQuAD: the first Norwegian question answering dataset for machine reading comprehension. The dataset consists of 4,752 manually created question-answer pairs. We here detail the data collection procedure and present statistics of the dataset. We also benchmark several multilingual and Norwegian monolingual language models on the dataset and compare them against human performance. The dataset will be made freely available.

* Accepted to NoDaLiDa 2023

Via

Access Paper or Ask Questions