Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Malik H. Altakrori

School of Computer Science -McGill University, Mila

Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Apr 26, 2024

Teresa Lynn, Malik H. Altakrori, Samar Mohamed Magdy, Rocktim Jyoti Das, Chenyang Lyu, Mohamed Nasr, Younes Samih, Alham Fikri Aji, Preslav Nakov, Shantanu Godbole(+3 more)

Figure 1 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Figure 2 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Figure 3 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Figure 4 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Abstract:The rapid evolution of Natural Language Processing (NLP) has favored major languages such as English, leaving a significant gap for many others due to limited resources. This is especially evident in the context of data annotation, a task whose importance cannot be underestimated, but which is time-consuming and costly. Thus, any dataset for resource-poor languages is precious, in particular when it is task-specific. Here, we explore the feasibility of repurposing existing datasets for a new NLP task: we repurposed the Belebele dataset (Bandarkar et al., 2023), which was designed for multiple-choice question answering (MCQA), to enable extractive QA (EQA) in the style of machine reading comprehension. We present annotation guidelines and a parallel EQA dataset for English and Modern Standard Arabic (MSA). We also present QA evaluation results for several monolingual and cross-lingual QA pairs including English, MSA, and five Arabic dialects. Our aim is to enable others to adapt our approach for the 120+ other language variants in Belebele, many of which are deemed under-resourced. We also conduct a thorough analysis and share our insights from the process, which we hope will contribute to a deeper understanding of the challenges and the opportunities associated with task reformulation in NLP research.

* Paper 8 pages, Appendix 12 pages. Submitted to ARR

Via

Access Paper or Ask Questions

The Topic Confusion Task: A Novel Scenario for Authorship Attribution

Apr 17, 2021

Malik H. Altakrori, Jackie Chi Kit Cheung, Benjamin C. M. Fung

Figure 1 for The Topic Confusion Task: A Novel Scenario for Authorship Attribution

Figure 2 for The Topic Confusion Task: A Novel Scenario for Authorship Attribution

Figure 3 for The Topic Confusion Task: A Novel Scenario for Authorship Attribution

Figure 4 for The Topic Confusion Task: A Novel Scenario for Authorship Attribution

Abstract:Authorship attribution is the problem of identifying the most plausible author of an anonymous text from a set of candidate authors. Researchers have investigated same-topic and cross-topic scenarios of authorship attribution, which differ according to whether unseen topics are used in the testing phase. However, neither scenario allows us to explain whether errors are caused by failure to capture authorship style, by the topic shift or by other factors. Motivated by this, we propose the \emph{topic confusion} task, where we switch the author-topic configuration between training and testing set. This setup allows us to probe errors in the attribution process. We investigate the accuracy and two error measures: one caused by the models' confusion by the switch because the features capture the topics, and one caused by the features' inability to capture the writing styles, leading to weaker models. By evaluating different features, we show that stylometric features with part-of-speech tags are less susceptible to topic variations and can increase the accuracy of the attribution process. We further show that combining them with word-level $n$-grams can outperform the state-of-the-art technique in the cross-topic scenario. Finally, we show that pretrained language models such as BERT and RoBERTa perform poorly on this task, and are outperformed by simple $n$-gram features.

* 17 pages (8 + ref./appin.), 6 figures, work in progress

Via

Access Paper or Ask Questions